; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC11G200550 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC11G200550
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionAT-hook motif nuclear-localized protein
Genome locationCiama_Chr11:246183..249575
RNA-Seq ExpressionCaUC11G200550
SyntenyCaUC11G200550
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003680 - AT DNA binding (molecular function)
InterPro domainsIPR005175 - PPC domain
IPR017956 - AT hook, DNA-binding motif
IPR039605 - AT-hook motif nuclear-localized protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7037975.1 AT-hook motif nuclear-localized protein 10 [Cucurbita argyrosperma subsp. argyrosperma]2.6e-14783.77Show/hide
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        M+GSETGV++SGEPFTIG  K+ VQSQQ V+ G+HL FGADGVYKP  + SPTYQS GVGV+GNAG+D S REAFV+MN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+  A  S AATQS GGFSPPPT   PSGGSA+ T LKKARGRPPG S KKQQLD  GSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DG HKEL+Q NQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNN
        TAPHKLAPIRAGM GASSP SRG LSESSGG GSPFNQS  ACNN
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNN

XP_004145559.1 AT-hook motif nuclear-localized protein 10 [Cucumis sativus]1.3e-16591.95Show/hide
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGVISSGE FTIGL KNSV SQQPVMQ MHL FGADGVYKPVA ASPTYQSS VGVAGNAG+DGSAR+AFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+APAVR  AATQSSGGFSP PTAAP SG SA+ TSLKK RGRPPGSS+KK  LD S SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDG HKELRQVNQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI

XP_008453016.1 PREDICTED: AT-hook motif nuclear-localized protein 10 [Cucumis melo]4.3e-16691.95Show/hide
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGVISSGE FTIGL KNSVQSQQPVMQ MHL FGADGVYKPV AASPTYQSS VGVAGNAG+DGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        M++APAVR  AATQSSGGFSP PTAAP SGGS + TSLKK RGRPPGSS+KK QLD S S GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD  HKELRQVNQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI

XP_022981864.1 AT-hook motif nuclear-localized protein 10-like isoform X1 [Cucurbita maxima]7.7e-14783.77Show/hide
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        M+GSETGV++SGEPFTIG  K+ VQSQQ V+ G+HL FGADGVYKP ++ SPTYQS GVGV+GNAG+D S REAFV MN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+A A  S AATQS GGFSPPPT   PSGGSA+ T LKKARGRPPG S KKQQLD  GSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENG QRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DG  KEL+Q NQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNN
        TAPHKLAPIRAGMTGASSP SRG LSESSGG GSPFNQS  ACNN
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNN

XP_038898092.1 AT-hook motif nuclear-localized protein 10-like [Benincasa hispida]1.7e-17093.97Show/hide
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGV+SSGEPFTIGL KNSVQSQQ VMQGMHL FGADGVYKPVAAASPTYQSSGVGVAGNAG+DGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+APAVRS AATQ SGGFSPPPTAAPPSGGSA+ T LKKARGRPPGSS+KKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSF+TDG HKEL  VNQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI
        TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQS  ACNNN I
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI

TrEMBL top hitse value%identityAlignment
A0A0A0L0T7 AT-hook motif nuclear-localized protein6.1e-16691.95Show/hide
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGVISSGE FTIGL KNSV SQQPVMQ MHL FGADGVYKPVA ASPTYQSS VGVAGNAG+DGSAR+AFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+APAVR  AATQSSGGFSP PTAAP SG SA+ TSLKK RGRPPGSS+KK  LD S SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDG HKELRQVNQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI

A0A1S3BWC6 AT-hook motif nuclear-localized protein2.1e-16691.95Show/hide
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGVISSGE FTIGL KNSVQSQQPVMQ MHL FGADGVYKPV AASPTYQSS VGVAGNAG+DGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        M++APAVR  AATQSSGGFSP PTAAP SGGS + TSLKK RGRPPGSS+KK QLD S S GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD  HKELRQVNQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI

A0A5A7VAQ2 AT-hook motif nuclear-localized protein2.1e-16691.95Show/hide
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGVISSGE FTIGL KNSVQSQQPVMQ MHL FGADGVYKPV AASPTYQSS VGVAGNAG+DGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        M++APAVR  AATQSSGGFSP PTAAP SGGS + TSLKK RGRPPGSS+KK QLD S S GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD  HKELRQVNQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTI

A0A6J1FR30 AT-hook motif nuclear-localized protein4.9e-14783.48Show/hide
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        M+GSETGV++SGEPFTIG  K+ VQSQQ V+ G+HL FGADGVYKP ++ SPTYQS  VGV+GNAG+D S REAFV+MN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+  A  S AATQS GGFSPPPT   PSGGSA+ T LKKARGRPPG S KKQQLD  GSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DG HKEL+Q NQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNN
        TAPHKLAPIRAGM GASSP SRG LSESSGG GSPFNQS  ACNN
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNN

A0A6J1IXR2 AT-hook motif nuclear-localized protein3.7e-14783.77Show/hide
Query:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        M+GSETGV++SGEPFTIG  K+ VQSQQ V+ G+HL FGADGVYKP ++ SPTYQS GVGV+GNAG+D S REAFV MN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+A A  S AATQS GGFSPPPT   PSGGSA+ T LKKARGRPPG S KKQQLD  GSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENG QRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DG  KEL+Q NQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNN
        TAPHKLAPIRAGMTGASSP SRG LSESSGG GSPFNQS  ACNN
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNN

SwissProt top hitse value%identityAlignment
O22812 AT-hook motif nuclear-localized protein 106.0e-7853.97Show/hide
Query:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMH--LSFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKR
        MSGSETG++++      FT+ LH+    SQ    Q  +  LSFG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMH--LSFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN
        GRPRKYGPD G M++     +P+ T S           P SGG   +    K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AGEDVSSKIM+ + N
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN

Query:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE
        GPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE

Query:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP
         + V Q+  S    P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI  P
Subjt:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP

O80834 AT-hook motif nuclear-localized protein 99.4e-4744.56Show/hide
Query:  GVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNS-----QSEPVKRKRGRPRKYGPDGSM
        G+  SG P   G  +     +    Q      G+ G   P     P+  ++    AG AG+        VNM +        P+KRKRGRPRKYG DGS+
Subjt:  GVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNS-----QSEPVKRKRGRPRKYGPDGSM

Query:  AMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL
        ++A          SS   S   T  P         S K+ RGRPPG S KKQ++   G     S+G+ FTPHVI V  GED++SK+++FSQ GPRA+C+L
Subjt:  AMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL

Query:  TANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV
        +A+GA+S  TL QP+ S G + YEGRFEIL+LS SY+++ +GS R+RTG LSVSL+ PDGRV+GG + G L AASPVQV+VGSF+
Subjt:  TANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV

Q8VYJ2 AT-hook motif nuclear-localized protein 16.9e-5050.42Show/hide
Query:  VKRKRGRPRKYGPDGS-MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSAT--QTSLKKARGRPPGSSSKKQ---QLDGSG-----SAGVGFTPHVITVK
        +K+KRGRPRKYGPDG+ +A++P   S A         P P+  PP          S K+++ +P  S ++ +   Q++  G     S G  FTPH+ITV 
Subjt:  VKRKRGRPRKYGPDGS-MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSAT--QTSLKKARGRPPGSSSKKQ---QLDGSG-----SAGVGFTPHVITVK

Query:  AGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPV
         GEDV+ KI+SFSQ GPR++C+L+ANG IS+VTLRQP  SGGT+TYEGRFEILSLSGS++ +++G  RSRTGG+SVSL+ PDGRV+GGG+AGLL AASPV
Subjt:  AGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPV

Query:  QVVVGSFVTDGVHKELRQVNQIEQSPVTAPHKLAPI
        QVVVGSF+    H++ +         +++P    PI
Subjt:  QVVVGSFVTDGVHKELRQVNQIEQSPVTAPHKLAPI

Q940I0 AT-hook motif nuclear-localized protein 131.6e-4642.3Show/hide
Query:  VGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDG----------SMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTS--LKKARGRPP
        +G  G+  S  + ++  +      + VK+KRGRPRKY  DG          ++ +AP    P+A+ S GG +        +G +A  +    K+ RGRPP
Subjt:  VGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDG----------SMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTS--LKKARGRPP

Query:  GSSSKKQQLDG-SGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSG--GTVTYEGRFEILSLSGSYLLSENGSQRSR
        GS   K+QLD   G+ GVGFTPHVI VK GED+++KI++F+  GPRA+CIL+A GA++NV LRQ   S   GTV YEGRFEI+SLSGS+L SE+    ++
Subjt:  GSSSKKQQLDG-SGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSG--GTVTYEGRFEILSLSGSYLLSENGSQRSR

Query:  TGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQS--PVTAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAI
        TG LSVSL+G +GR++GG V G+L A S VQV+VGSFV DG  K+ +   + + +  P +AP  +     G+ G  SP S+G    S     +  N    
Subjt:  TGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQS--PVTAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAI

Query:  ACNNN
          +NN
Subjt:  ACNNN

Q9FIR1 AT-hook motif nuclear-localized protein 84.2e-4744.04Show/hide
Query:  TYQSSGVGVAGNAGSDGSAREAF-VNMNSQSEPVKRKRGRPRKYGPDGSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSK
        T +S G G     GS  S    F ++  +Q   VK+KRGRPRKY PDGS+A+  A  SP  + +S  +           G++    +K+ RGRPPGSS  
Subjt:  TYQSSGVGVAGNAGSDGSAREAF-VNMNSQSEPVKRKRGRPRKYGPDGSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSK

Query:  KQQLDG-SGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVS
        K+QLD   G++GVGFTPHVI V  GED++SK+M+FS  G R +CIL+A+GA+S V LRQ + S G VTYEGRFEI++LSGS L  E     +R+G LSV+
Subjt:  KQQLDG-SGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVS

Query:  LSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQI---EQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNT
        L+GPDG ++GG V G L AA+ VQV+VGSFV +    +   VN        P +AP  +    +   G SS  S       SG P    +      NNN 
Subjt:  LSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQI---EQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNT

Query:  IY
        IY
Subjt:  IY

Arabidopsis top hitse value%identityAlignment
AT2G33620.1 AT hook motif DNA-binding family protein4.3e-7953.97Show/hide
Query:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMH--LSFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKR
        MSGSETG++++      FT+ LH+    SQ    Q  +  LSFG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMH--LSFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN
        GRPRKYGPD G M++     +P+ T S           P SGG   +    K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AGEDVSSKIM+ + N
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN

Query:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE
        GPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE

Query:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP
         + V Q+  S    P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI  P
Subjt:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP

AT2G33620.2 AT hook motif DNA-binding family protein4.3e-7953.97Show/hide
Query:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMH--LSFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKR
        MSGSETG++++      FT+ LH+    SQ    Q  +  LSFG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMH--LSFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN
        GRPRKYGPD G M++     +P+ T S           P SGG   +    K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AGEDVSSKIM+ + N
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN

Query:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE
        GPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE

Query:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP
         + V Q+  S    P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI  P
Subjt:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP

AT2G33620.3 AT hook motif DNA-binding family protein4.3e-7953.97Show/hide
Query:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMH--LSFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKR
        MSGSETG++++      FT+ LH+    SQ    Q  +  LSFG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMH--LSFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN
        GRPRKYGPD G M++     +P+ T S           P SGG   +    K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AGEDVSSKIM+ + N
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN

Query:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE
        GPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE

Query:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP
         + V Q+  S    P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI  P
Subjt:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP

AT2G33620.4 AT hook motif DNA-binding family protein4.3e-7953.97Show/hide
Query:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMH--LSFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKR
        MSGSETG++++      FT+ LH+    SQ    Q  +  LSFG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVISS---GEPFTIGLHKNSVQSQQPVMQGMH--LSFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN
        GRPRKYGPD G M++     +P+ T S           P SGG   +    K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AGEDVSSKIM+ + N
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN

Query:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE
        GPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKE

Query:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP
         + V Q+  S    P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI  P
Subjt:  LRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIYAP

AT4G12080.1 AT-hook motif nuclear-localized protein 14.9e-5150.42Show/hide
Query:  VKRKRGRPRKYGPDGS-MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSAT--QTSLKKARGRPPGSSSKKQ---QLDGSG-----SAGVGFTPHVITVK
        +K+KRGRPRKYGPDG+ +A++P   S A         P P+  PP          S K+++ +P  S ++ +   Q++  G     S G  FTPH+ITV 
Subjt:  VKRKRGRPRKYGPDGS-MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSAT--QTSLKKARGRPPGSSSKKQ---QLDGSG-----SAGVGFTPHVITVK

Query:  AGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPV
         GEDV+ KI+SFSQ GPR++C+L+ANG IS+VTLRQP  SGGT+TYEGRFEILSLSGS++ +++G  RSRTGG+SVSL+ PDGRV+GGG+AGLL AASPV
Subjt:  AGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPV

Query:  QVVVGSFVTDGVHKELRQVNQIEQSPVTAPHKLAPI
        QVVVGSF+    H++ +         +++P    PI
Subjt:  QVVVGSFVTDGVHKELRQVNQIEQSPVTAPHKLAPI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGGATCTGAGACCGGAGTGATTTCCAGCGGCGAACCCTTCACCATCGGTCTCCACAAGAATTCAGTACAGTCACAACAGCCGGTCATGCAGGGCATGCATTTATC
CTTCGGCGCCGATGGCGTCTACAAGCCCGTTGCCGCCGCCTCTCCCACCTACCAGTCCTCCGGCGTCGGAGTTGCCGGTAATGCTGGTTCCGATGGATCTGCTCGTGAAG
CTTTCGTTAACATGAATTCGCAAAGCGAGCCTGTAAAGAGGAAGAGAGGGAGGCCTCGGAAGTATGGGCCAGATGGCAGTATGGCAATGGCTCCTGCAGTCCGCTCTCCC
GCCGCAACTCAGTCCAGTGGAGGTTTTTCTCCTCCACCCACCGCCGCGCCTCCGTCGGGAGGATCAGCCACTCAAACTTCTTTGAAGAAAGCCAGAGGCAGACCCCCTGG
CTCTTCTAGCAAAAAGCAGCAGTTGGATGGCTCGGGGTCAGCAGGAGTTGGATTTACCCCACATGTCATCACCGTAAAAGCTGGAGAGGATGTATCTTCGAAAATAATGT
CATTTTCACAGAATGGTCCTAGAGCTGTTTGTATCCTTACAGCAAATGGAGCAATATCCAATGTGACTCTACGTCAACCAGCCATGTCTGGTGGAACCGTGACTTACGAG
GGGCGATTCGAGATTTTGTCGCTATCTGGGTCCTATCTCCTCTCTGAGAATGGCAGCCAGCGGAGTCGAACTGGAGGTCTAAGTGTTTCATTATCTGGTCCAGATGGTAG
AGTATTAGGTGGTGGGGTGGCTGGTCTTCTAACGGCAGCGTCTCCTGTACAGGTGGTCGTGGGGAGCTTCGTCACTGATGGGGTGCACAAGGAATTGAGACAAGTAAACC
AAATAGAACAGTCCCCTGTTACTGCACCCCATAAACTTGCTCCAATCCGTGCTGGAATGACGGGAGCCAGCAGCCCGCCATCACGTGGGACTCTCAGTGAATCCTCAGGA
GGGCCCGGGAGTCCGTTTAATCAGAGTGCTATAGCCTGCAATAATAACACCATATATGCACCTGTAATTTATGCTAAGCTAAAGGATATTGTAACTCTTTCCAGTTTGGG
ATGGCTTGGAACCTTGGCCTTGTGCCCGCTGGAAACGAAAGACAGACAGACGACACAGATGGTTTCAGTTAGGAATTAG
mRNA sequenceShow/hide mRNA sequence
TTTTTTTTTAAAAAAGAAAAATTTGGAAAATAATTAGTTACTGAATGATGTGGATCTTCTTCTTCCCGGAGTGGCGAGTGCTAAGATCTGTGCTTGAAAACGGAAAATTG
CTGCAGTTTTATGAAAGTTTATTTGGCTGTTAGGGTTTTTGTTCGAATTTTACTGATTTTGTTCCGAGATTTAAGGTCGGGGAGTGGGACTTCCTTGTATGGAATTTTAG
TTCCAGCGAAGGAGAACAATGTCGGGATCTGAGACCGGAGTGATTTCCAGCGGCGAACCCTTCACCATCGGTCTCCACAAGAATTCAGTACAGTCACAACAGCCGGTCAT
GCAGGGCATGCATTTATCCTTCGGCGCCGATGGCGTCTACAAGCCCGTTGCCGCCGCCTCTCCCACCTACCAGTCCTCCGGCGTCGGAGTTGCCGGTAATGCTGGTTCCG
ATGGATCTGCTCGTGAAGCTTTCGTTAACATGAATTCGCAAAGCGAGCCTGTAAAGAGGAAGAGAGGGAGGCCTCGGAAGTATGGGCCAGATGGCAGTATGGCAATGGCT
CCTGCAGTCCGCTCTCCCGCCGCAACTCAGTCCAGTGGAGGTTTTTCTCCTCCACCCACCGCCGCGCCTCCGTCGGGAGGATCAGCCACTCAAACTTCTTTGAAGAAAGC
CAGAGGCAGACCCCCTGGCTCTTCTAGCAAAAAGCAGCAGTTGGATGGCTCGGGGTCAGCAGGAGTTGGATTTACCCCACATGTCATCACCGTAAAAGCTGGAGAGGATG
TATCTTCGAAAATAATGTCATTTTCACAGAATGGTCCTAGAGCTGTTTGTATCCTTACAGCAAATGGAGCAATATCCAATGTGACTCTACGTCAACCAGCCATGTCTGGT
GGAACCGTGACTTACGAGGGGCGATTCGAGATTTTGTCGCTATCTGGGTCCTATCTCCTCTCTGAGAATGGCAGCCAGCGGAGTCGAACTGGAGGTCTAAGTGTTTCATT
ATCTGGTCCAGATGGTAGAGTATTAGGTGGTGGGGTGGCTGGTCTTCTAACGGCAGCGTCTCCTGTACAGGTGGTCGTGGGGAGCTTCGTCACTGATGGGGTGCACAAGG
AATTGAGACAAGTAAACCAAATAGAACAGTCCCCTGTTACTGCACCCCATAAACTTGCTCCAATCCGTGCTGGAATGACGGGAGCCAGCAGCCCGCCATCACGTGGGACT
CTCAGTGAATCCTCAGGAGGGCCCGGGAGTCCGTTTAATCAGAGTGCTATAGCCTGCAATAATAACACCATATATGCACCTGTAATTTATGCTAAGCTAAAGGATATTGT
AACTCTTTCCAGTTTGGGATGGCTTGGAACCTTGGCCTTGTGCCCGCTGGAAACGAAAGACAGACAGACGACACAGATGGTTTCAGTTAGGAATTAG
Protein sequenceShow/hide protein sequence
MSGSETGVISSGEPFTIGLHKNSVQSQQPVMQGMHLSFGADGVYKPVAAASPTYQSSGVGVAGNAGSDGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGSMAMAPAVRSP
AATQSSGGFSPPPTAAPPSGGSATQTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYE
GRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGVHKELRQVNQIEQSPVTAPHKLAPIRAGMTGASSPPSRGTLSESSG
GPGSPFNQSAIACNNNTIYAPVIYAKLKDIVTLSSLGWLGTLALCPLETKDRQTTQMVSVRN