CuGenDBv2

Lsi04G024210 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi04G024210
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: AT-hook motif nuclear-localized protein
Genome location: chr04:31318792..31322400
RNA-Seq Expression: Lsi04G024210
Synteny: Lsi04G024210
Gene Ontology terms:
  GO:0005634 - nucleus (cellular component)
  GO:0003680 - AT DNA binding (molecular function)
InterPro domains:
  IPR005175 - PPC domain
  IPR017956 - AT hook, DNA-binding motif
  IPR039605 - AT-hook motif nuclear-localized protein
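
The genome location field above uses a chrom:start..end layout. As a minimal sketch, the string can be split into its parts and the locus span computed as follows. The function name parse_location is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr04:31318792..31322400")

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus covers 3609 bp, which is longer than the mRNA listed further down and is consistent with the gene containing introns.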


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7037975.1 AT-hook motif nuclear-localized protein 10 [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 6.4e-153 | identity: 85.47%
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        M+GSETGVM+SGEPFTIG QK+ VQSQQ V+ G+HLPFGADGVYKP  + SPTYQS GVGV+GNAGAD S REAFV+MN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+  A  S AATQS GGFSPPPT   PSGGSASPT LKKARGRPPG S KKQQLD  GSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGGHKEL+Q NQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        TAPHKLAPIRAGM GASSP SRG LSESSGG GSPFNQS  AC NNT SWK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

XP_004145559.1 AT-hook motif nuclear-localized protein 10 [Cucumis sativus] | e-value: 1.1e-171 | identity: 93.45%
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGV+SSGE FTIGLQKNSV SQQPVMQ MHLPFGADGVYKPVA ASPTYQSS VGVAGNAGADGSAR+AFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+APAVR  AATQSSGGFSP PTAAP SG SASPTSLKK RGRPPGSS+KK  LD S SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI WK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

XP_008453016.1 PREDICTED: AT-hook motif nuclear-localized protein 10 [Cucumis melo] | e-value: 3.6e-172 | identity: 93.45%
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGV+SSGE FTIGLQKNSVQSQQPVMQ MHLPFGADGVYKPV AASPTYQSS VGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        M++APAVR  AATQSSGGFSP PTAAP SGGS SPTSLKK RGRPPGSS+KK QLD S S GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD GHKELRQVNQIEQPPV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI WK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

XP_022981864.1 AT-hook motif nuclear-localized protein 10-like isoform X1 [Cucurbita maxima] | e-value: 1.9e-152 | identity: 85.47%
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        M+GSETGVM+SGEPFTIG QK+ VQSQQ V+ G+HLPFGADGVYKP ++ SPTYQS GVGV+GNAGAD S REAFV MN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+A A  S AATQS GGFSPPPT   PSGGSASPT LKKARGRPPG S KKQQLD  GSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENG QRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGG KEL+Q NQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        TAPHKLAPIRAGMTGASSP SRG LSESSGG GSPFNQS  AC NNT SWK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

XP_038898092.1 AT-hook motif nuclear-localized protein 10-like [Benincasa hispida] | e-value: 1.9e-176 | identity: 95.73%
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGVMSSGEPFTIGLQKNSVQSQQ VMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+APAVRS AATQ SGGFSPPPTAAPPSGGSASPT LKKARGRPPGSS+KKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSF+TDGGHKEL  VNQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQS  ACNNN I WK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0L0T7 AT-hook motif nuclear-localized protein | e-value: 5.1e-172 | identity: 93.45%
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGV+SSGE FTIGLQKNSV SQQPVMQ MHLPFGADGVYKPVA ASPTYQSS VGVAGNAGADGSAR+AFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+APAVR  AATQSSGGFSP PTAAP SG SASPTSLKK RGRPPGSS+KK  LD S SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI WK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

A0A1S3BWC6 AT-hook motif nuclear-localized protein | e-value: 1.8e-172 | identity: 93.45%
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGV+SSGE FTIGLQKNSVQSQQPVMQ MHLPFGADGVYKPV AASPTYQSS VGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        M++APAVR  AATQSSGGFSP PTAAP SGGS SPTSLKK RGRPPGSS+KK QLD S S GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD GHKELRQVNQIEQPPV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI WK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

A0A5A7VAQ2 AT-hook motif nuclear-localized protein | e-value: 1.8e-172 | identity: 93.45%
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        MSGSETGV+SSGE FTIGLQKNSVQSQQPVMQ MHLPFGADGVYKPV AASPTYQSS VGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        M++APAVR  AATQSSGGFSP PTAAP SGGS SPTSLKK RGRPPGSS+KK QLD S S GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD GHKELRQVNQIEQPPV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSA ACNNNTI WK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

A0A6J1FR30 AT-hook motif nuclear-localized protein | e-value: 1.2e-152 | identity: 85.19%
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        M+GSETGVM+SGEPFTIG QK+ VQSQQ V+ G+HLPFGADGVYKP ++ SPTYQS  VGV+GNAGAD S REAFV+MN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+  A  S AATQS GGFSPPPT   PSGGSASPT LKKARGRPPG S KKQQLD  GSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENG QRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGGHKEL+Q NQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        TAPHKLAPIRAGM GASSP SRG LSESSGG GSPFNQS  AC NNT SWK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

A0A6J1IXR2 AT-hook motif nuclear-localized protein | e-value: 9.0e-153 | identity: 85.47%
Query:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS
        M+GSETGVM+SGEPFTIG QK+ VQSQQ V+ G+HLPFGADGVYKP ++ SPTYQS GVGV+GNAGAD S REAFV MN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGS

Query:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+A A  S AATQS GGFSPPPT   PSGGSASPT LKKARGRPPG S KKQQLD  GSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANG
Subjt:  MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENG QRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGG KEL+Q NQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK
        TAPHKLAPIRAGMTGASSP SRG LSESSGG GSPFNQS  AC NNT SWK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAIACNNNTISWK

SwissProt top hits (e-value, % identity, alignment)
O22812 AT-hook motif nuclear-localized protein 10 | e-value: 1.7e-76 | identity: 53.41%
Query:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR
        MSGSETG+M++      FT+ L +    SQ    Q  + P  FG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN
        GRPRKYGPD G M++     +P+ T S           P SGG       +K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AGEDVSSKIM+ + N
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN

Query:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE
        GPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE

Query:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK
         + V Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI+  WK
Subjt:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK

O80834 AT-hook motif nuclear-localized protein 9 | e-value: 3.8e-47 | identity: 46.02%
Query:  GVMSSGEPFTIGLQKNSVQSQQPV--MQGMHLPF--GADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNS-----QSEPVKRKRGRPRKYGP
        G+  SG P   G    S Q QQ +  +   + PF  G+ G   P     P+  ++    AG AGA        VNM +        P+KRKRGRPRKYG 
Subjt:  GVMSSGEPFTIGLQKNSVQSQQPV--MQGMHLPF--GADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNS-----QSEPVKRKRGRPRKYGP

Query:  DGSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRA
        DGS+++A          SS   S   T  P +       S K+ RGRPPG S KKQ++   G     S+G+ FTPHVI V  GED++SK+++FSQ GPRA
Subjt:  DGSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRA

Query:  VCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV
        +C+L+A+GA+S  TL QP+ S G + YEGRFEIL+LS SY+++ +GS R+RTG LSVSL+ PDGRV+GG + G L AASPVQV+VGSF+
Subjt:  VCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV

Q8GXB3 AT-hook motif nuclear-localized protein 5 | e-value: 7.1e-46 | identity: 45.49%
Query:  SQSEPVKRKRGRPRKYGPDGSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSAS---PTSLKKARGRPPGSSSKKQQLDGSG-----SAGVGFTPHVIT
        S+   VK+KRGRPRKY PDG +++              G SP P  +  S  S+S   P + K+ARGRPPG + +KQ+L   G     SAG+ F PHVI+
Subjt:  SQSEPVKRKRGRPRKYGPDGSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSAS---PTSLKKARGRPPGSSSKKQQLDGSG-----SAGVGFTPHVIT

Query:  VKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAAS
        V +GED+ SK++SFSQ  PRA+CI++  G +S+VTLR+PA +  ++T+EGRFEILSL GSYL++E G  +SRTGGLSVSLSGP+G V+GGG+ G+L AAS
Subjt:  VKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAAS

Query:  PVQVVVGSFV-------TDGGHKELRQVNQIEQPPVTAPHKL----APIRAGMTGASSP---PSRGTLSESSGGPGS
         VQVV  SFV        +  +K ++Q  + +Q P  +  +     AP  A  TG  +P   P++G       G GS
Subjt:  PVQVVVGSFV-------TDGGHKELRQVNQIEQPPVTAPHKL----APIRAGMTGASSP---PSRGTLSESSGGPGS

Q8VYJ2 AT-hook motif nuclear-localized protein 1 | e-value: 1.8e-49 | identity: 50.42%
Query:  VKRKRGRPRKYGPDGS-MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSA--SPTSLKKARGRPPGSSSKKQ---QLDGSG-----SAGVGFTPHVITVK
        +K+KRGRPRKYGPDG+ +A++P   S A         P P+  PP          S K+++ +P  S ++ +   Q++  G     S G  FTPH+ITV 
Subjt:  VKRKRGRPRKYGPDGS-MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSA--SPTSLKKARGRPPGSSSKKQ---QLDGSG-----SAGVGFTPHVITVK

Query:  AGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPV
         GEDV+ KI+SFSQ GPR++C+L+ANG IS+VTLRQP  SGGT+TYEGRFEILSLSGS++ +++G  RSRTGG+SVSL+ PDGRV+GGG+AGLL AASPV
Subjt:  AGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPV

Query:  QVVVGSFVTDGGHKELRQVNQIEQPPVTAPHKLAPI
        QVVVGSF+    H++ +         +++P    PI
Subjt:  QVVVGSFVTDGGHKELRQVNQIEQPPVTAPHKLAPI

Q9FIR1 AT-hook motif nuclear-localized protein 8 | e-value: 1.1e-46 | identity: 46.36%
Query:  VNMNSQSEPVKRKRGRPRKYGPDGSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDG-SGSAGVGFTPHVITVKA
        ++  +Q   VK+KRGRPRKY PDGS+A+  A  SP  + +S  +           G++    +K+ RGRPPGSS  K+QLD   G++GVGFTPHVI V  
Subjt:  VNMNSQSEPVKRKRGRPRKYGPDGSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDG-SGSAGVGFTPHVITVKA

Query:  GEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQ
        GED++SK+M+FS  G R +CIL+A+GA+S V LRQ + S G VTYEGRFEI++LSGS L  E     +R+G LSV+L+GPDG ++GG V G L AA+ VQ
Subjt:  GEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQ

Query:  VVVGSFVTDGGHKELRQVN--QIEQP-PVTAPHKLAPIRAGMTGASSPPSRGTLSESSGGP
        V+VGSFV +    +   VN  + + P P +AP  +    +   G SS  S       SG P
Subjt:  VVVGSFVTDGGHKELRQVN--QIEQP-PVTAPHKLAPIRAGMTGASSPPSRGTLSESSGGP

Arabidopsis top hits (e-value, % identity, alignment)
AT2G33620.1 AT hook motif DNA-binding family protein | e-value: 1.2e-77 | identity: 53.41%
Query:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR
        MSGSETG+M++      FT+ L +    SQ    Q  + P  FG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN
        GRPRKYGPD G M++     +P+ T S           P SGG       +K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AGEDVSSKIM+ + N
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN

Query:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE
        GPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE

Query:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK
         + V Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI+  WK
Subjt:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK

AT2G33620.2 AT hook motif DNA-binding family protein | e-value: 1.2e-77 | identity: 53.41%
Query:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR
        MSGSETG+M++      FT+ L +    SQ    Q  + P  FG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN
        GRPRKYGPD G M++     +P+ T S           P SGG       +K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AGEDVSSKIM+ + N
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN

Query:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE
        GPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE

Query:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK
         + V Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI+  WK
Subjt:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK

AT2G33620.3 AT hook motif DNA-binding family protein | e-value: 1.2e-77 | identity: 53.41%
Query:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR
        MSGSETG+M++      FT+ L +    SQ    Q  + P  FG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN
        GRPRKYGPD G M++     +P+ T S           P SGG       +K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AGEDVSSKIM+ + N
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN

Query:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE
        GPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE

Query:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK
         + V Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI+  WK
Subjt:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK

AT2G33620.4 AT hook motif DNA-binding family protein | e-value: 1.2e-77 | identity: 53.41%
Query:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR
        MSGSETG+M++      FT+ L +    SQ    Q  + P  FG D    +YK P+ + SP   YQ +  G       +    E+     + SEPVK++R
Subjt:  MSGSETGVMSS---GEPFTIGLQKNSVQSQQPVMQGMHLP--FGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKR

Query:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN
        GRPRKYGPD G M++     +P+ T S           P SGG       +K RGRPPGSSSK+ +L   GS G+GFTPHV+TV AGEDVSSKIM+ + N
Subjt:  GRPRKYGPD-GSMAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQN

Query:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE
        GPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN  QRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG  + 
Subjt:  GPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKE

Query:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK
         + V Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS     NNTI+  WK
Subjt:  LRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAIACNNNTIS--WK

AT4G12080.1 AT-hook motif nuclear-localized protein 1 | e-value: 1.3e-50 | identity: 50.42%
Query:  VKRKRGRPRKYGPDGS-MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSA--SPTSLKKARGRPPGSSSKKQ---QLDGSG-----SAGVGFTPHVITVK
        +K+KRGRPRKYGPDG+ +A++P   S A         P P+  PP          S K+++ +P  S ++ +   Q++  G     S G  FTPH+ITV 
Subjt:  VKRKRGRPRKYGPDGS-MAMAPAVRSPAATQSSGGFSPPPTAAPPSGGSA--SPTSLKKARGRPPGSSSKKQ---QLDGSG-----SAGVGFTPHVITVK

Query:  AGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPV
         GEDV+ KI+SFSQ GPR++C+L+ANG IS+VTLRQP  SGGT+TYEGRFEILSLSGS++ +++G  RSRTGG+SVSL+ PDGRV+GGG+AGLL AASPV
Subjt:  AGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPV

Query:  QVVVGSFVTDGGHKELRQVNQIEQPPVTAPHKLAPI
        QVVVGSF+    H++ +         +++P    PI
Subjt:  QVVVGSFVTDGGHKELRQVNQIEQPPVTAPHKLAPI


Sequences
CDS sequence
ATGTCGGGATCCGAGACCGGAGTGATGTCCAGCGGCGAACCTTTCACCATCGGTCTCCAGAAGAATTCAGTACAGTCACAACAGCCGGTCATGCAGGGCATGCATTTACC
CTTCGGCGCCGATGGCGTCTACAAGCCCGTTGCCGCCGCCTCTCCCACCTACCAGTCCTCCGGCGTCGGAGTTGCCGGTAATGCTGGTGCCGATGGATCTGCTCGTGAAG
CTTTCGTGAACATGAATTCGCAAAGCGAGCCTGTAAAGAGGAAGAGAGGGAGGCCTCGGAAGTATGGGCCAGATGGCAGTATGGCGATGGCTCCTGCGGTCCGCTCTCCC
GCTGCAACTCAGTCGAGTGGAGGTTTTTCTCCTCCACCCACCGCCGCGCCTCCGTCGGGAGGATCAGCCTCTCCAACTTCTTTGAAGAAAGCCAGAGGCAGACCCCCTGG
CTCGTCTAGCAAAAAGCAGCAGTTGGATGGTTCGGGGTCAGCAGGAGTTGGATTTACCCCACATGTCATCACCGTAAAAGCTGGAGAGGATGTATCTTCGAAAATTATGT
CATTTTCACAGAATGGTCCTAGAGCTGTTTGTATCCTTACAGCAAATGGAGCAATATCTAATGTGACTCTACGTCAACCAGCCATGTCTGGTGGAACTGTAACTTACGAG
GGGCGATTCGAAATTTTGTCACTATCTGGGTCCTATCTCCTCTCCGAGAATGGCAGCCAGCGGAGTCGAACTGGAGGTCTAAGTGTTTCATTATCTGGACCAGATGGTAG
AGTATTAGGTGGTGGGGTGGCTGGTCTTCTAACGGCAGCATCTCCTGTACAGGTGGTCGTGGGGAGCTTCGTCACTGATGGGGGGCACAAGGAATTGAGACAAGTAAACC
AAATAGAACAGCCGCCTGTTACTGCACCCCATAAACTTGCTCCGATCCGTGCTGGAATGACGGGGGCCAGCAGCCCGCCATCACGTGGGACTCTCAGTGAATCCTCAGGA
GGGCCTGGGAGTCCGTTTAATCAGAGTGCTATAGCCTGCAATAACAACACCATATCTTGGAAGTGA
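
As a hedged sanity check on the CDS above, the sketch below translates its first 30 and last 15 nucleotides with the standard genetic code. It assumes the standard nuclear code applies to this gene. The names BASES, AMINO_ACIDS, CODON_TABLE, and translate are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code in the conventional compact layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGTCGGGATCCGAGACCGGAGTGATGTCC"  # first 30 nt of the CDS
cds_end = "ATATCTTGGAAGTGA"                   # last 15 nt of the CDS

print(translate(cds_start))  # MSGSETGVMS
print(translate(cds_end))    # ISWK*
```

The two translated fragments agree with the start (MSGSETGVMS) and end (ISWK) of the protein sequence listed on this page, with the final TGA read as the stop codon.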
mRNA sequence
GAGAATCTTTTACTGTGAAATTCAATTTTTTAAAGAAAAAATTGGAAAATAATTAGTTACTGAATGATGTGGGTCTTCTTCTTCCTGGAGTGGCGAGTGTTAAGATCTGT
GCTTGAAAACGAAAAATTGCTGCAGTTGTATGAAAGTTTTGAGGCTGTTAGGGTTTTTGTTCCACTTTTACTGATTTTGTTCCGAGATTTAAGGGAGTGGGACTTCCTTG
TATGGAATTTTAGTTCCAGCGAAGGAGAACAATGTCGGGATCCGAGACCGGAGTGATGTCCAGCGGCGAACCTTTCACCATCGGTCTCCAGAAGAATTCAGTACAGTCAC
AACAGCCGGTCATGCAGGGCATGCATTTACCCTTCGGCGCCGATGGCGTCTACAAGCCCGTTGCCGCCGCCTCTCCCACCTACCAGTCCTCCGGCGTCGGAGTTGCCGGT
AATGCTGGTGCCGATGGATCTGCTCGTGAAGCTTTCGTGAACATGAATTCGCAAAGCGAGCCTGTAAAGAGGAAGAGAGGGAGGCCTCGGAAGTATGGGCCAGATGGCAG
TATGGCGATGGCTCCTGCGGTCCGCTCTCCCGCTGCAACTCAGTCGAGTGGAGGTTTTTCTCCTCCACCCACCGCCGCGCCTCCGTCGGGAGGATCAGCCTCTCCAACTT
CTTTGAAGAAAGCCAGAGGCAGACCCCCTGGCTCGTCTAGCAAAAAGCAGCAGTTGGATGGTTCGGGGTCAGCAGGAGTTGGATTTACCCCACATGTCATCACCGTAAAA
GCTGGAGAGGATGTATCTTCGAAAATTATGTCATTTTCACAGAATGGTCCTAGAGCTGTTTGTATCCTTACAGCAAATGGAGCAATATCTAATGTGACTCTACGTCAACC
AGCCATGTCTGGTGGAACTGTAACTTACGAGGGGCGATTCGAAATTTTGTCACTATCTGGGTCCTATCTCCTCTCCGAGAATGGCAGCCAGCGGAGTCGAACTGGAGGTC
TAAGTGTTTCATTATCTGGACCAGATGGTAGAGTATTAGGTGGTGGGGTGGCTGGTCTTCTAACGGCAGCATCTCCTGTACAGGTGGTCGTGGGGAGCTTCGTCACTGAT
GGGGGGCACAAGGAATTGAGACAAGTAAACCAAATAGAACAGCCGCCTGTTACTGCACCCCATAAACTTGCTCCGATCCGTGCTGGAATGACGGGGGCCAGCAGCCCGCC
ATCACGTGGGACTCTCAGTGAATCCTCAGGAGGGCCTGGGAGTCCGTTTAATCAGAGTGCTATAGCCTGCAATAACAACACCATATCTTGGAAGTGAAACTCGAGTTGCC
TCGTTAACTCACTTGATTTGCTAATATGGCGGGCCAACATGGAATCTGGTGCTCTATTTTCTAATTTGTTCTAGATGCACCTGTAATTTATGCTAAGCTAAAGGTTAATG
TAACTCTTTCCAGTTTGGGTTGGATTGGAGCCTTGCCCTTATGCCCTGCTGGAAACGAAAGACAACACCGATAGTTTCAGTTAGGAATTAGAAGGTTGTGTGAAATAGCT
GTGTTTTGTTTTAGTATTATGAAGATAGTGGTGGCATTACAAATGATGAGTTTATTTCAATTACAGCGCTACGCTAGGTACTTTGATACTATATAATTAAGAGAGATTAT
TACTCAAGGATATTATTACAACGGGGTTTTTTCTTCCCTCAAACTCTTCTATCCGATTTCTTCAGCTATCTATCGTTTGCCAACAGTACATAGGTTGAAGTAAAGCGACT
GACC
Protein sequence
MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGSMAMAPAVRSP
AATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYE
GRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESSG
GPGSPFNQSAIACNNNTISWK
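
The record annotates this protein with an AT-hook DNA-binding motif (IPR017956). As a rough illustration only, the sketch below scans the protein sequence above for the Arg-Gly-Arg-Pro (RGRP) core commonly associated with AT-hooks. This plain string search is a simplifying assumption and is not how InterPro assigns the domain. The variable names are hypothetical.

```python
import re

# Protein sequence copied from the record above (four wrapped lines joined).
protein = (
    "MSGSETGVMSSGEPFTIGLQKNSVQSQQPVMQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGADGSAREAFVNMNSQSEPVKRKRGRPRKYGPDGSMAMAPAVRSP"
    "AATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSSKKQQLDGSGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYE"
    "GRFEILSLSGSYLLSENGSQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHKELRQVNQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESSG"
    "GPGSPFNQSAIACNNNTISWK"
)

# Assumption: the R-G-R-P tetrapeptide is used as a naive proxy for an AT-hook core.
core_positions = [m.start() + 1 for m in re.finditer("RGRP", protein)]  # 1-based

print(len(protein))     # 351
print(core_positions)   # [89, 142]
```

The two matches fall in the KRKRGRPRKYG and KKARGRPPGSS stretches, which are also the segments conserved across the homologs aligned in the Homology section.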