CuGenDBv2

Sgr017715 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr017715
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: AT-hook motif nuclear-localized protein
Genome location: tig00153055:250580..253376
RNA-Seq Expression: Sgr017715
Synteny: Sgr017715
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
GO:0003680 - AT DNA binding (molecular function)
InterPro domains: IPR005175 - PPC domain
IPR017956 - AT hook, DNA-binding motif
IPR039605 - AT-hook motif nuclear-localized protein
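
The genome location in this record follows a contig:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention that this page does not itself state), the string can be split into its parts and the gene span computed. The function name parse_location is hypothetical and is not part of CuGenDB.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'contig:start..end' into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)

contig, start, end = parse_location("tig00153055:250580..253376")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the gene spans 2797 bp on contig tig00153055.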


Homology
GenBank top hits (e value, % identity, alignment)
KAG7037975.1 AT-hook motif nuclear-localized protein 10 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.1e-151; identity: 84.27%)
Query:  MSGSETGVMTSSEPFSIGLQKS---SQPPVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS
        M+GSETGVMTS EPF+IG QKS   SQ  VL G+HL FGADGVYKP  + SP YQS GVGV+GNAGAD S REAFV++N QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFSIGLQKS---SQPPVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS

Query:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA
        MA+  A PS AATQSGGGFSPPPT   PSGG ASPT LKKARGRPPGS KKQQLD+LGSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANGA
Subjt:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA

Query:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPVT
        ISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGG KELKQANQIEQ PVT
Subjt:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPVT

Query:  APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK
        APHKLAPIRAGM GASSP SRG LSESSGG GSPFNQS GACNN       T SWK
Subjt:  APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK

XP_004145559.1 AT-hook motif nuclear-localized protein 10 [Cucumis sativus] (e value: 9.6e-157; identity: 85.99%)
Query:  MSGSETGVMTSSEPFSIGLQKSSQP---PVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS
        MSGSETGV++S E F+IGLQK+S P   PV+Q MHL FGADGVYKPVA ASP YQSS VGVAGNAGADGSAR+AFVN+N QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFSIGLQKSSQP---PVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS

Query:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSS-KKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+APA+   AATQS GGFSP PT  P SG  ASPTSLKK RGRPPGSS KK  LD+  SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSS-KKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGG KEL+Q NQIEQPPV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNN+      TI WK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK

XP_008453016.1 PREDICTED: AT-hook motif nuclear-localized protein 10 [Cucumis melo] (e value: 1.6e-156; identity: 85.99%)
Query:  MSGSETGVMTSSEPFSIGLQKS---SQPPVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS
        MSGSETGV++S E F+IGLQK+   SQ PV+Q MHL FGADGVYKPV AASP YQSS VGVAGNAGADGSAREAFVN+N QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFSIGLQKS---SQPPVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS

Query:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSS-KKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        M++APA+   AATQS GGFSP PT  P SGG  SPTSLKK RGRPPGSS KK QLDS  S GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSS-KKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD G KEL+Q NQIEQPPV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNN+      TI WK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK

XP_022981864.1 AT-hook motif nuclear-localized protein 10-like isoform X1 [Cucurbita maxima] (e value: 5.5e-152; identity: 84.55%)
Query:  MSGSETGVMTSSEPFSIGLQKS---SQPPVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS
        M+GSETGVMTS EPF+IG QKS   SQ  VL G+HL FGADGVYKP ++ SP YQS GVGV+GNAGAD S REAFV +N QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFSIGLQKS---SQPPVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS

Query:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA
        MA+A A PS AATQSGGGFSPPPT   PSGG ASPT LKKARGRPPGS KKQQLD+LGSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANGA
Subjt:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA

Query:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPVT
        ISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENGGQRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGG+KELKQANQIEQ PVT
Subjt:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPVT

Query:  APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK
        APHKLAPIRAGMTGASSP SRG LSESSGG GSPFNQS GACNN       T SWK
Subjt:  APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK

XP_038898092.1 AT-hook motif nuclear-localized protein 10-like [Benincasa hispida] (e value: 1.9e-160; identity: 88.8%)
Query:  MSGSETGVMTSSEPFSIGLQKS---SQPPVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS
        MSGSETGVM+S EPF+IGLQK+   SQ  V+QGMHL FGADGVYKPVAAASP YQSSGVGVAGNAGADGSAREAFVN+N QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFSIGLQKS---SQPPVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS

Query:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSS-KKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MALAPA+ S AATQ  GGFSPPPT  PPSGG ASPT LKKARGRPPGSS KKQQLD  GSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSS-KKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSF+TDGG KEL   NQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK
        TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQS GACNN NP     I WK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L0T7 AT-hook motif nuclear-localized protein (e value: 4.7e-157; identity: 85.99%)
Query:  MSGSETGVMTSSEPFSIGLQKSSQP---PVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS
        MSGSETGV++S E F+IGLQK+S P   PV+Q MHL FGADGVYKPVA ASP YQSS VGVAGNAGADGSAR+AFVN+N QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFSIGLQKSSQP---PVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS

Query:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSS-KKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+APA+   AATQS GGFSP PT  P SG  ASPTSLKK RGRPPGSS KK  LD+  SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSS-KKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGG KEL+Q NQIEQPPV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNN+      TI WK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK

A0A1S3BWC6 AT-hook motif nuclear-localized protein (e value: 7.9e-157; identity: 85.99%)
Query:  MSGSETGVMTSSEPFSIGLQKS---SQPPVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS
        MSGSETGV++S E F+IGLQK+   SQ PV+Q MHL FGADGVYKPV AASP YQSS VGVAGNAGADGSAREAFVN+N QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFSIGLQKS---SQPPVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS

Query:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSS-KKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        M++APA+   AATQS GGFSP PT  P SGG  SPTSLKK RGRPPGSS KK QLDS  S GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSS-KKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD G KEL+Q NQIEQPPV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNN+      TI WK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK

A0A5A7VAQ2 AT-hook motif nuclear-localized protein (e value: 7.9e-157; identity: 85.99%)
Query:  MSGSETGVMTSSEPFSIGLQKS---SQPPVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS
        MSGSETGV++S E F+IGLQK+   SQ PV+Q MHL FGADGVYKPV AASP YQSS VGVAGNAGADGSAREAFVN+N QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFSIGLQKS---SQPPVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS

Query:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSS-KKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        M++APA+   AATQS GGFSP PT  P SGG  SPTSLKK RGRPPGSS KK QLDS  S GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSS-KKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD G KEL+Q NQIEQPPV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPV

Query:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK
        +APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNN+      TI WK
Subjt:  TAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK

A0A6J1FR30 AT-hook motif nuclear-localized protein (e value: 3.8e-151; identity: 83.99%)
Query:  MSGSETGVMTSSEPFSIGLQKS---SQPPVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS
        M+GSETGVMTS EPF+IG QKS   SQ  VL G+HL FGADGVYKP ++ SP YQS  VGV+GNAGAD S REAFV++N QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFSIGLQKS---SQPPVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS

Query:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA
        MA+  A PS AATQSGGGFSPPPT   PSGG ASPT LKKARGRPPGS KKQQLD+LGSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANGA
Subjt:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA

Query:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPVT
        ISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGG KELKQANQIEQ PVT
Subjt:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPVT

Query:  APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK
        APHKLAPIRAGM GASSP SRG LSESSGG GSPFNQS GACNN       T SWK
Subjt:  APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK

A0A6J1IXR2 AT-hook motif nuclear-localized protein (e value: 2.6e-152; identity: 84.55%)
Query:  MSGSETGVMTSSEPFSIGLQKS---SQPPVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS
        M+GSETGVMTS EPF+IG QKS   SQ  VL G+HL FGADGVYKP ++ SP YQS GVGV+GNAGAD S REAFV +N QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFSIGLQKS---SQPPVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGS

Query:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA
        MA+A A PS AATQSGGGFSPPPT   PSGG ASPT LKKARGRPPGS KKQQLD+LGSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANGA
Subjt:  MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA

Query:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPVT
        ISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENGGQRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGG+KELKQANQIEQ PVT
Subjt:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPVT

Query:  APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK
        APHKLAPIRAGMTGASSP SRG LSESSGG GSPFNQS GACNN       T SWK
Subjt:  APHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGACNNSNPQGLTTISWK

SwissProt top hits (e value, % identity, alignment)
O22812 AT-hook motif nuclear-localized protein 10 (e value: 7.8e-77; identity: 53.59%)
Query:  MSGSETGVMTS---SEPFSIGLQK-----SSQPPVLQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKR
        MSGSETG+M +   S  F++ L +      +QP   Q   L+FG D    +YK P+ + SPP  YQ +  G       +    E+       SEPVK++R
Subjt:  MSGSETGVMTS---SEPFSIGLQK-----SSQPPVLQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKR

Query:  GRPRKYGPDG---SMALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQ-QLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFS
        GRPRKYGPD    S+ L P  PS   +Q            P SGG       +K RGRPPGSS K+ +L +LGS G+GFTPHV+TV AGEDVSSKIM+ +
Subjt:  GRPRKYGPDG---SMALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQ-QLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFS

Query:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGR
         NGPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG +
Subjt:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGR

Query:  KELKQANQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGACNNS
        +  +   Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS G   N+
Subjt:  KELKQANQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGACNNS

O80834 AT-hook motif nuclear-localized protein 9 (e value: 4.8e-50; identity: 45.26%)
Query:  GADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINM-------QSEPVKRKRGRPRKYGPDGSMALAPALPSVAATQSGGGFSPPPTTGPPSGG
        G+ G   P     P   ++    AG AG    A    + +NM          P+KRKRGRPRKYG DGS++LA +  SV             +T  P+  
Subjt:  GADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINM-------QSEPVKRKRGRPRKYGPDGSMALAPALPSVAATQSGGGFSPPPTTGPPSGG

Query:  PASPTSLKKARGRPPGSSKKQQLDSLG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILS
             S K+ RGRPPGS KKQ++ S+G     S+G+ FTPHVI V  GED++SK+++FSQ GPRA+C+L+A+GA+S  TL QP+ S G + YEGRFEIL+
Subjt:  PASPTSLKKARGRPPGSSKKQQLDSLG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILS

Query:  LSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDG----GRKELKQANQIEQ
        LS SY+++ +G  R+RTG LSVSL+ PDGRV+GG + G L AASPVQV+VGSF+        +K  ++A+++ Q
Subjt:  LSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDG----GRKELKQANQIEQ

Q8L7L5 AT-hook motif nuclear-localized protein 11 (e value: 4.1e-49; identity: 47.5%)
Query:  AGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDG---SMALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQQLDSLG---
        A +  +A  A V        VKRKRGRPRKYG DG   S+AL+P++ +V                       SP S K+ RGRPPGS KKQ+L S+G   
Subjt:  AGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDG---SMALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQQLDSLG---

Query:  --SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGR
          S G+ FTPHVI V  GED++SK++SFS  GPRA+C+L+A+GA+S  TL QPA S GT+ YEG FE++SLS SYL + +    +RTG L+VSL+ PDGR
Subjt:  --SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGR

Query:  VLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQ
        V+GGG+ G L AAS VQV+VGSF+    + ++K+  +  +
Subjt:  VLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQ

Q8VYJ2 AT-hook motif nuclear-localized protein 1 (e value: 9.6e-51; identity: 52.17%)
Query:  VKRKRGRPRKYGPDGS-MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQQLDSLG-----SAGVGFTPHVITVKAGEDVS
        +K+KRGRPRKYGPDG+ +AL+P   S A   S     PPP++       +   S  K       +    Q+++LG     S G  FTPH+ITV  GEDV+
Subjt:  VKRKRGRPRKYGPDGS-MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQQLDSLG-----SAGVGFTPHVITVKAGEDVS

Query:  SKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGS
         KI+SFSQ GPR++C+L+ANG IS+VTLRQP  SGGT+TYEGRFEILSLSGS++ +++GG RSRTGG+SVSL+ PDGRV+GGG+AGLL AASPVQVVVGS
Subjt:  SKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGS

Query:  FVTDGGRKELKQANQIEQPPVTAPHKLAPI
        F+     ++ K         +++P    PI
Subjt:  FVTDGGRKELKQANQIEQPPVTAPHKLAPI

Q940I0 AT-hook motif nuclear-localized protein 13 (e value: 2.1e-50; identity: 45.69%)
Query:  VGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDG----------SMALAPALPSVAATQSGGGFSPPPTTGPPSGGPA--SPTSLKKARGRPP
        +G  G+  +  + ++  +   +  + VK+KRGRPRKY  DG          ++ LAP  P  +A+ S GG +     G  +G  A  S    K+ RGRPP
Subjt:  VGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDG----------SMALAPALPSVAATQSGGGFSPPPTTGPPSGGPA--SPTSLKKARGRPP

Query:  GSSKKQQLDSL-GSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSG--GTVTYEGRFEILSLSGSYLLSENGGQRSRT
        GS KK QLD+L G+ GVGFTPHVI VK GED+++KI++F+  GPRA+CIL+A GA++NV LRQ   S   GTV YEGRFEI+SLSGS+L SE+ G  ++T
Subjt:  GSSKKQQLDSL-GSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSG--GTVTYEGRFEILSLSGSYLLSENGGQRSRT

Query:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQ--PPVTAPHKLAPIRAGMTGASSPPSRGT--LSESS--GGPGSPFNQ
        G LSVSL+G +GR++GG V G+L A S VQV+VGSFV D GRK+ + A + +    P +AP  +     G+ G  SP S+G    SESS      SP ++
Subjt:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQ--PPVTAPHKLAPIRAGMTGASSPPSRGT--LSESS--GGPGSPFNQ

Query:  SAGACNNSNPQGL
         +   NNSN  G+
Subjt:  SAGACNNSNPQGL

Arabidopsis top hits (e value, % identity, alignment)
AT2G33620.1 AT hook motif DNA-binding family protein (e value: 5.6e-78; identity: 53.59%)
Query:  MSGSETGVMTS---SEPFSIGLQK-----SSQPPVLQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKR
        MSGSETG+M +   S  F++ L +      +QP   Q   L+FG D    +YK P+ + SPP  YQ +  G       +    E+       SEPVK++R
Subjt:  MSGSETGVMTS---SEPFSIGLQK-----SSQPPVLQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKR

Query:  GRPRKYGPDG---SMALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQ-QLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFS
        GRPRKYGPD    S+ L P  PS   +Q            P SGG       +K RGRPPGSS K+ +L +LGS G+GFTPHV+TV AGEDVSSKIM+ +
Subjt:  GRPRKYGPDG---SMALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQ-QLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFS

Query:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGR
         NGPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG +
Subjt:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGR

Query:  KELKQANQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGACNNS
        +  +   Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS G   N+
Subjt:  KELKQANQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGACNNS

AT2G33620.2 AT hook motif DNA-binding family protein (e value: 5.6e-78; identity: 53.59%)
Query:  MSGSETGVMTS---SEPFSIGLQK-----SSQPPVLQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKR
        MSGSETG+M +   S  F++ L +      +QP   Q   L+FG D    +YK P+ + SPP  YQ +  G       +    E+       SEPVK++R
Subjt:  MSGSETGVMTS---SEPFSIGLQK-----SSQPPVLQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKR

Query:  GRPRKYGPDG---SMALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQ-QLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFS
        GRPRKYGPD    S+ L P  PS   +Q            P SGG       +K RGRPPGSS K+ +L +LGS G+GFTPHV+TV AGEDVSSKIM+ +
Subjt:  GRPRKYGPDG---SMALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQ-QLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFS

Query:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGR
         NGPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG +
Subjt:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGR

Query:  KELKQANQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGACNNS
        +  +   Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS G   N+
Subjt:  KELKQANQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGACNNS

AT2G33620.3 AT hook motif DNA-binding family protein (e value: 5.6e-78; identity: 53.59%)
Query:  MSGSETGVMTS---SEPFSIGLQK-----SSQPPVLQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKR
        MSGSETG+M +   S  F++ L +      +QP   Q   L+FG D    +YK P+ + SPP  YQ +  G       +    E+       SEPVK++R
Subjt:  MSGSETGVMTS---SEPFSIGLQK-----SSQPPVLQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKR

Query:  GRPRKYGPDG---SMALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQ-QLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFS
        GRPRKYGPD    S+ L P  PS   +Q            P SGG       +K RGRPPGSS K+ +L +LGS G+GFTPHV+TV AGEDVSSKIM+ +
Subjt:  GRPRKYGPDG---SMALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQ-QLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFS

Query:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGR
         NGPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG +
Subjt:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGR

Query:  KELKQANQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGACNNS
        +  +   Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS G   N+
Subjt:  KELKQANQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGACNNS

AT2G33620.4 AT hook motif DNA-binding family protein (e value: 5.6e-78; identity: 53.59%)
Query:  MSGSETGVMTS---SEPFSIGLQK-----SSQPPVLQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKR
        MSGSETG+M +   S  F++ L +      +QP   Q   L+FG D    +YK P+ + SPP  YQ +  G       +    E+       SEPVK++R
Subjt:  MSGSETGVMTS---SEPFSIGLQK-----SSQPPVLQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKR

Query:  GRPRKYGPDG---SMALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQ-QLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFS
        GRPRKYGPD    S+ L P  PS   +Q            P SGG       +K RGRPPGSS K+ +L +LGS G+GFTPHV+TV AGEDVSSKIM+ +
Subjt:  GRPRKYGPDG---SMALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQ-QLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFS

Query:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGR
         NGPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG +
Subjt:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGR

Query:  KELKQANQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGACNNS
        +  +   Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS G   N+
Subjt:  KELKQANQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGACNNS

AT4G12080.1 AT-hook motif nuclear-localized protein 1 (e value: 6.8e-52; identity: 52.17%)
Query:  VKRKRGRPRKYGPDGS-MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQQLDSLG-----SAGVGFTPHVITVKAGEDVS
        +K+KRGRPRKYGPDG+ +AL+P   S A   S     PPP++       +   S  K       +    Q+++LG     S G  FTPH+ITV  GEDV+
Subjt:  VKRKRGRPRKYGPDGS-MALAPALPSVAATQSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQQLDSLG-----SAGVGFTPHVITVKAGEDVS

Query:  SKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGS
         KI+SFSQ GPR++C+L+ANG IS+VTLRQP  SGGT+TYEGRFEILSLSGS++ +++GG RSRTGG+SVSL+ PDGRV+GGG+AGLL AASPVQVVVGS
Subjt:  SKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGS

Query:  FVTDGGRKELKQANQIEQPPVTAPHKLAPI
        F+     ++ K         +++P    PI
Subjt:  FVTDGGRKELKQANQIEQPPVTAPHKLAPI


Sequences
CDS sequence
ATGTCGGGATCTGAGACCGGAGTGATGACGAGCAGCGAACCCTTCAGCATCGGTCTCCAGAAGAGTTCACAACCGCCGGTCTTGCAGGGCATGCATTTGGCCTTCGGTGC
CGACGGTGTCTACAAGCCTGTCGCCGCCGCCTCACCCCCCTACCAGTCCTCCGGTGTTGGGGTCGCCGGTAATGCCGGTGCGGATGGATCTGCTCGTGAAGCTTTCGTTA
ACATAAATATGCAAAGCGAGCCTGTTAAGAGGAAGAGAGGGAGGCCCAGGAAGTATGGGCCAGATGGCAGTATGGCGCTAGCTCCTGCCCTCCCCTCCGTCGCCGCAACT
CAATCCGGTGGAGGTTTTTCTCCTCCACCCACCACCGGTCCTCCGTCGGGAGGACCAGCCTCTCCAACTTCTTTGAAGAAAGCCAGAGGCAGACCGCCTGGCTCTAGCAA
AAAGCAGCAATTAGATTCTTTGGGGTCAGCAGGAGTTGGATTTACCCCACATGTCATCACCGTGAAAGCTGGAGAGGATGTATCTTCGAAAATAATGTCATTTTCACAGA
ATGGTCCTAGAGCTGTTTGTATCCTTACAGCAAATGGAGCAATATCCAATGTGACTCTACGTCAGCCAGCCATGTCAGGTGGAACCGTGACTTACGAGGGGCGATTTGAG
ATTTTGTCATTATCTGGGTCATATCTCCTATCTGAAAATGGCGGTCAGCGGAGCCGAACTGGGGGTCTAAGTGTTTCATTGTCTGGACCAGATGGTAGAGTATTAGGTGG
TGGGGTGGCTGGTCTTCTAACTGCAGCGTCTCCTGTCCAGGTGGTGGTGGGGAGTTTCGTCACTGATGGGGGGCGCAAGGAATTGAAACAAGCAAACCAAATTGAACAGC
CGCCTGTTACTGCACCACACAAACTTGCTCCGATCCGTGCTGGAATGACGGGGGCCAGCAGCCCACCATCGCGTGGGACTCTCAGTGAATCCTCAGGAGGGCCTGGGAGT
CCTTTTAACCAGAGTGCTGGAGCCTGCAATAACAGTAACCCACAAGGCCTGACGACCATATCTTGGAAGTGA
mRNA sequence
ATGTCGGGATCTGAGACCGGAGTGATGACGAGCAGCGAACCCTTCAGCATCGGTCTCCAGAAGAGTTCACAACCGCCGGTCTTGCAGGGCATGCATTTGGCCTTCGGTGC
CGACGGTGTCTACAAGCCTGTCGCCGCCGCCTCACCCCCCTACCAGTCCTCCGGTGTTGGGGTCGCCGGTAATGCCGGTGCGGATGGATCTGCTCGTGAAGCTTTCGTTA
ACATAAATATGCAAAGCGAGCCTGTTAAGAGGAAGAGAGGGAGGCCCAGGAAGTATGGGCCAGATGGCAGTATGGCGCTAGCTCCTGCCCTCCCCTCCGTCGCCGCAACT
CAATCCGGTGGAGGTTTTTCTCCTCCACCCACCACCGGTCCTCCGTCGGGAGGACCAGCCTCTCCAACTTCTTTGAAGAAAGCCAGAGGCAGACCGCCTGGCTCTAGCAA
AAAGCAGCAATTAGATTCTTTGGGGTCAGCAGGAGTTGGATTTACCCCACATGTCATCACCGTGAAAGCTGGAGAGGATGTATCTTCGAAAATAATGTCATTTTCACAGA
ATGGTCCTAGAGCTGTTTGTATCCTTACAGCAAATGGAGCAATATCCAATGTGACTCTACGTCAGCCAGCCATGTCAGGTGGAACCGTGACTTACGAGGGGCGATTTGAG
ATTTTGTCATTATCTGGGTCATATCTCCTATCTGAAAATGGCGGTCAGCGGAGCCGAACTGGGGGTCTAAGTGTTTCATTGTCTGGACCAGATGGTAGAGTATTAGGTGG
TGGGGTGGCTGGTCTTCTAACTGCAGCGTCTCCTGTCCAGGTGGTGGTGGGGAGTTTCGTCACTGATGGGGGGCGCAAGGAATTGAAACAAGCAAACCAAATTGAACAGC
CGCCTGTTACTGCACCACACAAACTTGCTCCGATCCGTGCTGGAATGACGGGGGCCAGCAGCCCACCATCGCGTGGGACTCTCAGTGAATCCTCAGGAGGGCCTGGGAGT
CCTTTTAACCAGAGTGCTGGAGCCTGCAATAACAGTAACCCACAAGGCCTGACGACCATATCTTGGAAGTGA
Protein sequence
MSGSETGVMTSSEPFSIGLQKSSQPPVLQGMHLAFGADGVYKPVAAASPPYQSSGVGVAGNAGADGSAREAFVNINMQSEPVKRKRGRPRKYGPDGSMALAPALPSVAAT
QSGGGFSPPPTTGPPSGGPASPTSLKKARGRPPGSSKKQQLDSLGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFE
ILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGRKELKQANQIEQPPVTAPHKLAPIRAGMTGASSPPSRGTLSESSGGPGS
PFNQSAGACNNSNPQGLTTISWK
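
The CDS and protein sequences in this record can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code and a CDS that starts in frame at its first base. The helper translate is hypothetical and is not part of CuGenDB. It is shown on the first 30 and last 15 bases of the CDS rather than the full sequence.

```python
import itertools

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First ten codons and last five codons of the CDS listed in this record.
print(translate("ATGTCGGGATCTGAGACCGGAGTGATGACG"))
print(translate("ATATCTTGGAAGTGA"))
```

The two fragments translate to MSGSETGVMT and ISWK, matching the start and end of the protein sequence listed here.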