; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS021762 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS021762
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionAT-hook motif nuclear-localized protein
Genome locationscaffold1:231362..234127
RNA-Seq ExpressionMS021762
SyntenyMS021762
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003680 - AT DNA binding (molecular function)
InterPro domainsIPR005175 - PPC domain
IPR017956 - AT hook, DNA-binding motif
IPR039605 - AT-hook motif nuclear-localized protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7037975.1 AT-hook motif nuclear-localized protein 10 [Cucurbita argyrosperma subsp. argyrosperma]6.1e-15184.64Show/hide
Query:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS
        M+GSETGVMTS EPFTIG QKSPVQSQQ +L G+HL FGADGVYKP  + SP YQS GVGV GNAGAD S REAFV++N Q+EPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS

Query:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA
        MA++ A PSAAATQS GGFSPPPT    SGG ASPT +KKARGRPPGS KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANGA
Subjt:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA

Query:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPAS
        ISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DG  KEL+QANQIEQ P +
Subjt:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPAS

Query:  GPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNS
         PHKLAPIRAGM GASSP SRG LSESSGG GSPFNQS G CNN+
Subjt:  GPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNS

XP_004145559.1 AT-hook motif nuclear-localized protein 10 [Cucumis sativus]2.6e-15785.15Show/hide
Query:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS
        MSGSETGV++S E FTIGLQK+ V SQQP++Q MHL FGADGVYKPVA ASP YQSS VGV GNAGADGSAR+AFVN+N Q+EPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS

Query:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA++PA+  AAATQSSGGFSP PTA P SG  ASPTS+KK RGRPPGSS KK  LD   SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPA
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DG  KEL+Q NQIEQPP 
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPA

Query:  SGPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNSNPQGMTAIPWK
        S PHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAG CNN+       IPWK
Subjt:  SGPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNSNPQGMTAIPWK

XP_008453016.1 PREDICTED: AT-hook motif nuclear-localized protein 10 [Cucumis melo]1.1e-15785.15Show/hide
Query:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS
        MSGSETGV++S E FTIGLQK+ VQSQQP++Q MHL FGADGVYKPV AASP YQSS VGV GNAGADGSAREAFVN+N Q+EPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS

Query:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        M+++PA+  AAATQSSGGFSP PTA P SGG  SPTS+KK RGRPPGSS KK QLD+  S GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPA
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV D   KEL+Q NQIEQPP 
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPA

Query:  SGPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNSNPQGMTAIPWK
        S PHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAG CNN+       IPWK
Subjt:  SGPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNSNPQGMTAIPWK

XP_022981864.1 AT-hook motif nuclear-localized protein 10-like isoform X1 [Cucurbita maxima]4.7e-15184.64Show/hide
Query:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS
        M+GSETGVMTS EPFTIG QKSPVQSQQ +L G+HL FGADGVYKP ++ SP YQS GVGV GNAGAD S REAFV +N Q+EPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS

Query:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA
        MA++ A PSAAATQS GGFSPPPT    SGG ASPT +KKARGRPPGS KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANGA
Subjt:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA

Query:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPAS
        ISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENGGQRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DG +KEL+QANQIEQ P +
Subjt:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPAS

Query:  GPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNS
         PHKLAPIRAGMTGASSP SRG LSESSGG GSPFNQS G CNN+
Subjt:  GPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNS

XP_038898092.1 AT-hook motif nuclear-localized protein 10-like [Benincasa hispida]8.5e-16187.68Show/hide
Query:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS
        MSGSETGVM+S EPFTIGLQK+ VQSQQ ++QGMHL FGADGVYKPVAAASP YQSSGVGV GNAGADGSAREAFVN+N Q+EPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS

Query:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MAL+PA+ SAAATQ SGGFSPPPTA P SGG ASPT +KKARGRPPGSS KKQQLD  GSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPA
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSF+ DG  KEL   NQIEQ P 
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPA

Query:  SGPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNSNPQGMTAIPWK
        + PHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQS G CNN NP     IPWK
Subjt:  SGPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNSNPQGMTAIPWK

TrEMBL top hitse value%identityAlignment
A0A0A0L0T7 AT-hook motif nuclear-localized protein1.2e-15785.15Show/hide
Query:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS
        MSGSETGV++S E FTIGLQK+ V SQQP++Q MHL FGADGVYKPVA ASP YQSS VGV GNAGADGSAR+AFVN+N Q+EPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS

Query:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA++PA+  AAATQSSGGFSP PTA P SG  ASPTS+KK RGRPPGSS KK  LD   SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPA
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DG  KEL+Q NQIEQPP 
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPA

Query:  SGPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNSNPQGMTAIPWK
        S PHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAG CNN+       IPWK
Subjt:  SGPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNSNPQGMTAIPWK

A0A1S3BWC6 AT-hook motif nuclear-localized protein5.5e-15885.15Show/hide
Query:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS
        MSGSETGV++S E FTIGLQK+ VQSQQP++Q MHL FGADGVYKPV AASP YQSS VGV GNAGADGSAREAFVN+N Q+EPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS

Query:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        M+++PA+  AAATQSSGGFSP PTA P SGG  SPTS+KK RGRPPGSS KK QLD+  S GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPA
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV D   KEL+Q NQIEQPP 
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPA

Query:  SGPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNSNPQGMTAIPWK
        S PHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAG CNN+       IPWK
Subjt:  SGPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNSNPQGMTAIPWK

A0A5A7VAQ2 AT-hook motif nuclear-localized protein5.5e-15885.15Show/hide
Query:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS
        MSGSETGV++S E FTIGLQK+ VQSQQP++Q MHL FGADGVYKPV AASP YQSS VGV GNAGADGSAREAFVN+N Q+EPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS

Query:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        M+++PA+  AAATQSSGGFSP PTA P SGG  SPTS+KK RGRPPGSS KK QLD+  S GVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPA
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV D   KEL+Q NQIEQPP 
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPA

Query:  SGPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNSNPQGMTAIPWK
        S PHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAG CNN+       IPWK
Subjt:  SGPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNSNPQGMTAIPWK

A0A6J1FR30 AT-hook motif nuclear-localized protein1.1e-15084.35Show/hide
Query:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS
        M+GSETGVMTS EPFTIG QKSPVQSQQ +L G+HL FGADGVYKP ++ SP YQS  VGV GNAGAD S REAFV++N Q+EPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS

Query:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA
        MA++ A PSAAATQS GGFSPPPT    SGG ASPT +KKARGRPPGS KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANGA
Subjt:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA

Query:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPAS
        ISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DG  KEL+QANQIEQ P +
Subjt:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPAS

Query:  GPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNS
         PHKLAPIRAGM GASSP SRG LSESSGG GSPFNQS G CNN+
Subjt:  GPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNS

A0A6J1IXR2 AT-hook motif nuclear-localized protein2.3e-15184.64Show/hide
Query:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS
        M+GSETGVMTS EPFTIG QKSPVQSQQ +L G+HL FGADGVYKP ++ SP YQS GVGV GNAGAD S REAFV +N Q+EPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGS

Query:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA
        MA++ A PSAAATQS GGFSPPPT    SGG ASPT +KKARGRPPGS KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANGA
Subjt:  MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA

Query:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPAS
        ISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENGGQRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DG +KEL+QANQIEQ P +
Subjt:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPAS

Query:  GPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNS
         PHKLAPIRAGMTGASSP SRG LSESSGG GSPFNQS G CNN+
Subjt:  GPHKLAPIRAGMTGASSPPSRGTLSESSGGPGSPFNQSAGGCNNS

SwissProt top hitse value%identityAlignment
O22812 AT-hook motif nuclear-localized protein 103.8e-7953.89Show/hide
Query:  MSGSETGVMTS---SEPFTIGL--QKSPVQSQQPILQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKR
        MSGSETG+M +   S  FT+ L  Q+   Q+Q    Q   L+FG D    +YK P+ + SPP  YQ +  G       +    E+       +EPVK++R
Subjt:  MSGSETGVMTS---SEPFTIGL--QKSPVQSQQPILQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKR

Query:  GRPRKYGPDG---SMALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMSFS
        GRPRKYGPD    S+ L+P  PS   +Q            P SGG       +K RGRPPGSS K+ +L ALGS G+GFTPHV+TV AGEDVSSKIM+ +
Subjt:  GRPRKYGPDG---SMALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMSFS

Query:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWR
         NGPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG +
Subjt:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWR

Query:  KELQQANQIEQPPASGPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGGCNNSNPQGMTAIPWK
        +  Q   Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS GG  N+       +PWK
Subjt:  KELQQANQIEQPPASGPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGGCNNSNPQGMTAIPWK

O80834 AT-hook motif nuclear-localized protein 91.8e-4948.4Show/hide
Query:  PVAAASPPYQSSGV-GVPGNAGADGSAREA--FVNINM-------QTEPVKRKRGRPRKYGPDGSMALSPALPSAAATQSSGGFSPPPTATPLSGGPASP
        P  + S  + S  + G P  A A G A      + +NM          P+KRKRGRPRKYG DGS++L  AL S++ +          T TP        
Subjt:  PVAAASPPYQSSGV-GVPGNAGADGSAREA--FVNINM-------QTEPVKRKRGRPRKYGPDGSMALSPALPSAAATQSSGGFSPPPTATPLSGGPASP

Query:  TSMKKARGRPPGSSKKQQLDALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGS
         S K+ RGRPPGS KKQ++ ++G     S+G+ FTPHVI V  GED++SK+++FSQ GPRA+C+L+A+GA+S  TL QP+ S G + YEGRFEIL+LS S
Subjt:  TSMKKARGRPPGSSKKQQLDALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGS

Query:  YLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV
        Y+++ +G  R+RTG LSVSL+ PDGRV+GG + G L AASPVQV+VGSF+
Subjt:  YLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV

Q8VYJ2 AT-hook motif nuclear-localized protein 11.3e-5050.2Show/hide
Query:  VKRKRGRPRKYGPDGS-MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQQLDALG-----SAGVGFTPHVITVKAGEDVS
        +K+KRGRPRKYGPDG+ +ALSP   S+A   S     PPP++  +    +   S  K       +    Q++ LG     S G  FTPH+ITV  GEDV+
Subjt:  VKRKRGRPRKYGPDGS-MALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQQLDALG-----SAGVGFTPHVITVKAGEDVS

Query:  SKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGS
         KI+SFSQ GPR++C+L+ANG IS+VTLRQP  SGGT+TYEGRFEILSLSGS++ +++GG RSRTGG+SVSL+ PDGRV+GGG+AGLL AASPVQVVVGS
Subjt:  SKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGS

Query:  FVA-----DGWRKELQQANQIEQPPASGPHKLAPIRAGMTGASSPPSRGTLSESS
        F+A     D   K+ +    +  P A+ P   A     +   SS P      ++S
Subjt:  FVA-----DGWRKELQQANQIEQPPASGPHKLAPIRAGMTGASSPPSRGTLSESS

Q940I0 AT-hook motif nuclear-localized protein 139.7e-5146.01Show/hide
Query:  VGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDG----------SMALSPALPSAAATQSSGGFSPPPTATPLSGGPA--SPTSMKKARGRPP
        +G  G+  +  + ++  +   +  + VK+KRGRPRKY  DG          ++ L+P  P  +A+ S GG +        +G  A  S    K+ RGRPP
Subjt:  VGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDG----------SMALSPALPSAAATQSSGGFSPPPTATPLSGGPA--SPTSMKKARGRPP

Query:  GSSKKQQLDAL-GSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSG--GTVTYEGRFEILSLSGSYLLSENGGQRSRT
        GS KK QLDAL G+ GVGFTPHVI VK GED+++KI++F+  GPRA+CIL+A GA++NV LRQ   S   GTV YEGRFEI+SLSGS+L SE+ G  ++T
Subjt:  GSSKKQQLDAL-GSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSG--GTVTYEGRFEILSLSGSYLLSENGGQRSRT

Query:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQ--PPASGPHKLAPIRAGMTGASSPPSRGT--LSESS--GGPGSPFNQ
        G LSVSL+G +GR++GG V G+L A S VQV+VGSFV DG RK+ Q A + +    PAS P  +     G+ G  SP S+G    SESS      SP ++
Subjt:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQ--PPASGPHKLAPIRAGMTGASSPPSRGT--LSESS--GGPGSPFNQ

Query:  SAGGCNNSNPQGM
         +   NNSN  G+
Subjt:  SAGGCNNSNPQGM

Q9FIR1 AT-hook motif nuclear-localized protein 89.1e-4949.8Show/hide
Query:  VKRKRGRPRKYGPDGSMALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQQLDAL-GSAGVGFTPHVITVKAGEDVSSKIMS
        VK+KRGRPRKY PDGS+AL  A  S   + +S  +           G +    +K+ RGRPPGSSKK QLDAL G++GVGFTPHVI V  GED++SK+M+
Subjt:  VKRKRGRPRKYGPDGSMALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQQLDAL-GSAGVGFTPHVITVKAGEDVSSKIMS

Query:  FSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADG
        FS  G R +CIL+A+GA+S V LRQ + S G VTYEGRFEI++LSGS L  E  G  +R+G LSV+L+GPDG ++GG V G L AA+ VQV+VGSFVA+ 
Subjt:  FSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADG

Query:  WRKELQQAN--QIEQP-PASGPHKLAPIRAGMTGASSPPSRGTLSESSGGP
         + +    N  + + P PAS P  +    +   G SS  S       SG P
Subjt:  WRKELQQAN--QIEQP-PASGPHKLAPIRAGMTGASSPPSRGTLSESSGGP

Arabidopsis top hitse value%identityAlignment
AT2G33620.1 AT hook motif DNA-binding family protein2.7e-8053.89Show/hide
Query:  MSGSETGVMTS---SEPFTIGL--QKSPVQSQQPILQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKR
        MSGSETG+M +   S  FT+ L  Q+   Q+Q    Q   L+FG D    +YK P+ + SPP  YQ +  G       +    E+       +EPVK++R
Subjt:  MSGSETGVMTS---SEPFTIGL--QKSPVQSQQPILQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKR

Query:  GRPRKYGPDG---SMALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMSFS
        GRPRKYGPD    S+ L+P  PS   +Q            P SGG       +K RGRPPGSS K+ +L ALGS G+GFTPHV+TV AGEDVSSKIM+ +
Subjt:  GRPRKYGPDG---SMALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMSFS

Query:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWR
         NGPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG +
Subjt:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWR

Query:  KELQQANQIEQPPASGPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGGCNNSNPQGMTAIPWK
        +  Q   Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS GG  N+       +PWK
Subjt:  KELQQANQIEQPPASGPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGGCNNSNPQGMTAIPWK

AT2G33620.2 AT hook motif DNA-binding family protein2.7e-8053.89Show/hide
Query:  MSGSETGVMTS---SEPFTIGL--QKSPVQSQQPILQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKR
        MSGSETG+M +   S  FT+ L  Q+   Q+Q    Q   L+FG D    +YK P+ + SPP  YQ +  G       +    E+       +EPVK++R
Subjt:  MSGSETGVMTS---SEPFTIGL--QKSPVQSQQPILQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKR

Query:  GRPRKYGPDG---SMALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMSFS
        GRPRKYGPD    S+ L+P  PS   +Q            P SGG       +K RGRPPGSS K+ +L ALGS G+GFTPHV+TV AGEDVSSKIM+ +
Subjt:  GRPRKYGPDG---SMALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMSFS

Query:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWR
         NGPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG +
Subjt:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWR

Query:  KELQQANQIEQPPASGPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGGCNNSNPQGMTAIPWK
        +  Q   Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS GG  N+       +PWK
Subjt:  KELQQANQIEQPPASGPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGGCNNSNPQGMTAIPWK

AT2G33620.3 AT hook motif DNA-binding family protein2.7e-8053.89Show/hide
Query:  MSGSETGVMTS---SEPFTIGL--QKSPVQSQQPILQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKR
        MSGSETG+M +   S  FT+ L  Q+   Q+Q    Q   L+FG D    +YK P+ + SPP  YQ +  G       +    E+       +EPVK++R
Subjt:  MSGSETGVMTS---SEPFTIGL--QKSPVQSQQPILQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKR

Query:  GRPRKYGPDG---SMALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMSFS
        GRPRKYGPD    S+ L+P  PS   +Q            P SGG       +K RGRPPGSS K+ +L ALGS G+GFTPHV+TV AGEDVSSKIM+ +
Subjt:  GRPRKYGPDG---SMALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMSFS

Query:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWR
         NGPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG +
Subjt:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWR

Query:  KELQQANQIEQPPASGPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGGCNNSNPQGMTAIPWK
        +  Q   Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS GG  N+       +PWK
Subjt:  KELQQANQIEQPPASGPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGGCNNSNPQGMTAIPWK

AT2G33620.4 AT hook motif DNA-binding family protein2.7e-8053.89Show/hide
Query:  MSGSETGVMTS---SEPFTIGL--QKSPVQSQQPILQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKR
        MSGSETG+M +   S  FT+ L  Q+   Q+Q    Q   L+FG D    +YK P+ + SPP  YQ +  G       +    E+       +EPVK++R
Subjt:  MSGSETGVMTS---SEPFTIGL--QKSPVQSQQPILQGMHLAFGAD---GVYK-PVAAASPP--YQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKR

Query:  GRPRKYGPDG---SMALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMSFS
        GRPRKYGPD    S+ L+P  PS   +Q            P SGG       +K RGRPPGSS K+ +L ALGS G+GFTPHV+TV AGEDVSSKIM+ +
Subjt:  GRPRKYGPDG---SMALSPALPSAAATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMSFS

Query:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWR
         NGPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG +
Subjt:  QNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWR

Query:  KELQQANQIEQPPASGPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGGCNNSNPQGMTAIPWK
        +  Q   Q+       P ++AP +  MT  SSP SRGT+SESS  GG GSP +QS GG  N+       +PWK
Subjt:  KELQQANQIEQPPASGPHKLAPIRAGMTGASSPPSRGTLSESS--GGPGSPFNQSAGGCNNSNPQGMTAIPWK

AT4G17950.1 AT hook motif DNA-binding family protein6.9e-5246.01Show/hide
Query:  VGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDG----------SMALSPALPSAAATQSSGGFSPPPTATPLSGGPA--SPTSMKKARGRPP
        +G  G+  +  + ++  +   +  + VK+KRGRPRKY  DG          ++ L+P  P  +A+ S GG +        +G  A  S    K+ RGRPP
Subjt:  VGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDG----------SMALSPALPSAAATQSSGGFSPPPTATPLSGGPA--SPTSMKKARGRPP

Query:  GSSKKQQLDAL-GSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSG--GTVTYEGRFEILSLSGSYLLSENGGQRSRT
        GS KK QLDAL G+ GVGFTPHVI VK GED+++KI++F+  GPRA+CIL+A GA++NV LRQ   S   GTV YEGRFEI+SLSGS+L SE+ G  ++T
Subjt:  GSSKKQQLDAL-GSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSG--GTVTYEGRFEILSLSGSYLLSENGGQRSRT

Query:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQ--PPASGPHKLAPIRAGMTGASSPPSRGT--LSESS--GGPGSPFNQ
        G LSVSL+G +GR++GG V G+L A S VQV+VGSFV DG RK+ Q A + +    PAS P  +     G+ G  SP S+G    SESS      SP ++
Subjt:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQ--PPASGPHKLAPIRAGMTGASSPPSRGT--LSESS--GGPGSPFNQ

Query:  SAGGCNNSNPQGM
         +   NNSN  G+
Subjt:  SAGGCNNSNPQGM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGGATCTGAGACCGGAGTGATGACCAGCAGCGAACCCTTCACCATCGGTCTCCAGAAGAGTCCGGTACAGTCGCAACAGCCGATCCTGCAAGGCATGCATTTAGC
CTTCGGCGCCGACGGCGTCTACAAGCCCGTCGCCGCCGCCTCACCCCCCTACCAGTCCTCCGGTGTCGGAGTTCCCGGCAATGCCGGCGCCGACGGATCTGCTCGTGAAG
CTTTCGTCAACATTAATATGCAAACCGAGCCTGTGAAGAGGAAGAGAGGGAGGCCCAGGAAGTATGGGCCTGATGGCAGTATGGCACTGTCCCCTGCTCTCCCCTCCGCC
GCCGCAACTCAGTCCAGTGGAGGGTTTTCCCCTCCGCCCACCGCCACTCCTCTGTCGGGAGGACCGGCCTCTCCAACTTCTATGAAGAAAGCCAGAGGCAGACCCCCTGG
CTCTAGCAAAAAGCAGCAATTAGATGCTTTGGGGTCAGCTGGAGTTGGATTTACCCCACATGTCATCACCGTGAAAGCTGGAGAGGATGTATCTTCGAAAATAATGTCAT
TTTCGCAGAATGGTCCGAGAGCTGTTTGTATCCTTACGGCAAATGGAGCAATATCCAATGTGACTCTACGTCAACCAGCCATGTCTGGTGGAACTGTGACTTACGAGGGG
CGATTCGAGATCTTGTCACTATCTGGGTCATATCTCCTGTCTGAAAATGGCGGTCAGCGGAGCCGAACTGGGGGTCTAAGTGTTTCATTGTCTGGACCAGACGGTAGAGT
ATTAGGTGGTGGGGTGGCTGGTCTTCTAACGGCAGCGTCTCCTGTTCAGGTGGTGGTGGGGAGTTTCGTCGCAGATGGGTGGCGCAAGGAATTGCAACAAGCAAACCAAA
TTGAACAACCGCCTGCTTCTGGACCTCATAAACTTGCTCCAATCCGTGCTGGAATGACGGGGGCCAGCAGCCCGCCATCGCGTGGGACTCTCAGTGAATCCTCAGGAGGG
CCTGGAAGTCCGTTTAATCAGAGTGCTGGAGGCTGCAACAACAGTAACCCACAAGGCATGACCGCCATACCTTGGAAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGGGATCTGAGACCGGAGTGATGACCAGCAGCGAACCCTTCACCATCGGTCTCCAGAAGAGTCCGGTACAGTCGCAACAGCCGATCCTGCAAGGCATGCATTTAGC
CTTCGGCGCCGACGGCGTCTACAAGCCCGTCGCCGCCGCCTCACCCCCCTACCAGTCCTCCGGTGTCGGAGTTCCCGGCAATGCCGGCGCCGACGGATCTGCTCGTGAAG
CTTTCGTCAACATTAATATGCAAACCGAGCCTGTGAAGAGGAAGAGAGGGAGGCCCAGGAAGTATGGGCCTGATGGCAGTATGGCACTGTCCCCTGCTCTCCCCTCCGCC
GCCGCAACTCAGTCCAGTGGAGGGTTTTCCCCTCCGCCCACCGCCACTCCTCTGTCGGGAGGACCGGCCTCTCCAACTTCTATGAAGAAAGCCAGAGGCAGACCCCCTGG
CTCTAGCAAAAAGCAGCAATTAGATGCTTTGGGGTCAGCTGGAGTTGGATTTACCCCACATGTCATCACCGTGAAAGCTGGAGAGGATGTATCTTCGAAAATAATGTCAT
TTTCGCAGAATGGTCCGAGAGCTGTTTGTATCCTTACGGCAAATGGAGCAATATCCAATGTGACTCTACGTCAACCAGCCATGTCTGGTGGAACTGTGACTTACGAGGGG
CGATTCGAGATCTTGTCACTATCTGGGTCATATCTCCTGTCTGAAAATGGCGGTCAGCGGAGCCGAACTGGGGGTCTAAGTGTTTCATTGTCTGGACCAGACGGTAGAGT
ATTAGGTGGTGGGGTGGCTGGTCTTCTAACGGCAGCGTCTCCTGTTCAGGTGGTGGTGGGGAGTTTCGTCGCAGATGGGTGGCGCAAGGAATTGCAACAAGCAAACCAAA
TTGAACAACCGCCTGCTTCTGGACCTCATAAACTTGCTCCAATCCGTGCTGGAATGACGGGGGCCAGCAGCCCGCCATCGCGTGGGACTCTCAGTGAATCCTCAGGAGGG
CCTGGAAGTCCGTTTAATCAGAGTGCTGGAGGCTGCAACAACAGTAACCCACAAGGCATGACCGCCATACCTTGGAAG
Protein sequenceShow/hide protein sequence
MSGSETGVMTSSEPFTIGLQKSPVQSQQPILQGMHLAFGADGVYKPVAAASPPYQSSGVGVPGNAGADGSAREAFVNINMQTEPVKRKRGRPRKYGPDGSMALSPALPSA
AATQSSGGFSPPPTATPLSGGPASPTSMKKARGRPPGSSKKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEG
RFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADGWRKELQQANQIEQPPASGPHKLAPIRAGMTGASSPPSRGTLSESSGG
PGSPFNQSAGGCNNSNPQGMTAIPWK