; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036691 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036691
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionAT-hook motif nuclear-localized protein
Genome locationchr2:418593..420957
RNA-Seq ExpressionLag0036691
SyntenyLag0036691
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003680 - AT DNA binding (molecular function)
InterPro domainsIPR005175 - PPC domain
IPR017956 - AT hook, DNA-binding motif
IPR039605 - AT-hook motif nuclear-localized protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7037975.1 AT-hook motif nuclear-localized protein 10 [Cucurbita argyrosperma subsp. argyrosperma]1.8e-15285.39Show/hide
Query:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS
        M+GSE GVMTS EPFTIG QKSPVQSQQ VL G+HLPFGADGVYKP  + SPTY S  V       AD S REAFV+MNTQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANGA
        MA+  A PSAAATQS GGFSPPPT   PSGG ASPT LKKARGRPPGS KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMS SQNG RAVCIL+ANGA
Subjt:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANGA

Query:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPVT
        ISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGGH ELKQANQIEQ PVT
Subjt:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPVT

Query:  GPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGACNNTIPWK
         PHKLAPIRAGM+GASSP SRG LSESSGG GSPFNQS GACNNT  WK
Subjt:  GPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGACNNTIPWK

XP_004145559.1 AT-hook motif nuclear-localized protein 10 [Cucumis sativus]3.2e-15787.75Show/hide
Query:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS
        MSGSE GV++S E FTIGLQK+ V SQQPV+Q MHLPFGADGVYKPVATASPTY SS V       ADGSAR+AFVNMN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANG
        MA+APAV  AAATQSSGGFSP PTAAP SG  ASPTSLKK RGRPPGSS KK  LD   SAGVGFTPHVITVKAGEDVSSKIMSFSQNG RAVCILTANG
Subjt:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGH EL+Q NQIEQPPV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPV

Query:  TGPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGAC-NNTIPWK
        + PHKLAPIRAGM+GASSPPSRGTLSESSGGPGSPFNQSAGAC NNTIPWK
Subjt:  TGPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGAC-NNTIPWK

XP_008453016.1 PREDICTED: AT-hook motif nuclear-localized protein 10 [Cucumis melo]2.1e-15687.18Show/hide
Query:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS
        MSGSE GV++S E FTIGLQK+ VQSQQPV+Q MHLPFGADGVYKPV  ASPTY SS V       ADGSAREAFVNMN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANG
        M++APAV  AAATQSSGGFSP PTAAP SGG  SPTSLKK RGRPPGSS KK QLD+  S GVGFTPHVITVKAGEDVSSKIMSFSQNG RAVCILTANG
Subjt:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD GH EL+Q NQIEQPPV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPV

Query:  TGPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGAC-NNTIPWK
        + PHKLAPIRAGM+GASSPPSRGTLSESSGGPGSPFNQSAGAC NNTIPWK
Subjt:  TGPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGAC-NNTIPWK

XP_022940600.1 AT-hook motif nuclear-localized protein 10-like isoform X1 [Cucurbita moschata]8.2e-15385.39Show/hide
Query:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS
        M+GSE GVMTS EPFTIG QKSPVQSQQ VL G+HLPFGADGVYKP ++ SPTY S  V       AD S REAFV+MNTQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANGA
        MA+  A PSAAATQS GGFSPPPT   PSGG ASPT LKKARGRPPGS KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMS SQNG RAVCIL+ANGA
Subjt:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANGA

Query:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPVT
        ISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGGH ELKQANQIEQ PVT
Subjt:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPVT

Query:  GPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGACNNTIPWK
         PHKLAPIRAGM+GASSP SRG LSESSGG GSPFNQS GACNNT  WK
Subjt:  GPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGACNNTIPWK

XP_038898092.1 AT-hook motif nuclear-localized protein 10-like [Benincasa hispida]6.9e-16089.46Show/hide
Query:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS
        MSGSE GVM+S EPFTIGLQK+ VQSQQ V+QGMHLPFGADGVYKPVA ASPTY SS V       ADGSAREAFVNMN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANG
        MALAPAV SAAATQ SGGFSPPPTAAPPSGG ASPT LKKARGRPPGSS KKQQLD  GSAGVGFTPHVITVKAGEDVSSKIMSFSQNG RAVCILTANG
Subjt:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSF+TDGGH EL   NQIEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPV

Query:  TGPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGACNNT-IPWK
        T PHKLAPIRAGM+GASSPPSRGTLSESSGGPGSPFNQS GACNN  IPWK
Subjt:  TGPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGACNNT-IPWK

TrEMBL top hitse value%identityAlignment
A0A0A0L0T7 AT-hook motif nuclear-localized protein1.6e-15787.75Show/hide
Query:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS
        MSGSE GV++S E FTIGLQK+ V SQQPV+Q MHLPFGADGVYKPVATASPTY SS V       ADGSAR+AFVNMN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANG
        MA+APAV  AAATQSSGGFSP PTAAP SG  ASPTSLKK RGRPPGSS KK  LD   SAGVGFTPHVITVKAGEDVSSKIMSFSQNG RAVCILTANG
Subjt:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGH EL+Q NQIEQPPV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPV

Query:  TGPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGAC-NNTIPWK
        + PHKLAPIRAGM+GASSPPSRGTLSESSGGPGSPFNQSAGAC NNTIPWK
Subjt:  TGPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGAC-NNTIPWK

A0A1S3BWC6 AT-hook motif nuclear-localized protein1.0e-15687.18Show/hide
Query:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS
        MSGSE GV++S E FTIGLQK+ VQSQQPV+Q MHLPFGADGVYKPV  ASPTY SS V       ADGSAREAFVNMN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANG
        M++APAV  AAATQSSGGFSP PTAAP SGG  SPTSLKK RGRPPGSS KK QLD+  S GVGFTPHVITVKAGEDVSSKIMSFSQNG RAVCILTANG
Subjt:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD GH EL+Q NQIEQPPV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPV

Query:  TGPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGAC-NNTIPWK
        + PHKLAPIRAGM+GASSPPSRGTLSESSGGPGSPFNQSAGAC NNTIPWK
Subjt:  TGPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGAC-NNTIPWK

A0A5A7VAQ2 AT-hook motif nuclear-localized protein1.0e-15687.18Show/hide
Query:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS
        MSGSE GV++S E FTIGLQK+ VQSQQPV+Q MHLPFGADGVYKPV  ASPTY SS V       ADGSAREAFVNMN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANG
        M++APAV  AAATQSSGGFSP PTAAP SGG  SPTSLKK RGRPPGSS KK QLD+  S GVGFTPHVITVKAGEDVSSKIMSFSQNG RAVCILTANG
Subjt:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSS-KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPV
        AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTD GH EL+Q NQIEQPPV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPV

Query:  TGPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGAC-NNTIPWK
        + PHKLAPIRAGM+GASSPPSRGTLSESSGGPGSPFNQSAGAC NNTIPWK
Subjt:  TGPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGAC-NNTIPWK

A0A6J1FR30 AT-hook motif nuclear-localized protein4.0e-15385.39Show/hide
Query:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS
        M+GSE GVMTS EPFTIG QKSPVQSQQ VL G+HLPFGADGVYKP ++ SPTY S  V       AD S REAFV+MNTQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANGA
        MA+  A PSAAATQS GGFSPPPT   PSGG ASPT LKKARGRPPGS KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMS SQNG RAVCIL+ANGA
Subjt:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANGA

Query:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPVT
        ISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGGH ELKQANQIEQ PVT
Subjt:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPVT

Query:  GPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGACNNTIPWK
         PHKLAPIRAGM+GASSP SRG LSESSGG GSPFNQS GACNNT  WK
Subjt:  GPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGACNNTIPWK

A0A6J1IXR2 AT-hook motif nuclear-localized protein9.7e-15285.1Show/hide
Query:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS
        M+GSE GVMTS EPFTIG QKSPVQSQQ VL G+HLPFGADGVYKP ++ SPTY S  V       AD S REAFV MNTQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVV-------ADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANGA
        MA+A A PSAAATQS GGFSPPPT   PSGG ASPT LKKARGRPPGS KKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMS SQNG RAVCIL+ANGA
Subjt:  MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANGA

Query:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPVT
        ISNVTLRQPAMSGGTVTYEGRFEILSLSG YLL+ENGGQRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DGG  ELKQANQIEQ PVT
Subjt:  ISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPVT

Query:  GPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGACNNTIPWK
         PHKLAPIRAGM+GASSP SRG LSESSGG GSPFNQS GACNNT  WK
Subjt:  GPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQSAGACNNTIPWK

SwissProt top hitse value%identityAlignment
O22812 AT-hook motif nuclear-localized protein 103.8e-7653.62Show/hide
Query:  MSGSEAGVMTS---SEPFTIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVATASPTYHSSVVADGSAREAFVNMN-----------TQSEPVKR
        MSGSE G+M +   S  FT+ L  Q+   Q+Q    Q   L FG D    +YK P+ + SP        + +   + +NMN           T SEPVK+
Subjt:  MSGSEAGVMTS---SEPFTIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVATASPTYHSSVVADGSAREAFVNMN-----------TQSEPVKR

Query:  KRGRPRKYGPDG---SMALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMS
        +RGRPRKYGPD    S+ L P  PS   +Q            P SGG       +K RGRPPGSS K+ +L ALGS G+GFTPHV+TV AGEDVSSKIM+
Subjt:  KRGRPRKYGPDG---SMALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMS

Query:  FSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDG
         + NG RAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG
Subjt:  FSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDG

Query:  GHNELKQANQIEQPPVTGP--HKLAPIRAGMSGASSPPSRGTLSESS--GGPGSPFNQSAGA-CNNTI--PWK
           E +    + Q  ++ P   ++AP +  M+  SSP SRGT+SESS  GG GSP +QS G   NNTI  PWK
Subjt:  GHNELKQANQIEQPPVTGP--HKLAPIRAGMSGASSPPSRGTLSESS--GGPGSPFNQSAGA-CNNTI--PWK

O49658 AT-hook motif nuclear-localized protein 24.3e-4853.24Show/hide
Query:  SEPVKRKRGRPRKYGPDGSMALAPAVPSAAATQSSGGFSPPPTAAPPSG--GPASPTSLKKARGRPPGSSKKQQLDALG-----SAGVGFTPHVITVKAG
        S P+K++RGRPRKYG DG+       P ++A  ++       T +   G   PA+PT     R        K Q++ LG     SA   FTPH+ITV AG
Subjt:  SEPVKRKRGRPRKYGPDGSMALAPAVPSAAATQSSGGFSPPPTAAPPSG--GPASPTSLKKARGRPPGSSKKQQLDALG-----SAGVGFTPHVITVKAG

Query:  EDVSSKIMSFSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQV
        EDV+ +I+SFSQ GS A+C+L ANG +S+VTLRQP  SGGT+TYEGRFEILSLSG+++ S++ G RSRTGG+SVSL+ PDGRV+GGGVAGLL AA+P+QV
Subjt:  EDVSSKIMSFSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQV

Query:  VVGSFVTDGGHNELKQ
        VVG+F+  GG N+ +Q
Subjt:  VVGSFVTDGGHNELKQ

O80834 AT-hook motif nuclear-localized protein 99.7e-4852.97Show/hide
Query:  PVKRKRGRPRKYGPDGSMALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQQLDALG-----SAGVGFTPHVITVKAGEDVS
        P+KRKRGRPRKYG DGS++LA +  S +            T  P +       S K+ RGRPPGS KKQ++ ++G     S+G+ FTPHVI V  GED++
Subjt:  PVKRKRGRPRKYGPDGSMALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQQLDALG-----SAGVGFTPHVITVKAGEDVS

Query:  SKIMSFSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGS
        SK+++FSQ G RA+C+L+A+GA+S  TL QP+ S G + YEGRFEIL+LS SY+++ +G  R+RTG LSVSL+ PDGRV+GG + G L AASPVQV+VGS
Subjt:  SKIMSFSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGS

Query:  FV
        F+
Subjt:  FV

Q8VYJ2 AT-hook motif nuclear-localized protein 14.6e-5048.3Show/hide
Query:  TASPTYHSSVVADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS-MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPA--SPTSLKKARGRPPGSSKK-
        TA P    S V   +   A   ++     +K+KRGRPRKYGPDG+ +AL+P   S+A         P P+  PP          S K+++ +P  S  + 
Subjt:  TASPTYHSSVVADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS-MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPA--SPTSLKKARGRPPGSSKK-

Query:  ---QQLDALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRT
            Q++ LG     S G  FTPH+ITV  GEDV+ KI+SFSQ G R++C+L+ANG IS+VTLRQP  SGGT+TYEGRFEILSLSGS++ +++GG RSRT
Subjt:  ---QQLDALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRT

Query:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPVTGPHKLAPI
        GG+SVSL+ PDGRV+GGG+AGLL AASPVQVVVGSF+    H + K         ++ P    PI
Subjt:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPVTGPHKLAPI

Q940I0 AT-hook motif nuclear-localized protein 137.4e-4844.76Show/hide
Query:  QSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVVADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDG----------SMALAPAVPSAAATQSSGGFSP
        Q QQ + Q      G DG   P + A+   HS            +      + VK+KRGRPRKY  DG          ++ LAP  P  +A+ S GG + 
Subjt:  QSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVVADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDG----------SMALAPAVPSAAATQSSGGFSP

Query:  PPTAAPPSGGPA--SPTSLKKARGRPPGSSKKQQLDAL-GSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANGAISNVTLRQPAMSG--GTV
               +G  A  S    K+ RGRPPGS KK QLDAL G+ GVGFTPHVI VK GED+++KI++F+  G RA+CIL+A GA++NV LRQ   S   GTV
Subjt:  PPTAAPPSGGPA--SPTSLKKARGRPPGSSKKQQLDAL-GSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANGAISNVTLRQPAMSG--GTV

Query:  TYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQP-PVTGPHKLAPIRAGMSGA
         YEGRFEI+SLSGS+L SE+ G  ++TG LSVSL+G +GR++GG V G+L A S VQV+VGSFV DG   +         P P + P  +     G+ G 
Subjt:  TYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQP-PVTGPHKLAPIRAGMSGA

Query:  SSPPSRGT--LSESS
         SP S+G    SESS
Subjt:  SSPPSRGT--LSESS

Arabidopsis top hitse value%identityAlignment
AT2G33620.1 AT hook motif DNA-binding family protein2.7e-7753.62Show/hide
Query:  MSGSEAGVMTS---SEPFTIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVATASPTYHSSVVADGSAREAFVNMN-----------TQSEPVKR
        MSGSE G+M +   S  FT+ L  Q+   Q+Q    Q   L FG D    +YK P+ + SP        + +   + +NMN           T SEPVK+
Subjt:  MSGSEAGVMTS---SEPFTIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVATASPTYHSSVVADGSAREAFVNMN-----------TQSEPVKR

Query:  KRGRPRKYGPDG---SMALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMS
        +RGRPRKYGPD    S+ L P  PS   +Q            P SGG       +K RGRPPGSS K+ +L ALGS G+GFTPHV+TV AGEDVSSKIM+
Subjt:  KRGRPRKYGPDG---SMALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMS

Query:  FSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDG
         + NG RAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG
Subjt:  FSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDG

Query:  GHNELKQANQIEQPPVTGP--HKLAPIRAGMSGASSPPSRGTLSESS--GGPGSPFNQSAGA-CNNTI--PWK
           E +    + Q  ++ P   ++AP +  M+  SSP SRGT+SESS  GG GSP +QS G   NNTI  PWK
Subjt:  GHNELKQANQIEQPPVTGP--HKLAPIRAGMSGASSPPSRGTLSESS--GGPGSPFNQSAGA-CNNTI--PWK

AT2G33620.2 AT hook motif DNA-binding family protein2.7e-7753.62Show/hide
Query:  MSGSEAGVMTS---SEPFTIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVATASPTYHSSVVADGSAREAFVNMN-----------TQSEPVKR
        MSGSE G+M +   S  FT+ L  Q+   Q+Q    Q   L FG D    +YK P+ + SP        + +   + +NMN           T SEPVK+
Subjt:  MSGSEAGVMTS---SEPFTIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVATASPTYHSSVVADGSAREAFVNMN-----------TQSEPVKR

Query:  KRGRPRKYGPDG---SMALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMS
        +RGRPRKYGPD    S+ L P  PS   +Q            P SGG       +K RGRPPGSS K+ +L ALGS G+GFTPHV+TV AGEDVSSKIM+
Subjt:  KRGRPRKYGPDG---SMALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMS

Query:  FSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDG
         + NG RAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG
Subjt:  FSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDG

Query:  GHNELKQANQIEQPPVTGP--HKLAPIRAGMSGASSPPSRGTLSESS--GGPGSPFNQSAGA-CNNTI--PWK
           E +    + Q  ++ P   ++AP +  M+  SSP SRGT+SESS  GG GSP +QS G   NNTI  PWK
Subjt:  GHNELKQANQIEQPPVTGP--HKLAPIRAGMSGASSPPSRGTLSESS--GGPGSPFNQSAGA-CNNTI--PWK

AT2G33620.3 AT hook motif DNA-binding family protein2.7e-7753.62Show/hide
Query:  MSGSEAGVMTS---SEPFTIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVATASPTYHSSVVADGSAREAFVNMN-----------TQSEPVKR
        MSGSE G+M +   S  FT+ L  Q+   Q+Q    Q   L FG D    +YK P+ + SP        + +   + +NMN           T SEPVK+
Subjt:  MSGSEAGVMTS---SEPFTIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVATASPTYHSSVVADGSAREAFVNMN-----------TQSEPVKR

Query:  KRGRPRKYGPDG---SMALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMS
        +RGRPRKYGPD    S+ L P  PS   +Q            P SGG       +K RGRPPGSS K+ +L ALGS G+GFTPHV+TV AGEDVSSKIM+
Subjt:  KRGRPRKYGPDG---SMALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMS

Query:  FSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDG
         + NG RAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG
Subjt:  FSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDG

Query:  GHNELKQANQIEQPPVTGP--HKLAPIRAGMSGASSPPSRGTLSESS--GGPGSPFNQSAGA-CNNTI--PWK
           E +    + Q  ++ P   ++AP +  M+  SSP SRGT+SESS  GG GSP +QS G   NNTI  PWK
Subjt:  GHNELKQANQIEQPPVTGP--HKLAPIRAGMSGASSPPSRGTLSESS--GGPGSPFNQSAGA-CNNTI--PWK

AT2G33620.4 AT hook motif DNA-binding family protein2.7e-7753.62Show/hide
Query:  MSGSEAGVMTS---SEPFTIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVATASPTYHSSVVADGSAREAFVNMN-----------TQSEPVKR
        MSGSE G+M +   S  FT+ L  Q+   Q+Q    Q   L FG D    +YK P+ + SP        + +   + +NMN           T SEPVK+
Subjt:  MSGSEAGVMTS---SEPFTIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVATASPTYHSSVVADGSAREAFVNMN-----------TQSEPVKR

Query:  KRGRPRKYGPDG---SMALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMS
        +RGRPRKYGPD    S+ L P  PS   +Q            P SGG       +K RGRPPGSS K+ +L ALGS G+GFTPHV+TV AGEDVSSKIM+
Subjt:  KRGRPRKYGPDG---SMALAPAVPSAAATQSSGGFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQ-QLDALGSAGVGFTPHVITVKAGEDVSSKIMS

Query:  FSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDG
         + NG RAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFEILSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVGSF+ DG
Subjt:  FSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDG

Query:  GHNELKQANQIEQPPVTGP--HKLAPIRAGMSGASSPPSRGTLSESS--GGPGSPFNQSAGA-CNNTI--PWK
           E +    + Q  ++ P   ++AP +  M+  SSP SRGT+SESS  GG GSP +QS G   NNTI  PWK
Subjt:  GHNELKQANQIEQPPVTGP--HKLAPIRAGMSGASSPPSRGTLSESS--GGPGSPFNQSAGA-CNNTI--PWK

AT4G12080.1 AT-hook motif nuclear-localized protein 13.3e-5148.3Show/hide
Query:  TASPTYHSSVVADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS-MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPA--SPTSLKKARGRPPGSSKK-
        TA P    S V   +   A   ++     +K+KRGRPRKYGPDG+ +AL+P   S+A         P P+  PP          S K+++ +P  S  + 
Subjt:  TASPTYHSSVVADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGS-MALAPAVPSAAATQSSGGFSPPPTAAPPSGGPA--SPTSLKKARGRPPGSSKK-

Query:  ---QQLDALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRT
            Q++ LG     S G  FTPH+ITV  GEDV+ KI+SFSQ G R++C+L+ANG IS+VTLRQP  SGGT+TYEGRFEILSLSGS++ +++GG RSRT
Subjt:  ---QQLDALG-----SAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSLSGSYLLSENGGQRSRT

Query:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPVTGPHKLAPI
        GG+SVSL+ PDGRV+GGG+AGLL AASPVQVVVGSF+    H + K         ++ P    PI
Subjt:  GGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPVTGPHKLAPI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGGATCTGAGGCCGGAGTGATGACCAGCAGCGAACCCTTCACCATCGGTCTTCAGAAGAGTCCAGTACAGTCACAACAGCCGGTCTTGCAGGGCATGCATTTACC
TTTCGGCGCCGACGGCGTCTACAAGCCCGTCGCCACCGCCTCCCCCACCTATCACTCCTCCGTCGTCGCCGATGGATCTGCTCGAGAAGCTTTCGTTAACATGAATACTC
AAAGCGAGCCTGTAAAGCGGAAGAGAGGGAGGCCTAGGAAGTATGGGCCAGATGGCAGTATGGCACTGGCTCCTGCAGTCCCCTCCGCCGCCGCAACTCAGTCCAGTGGA
GGTTTTTCTCCTCCGCCCACCGCCGCTCCTCCGTCGGGAGGACCGGCCTCTCCAACTTCCTTGAAGAAAGCCAGAGGCAGACCTCCTGGCTCTAGCAAAAAGCAGCAGCT
AGATGCTTTGGGGTCAGCAGGAGTTGGATTTACCCCACATGTCATTACCGTGAAAGCTGGAGAGGATGTTTCTTCGAAAATAATGTCATTTTCGCAGAATGGTTCTAGAG
CTGTTTGTATCCTTACAGCAAATGGAGCAATATCCAACGTGACTCTACGTCAACCAGCCATGTCTGGTGGAACCGTGACTTATGAGGGGCGATTCGAGATTTTGTCACTA
TCTGGGTCATACCTCCTCTCTGAAAATGGCGGTCAGCGGAGCCGAACTGGGGGTCTAAGTGTTTCATTGTCTGGACCAGATGGTCGAGTATTAGGTGGTGGGGTGGCTGG
TCTTCTAACGGCAGCGTCTCCTGTTCAGGTGGTGGTGGGGAGTTTCGTCACCGATGGGGGGCACAACGAATTGAAACAAGCAAACCAAATAGAACAGCCGCCTGTTACTG
GACCACACAAACTTGCCCCGATCCGTGCTGGAATGTCGGGGGCGAGCAGCCCGCCATCGCGTGGGACTCTCAGTGAATCCTCAGGAGGGCCTGGCAGTCCGTTTAATCAG
AGTGCTGGAGCCTGCAATAACACCATACCTTGGAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCAGGATCTGAGGCCGGAGTGATGACCAGCAGCGAACCCTTCACCATCGGTCTTCAGAAGAGTCCAGTACAGTCACAACAGCCGGTCTTGCAGGGCATGCATTTACC
TTTCGGCGCCGACGGCGTCTACAAGCCCGTCGCCACCGCCTCCCCCACCTATCACTCCTCCGTCGTCGCCGATGGATCTGCTCGAGAAGCTTTCGTTAACATGAATACTC
AAAGCGAGCCTGTAAAGCGGAAGAGAGGGAGGCCTAGGAAGTATGGGCCAGATGGCAGTATGGCACTGGCTCCTGCAGTCCCCTCCGCCGCCGCAACTCAGTCCAGTGGA
GGTTTTTCTCCTCCGCCCACCGCCGCTCCTCCGTCGGGAGGACCGGCCTCTCCAACTTCCTTGAAGAAAGCCAGAGGCAGACCTCCTGGCTCTAGCAAAAAGCAGCAGCT
AGATGCTTTGGGGTCAGCAGGAGTTGGATTTACCCCACATGTCATTACCGTGAAAGCTGGAGAGGATGTTTCTTCGAAAATAATGTCATTTTCGCAGAATGGTTCTAGAG
CTGTTTGTATCCTTACAGCAAATGGAGCAATATCCAACGTGACTCTACGTCAACCAGCCATGTCTGGTGGAACCGTGACTTATGAGGGGCGATTCGAGATTTTGTCACTA
TCTGGGTCATACCTCCTCTCTGAAAATGGCGGTCAGCGGAGCCGAACTGGGGGTCTAAGTGTTTCATTGTCTGGACCAGATGGTCGAGTATTAGGTGGTGGGGTGGCTGG
TCTTCTAACGGCAGCGTCTCCTGTTCAGGTGGTGGTGGGGAGTTTCGTCACCGATGGGGGGCACAACGAATTGAAACAAGCAAACCAAATAGAACAGCCGCCTGTTACTG
GACCACACAAACTTGCCCCGATCCGTGCTGGAATGTCGGGGGCGAGCAGCCCGCCATCGCGTGGGACTCTCAGTGAATCCTCAGGAGGGCCTGGCAGTCCGTTTAATCAG
AGTGCTGGAGCCTGCAATAACACCATACCTTGGAAGTAA
Protein sequenceShow/hide protein sequence
MSGSEAGVMTSSEPFTIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVATASPTYHSSVVADGSAREAFVNMNTQSEPVKRKRGRPRKYGPDGSMALAPAVPSAAATQSSG
GFSPPPTAAPPSGGPASPTSLKKARGRPPGSSKKQQLDALGSAGVGFTPHVITVKAGEDVSSKIMSFSQNGSRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFEILSL
SGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVTDGGHNELKQANQIEQPPVTGPHKLAPIRAGMSGASSPPSRGTLSESSGGPGSPFNQ
SAGACNNTIPWK