CuGenDBv2

Tan0004523 (gene) of Snake gourd v1 genome

Gene ID: Tan0004523
Organism: Trichosanthes anguina (Snake gourd v1)
Description: AT-hook motif nuclear-localized protein
Genome location: LG01:116716138..116718761
RNA-Seq Expression: Tan0004523
Synteny: Tan0004523
Gene Ontology terms:
  GO:0005634 - nucleus (cellular component)
  GO:0003680 - AT DNA binding (molecular function)
InterPro domains:
  IPR005175 - PPC domain
  IPR017956 - AT hook, DNA-binding motif
  IPR039605 - AT-hook motif nuclear-localized protein
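
The genome location above uses a `seqid:start..end` notation. The following is a minimal sketch of how such a string could be split into its parts and turned into a span length. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the names `GenomeLocation` and `parse_location` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, hence the +1.
        return self.end - self.start + 1


_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Split a location string such as 'LG01:116716138..116718761'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return GenomeLocation(match["seqid"], start, end)


loc = parse_location("LG01:116716138..116718761")
print(loc.seqid, loc.length)  # LG01 2624
```

Under that assumption the gene spans 2,624 bp on LG01, which is longer than the 1,041 nt CDS listed under Sequences. The difference would be consistent with introns or untranslated regions, although this page does not list the gene structure.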


Homology
GenBank top hits (e value, % identity, alignment)
KAG7037975.1 AT-hook motif nuclear-localized protein 10 [Cucurbita argyrosperma subsp. argyrosperma]; e value: 5.3e-160; % identity: 87.68
Query:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS
        M+GSETGVMTSGEPF+IG QKSPVQSQQ VL G+HLPFGADGVYKP  + SPTYQS GVGV+GNAG+DVS   AFV+MNTQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA
        MA+  A PSAAATQS GGFSPPPT   PSGGSASPT LKKARGRPPGS KKQQLDALGSAG+GFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANGA
Subjt:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA

Query:  ISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPVT
        ISNVTLRQPAMSGGTVTYEGRFE+LSLSG YLL+ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV D GH ELKQAN IEQ+PVT
Subjt:  ISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPVT

Query:  APHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGACNNTISWK
        APHKLAPIRAGMAGASSP SRG LSESSGG GSPFNQS GACNNT SWK
Subjt:  APHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGACNNTISWK

XP_008453016.1 PREDICTED: AT-hook motif nuclear-localized protein 10 [Cucumis melo]; e value: 2.9e-158; % identity: 87.46
Query:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS
        MSGSETGV++SGE F+IGLQK+ VQSQQPV+Q MHLPFGADGVYKPV AASPTYQSS VGVAGNAG+D S   AFVNMN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSS-KKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        M++ PAV  AAATQSSGGFSP PTAAP SGGS SPTSLKK RGRPPGSS KK QLD+  S G+GFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSS-KKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPV
        AISNVTLRQPAMSGGTVTYEGRFE+LSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DVGH EL+Q N IEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPV

Query:  TAPHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGAC-NNTISWK
        +APHKLAPIRAGM GASSP SRGTLSESSGGPGSPFNQSAGAC NNTI WK
Subjt:  TAPHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGAC-NNTISWK

XP_022940600.1 AT-hook motif nuclear-localized protein 10-like isoform X1 [Cucurbita moschata]; e value: 7.0e-160; % identity: 87.68
Query:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS
        M+GSETGVMTSGEPF+IG QKSPVQSQQ VL G+HLPFGADGVYKP ++ SPTYQS  VGV+GNAG+DVS   AFV+MNTQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA
        MA+  A PSAAATQS GGFSPPPT   PSGGSASPT LKKARGRPPGS KKQQLDALGSAG+GFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANGA
Subjt:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA

Query:  ISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPVT
        ISNVTLRQPAMSGGTVTYEGRFE+LSLSG YLL+ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV D GH ELKQAN IEQ+PVT
Subjt:  ISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPVT

Query:  APHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGACNNTISWK
        APHKLAPIRAGMAGASSPQSRG LSESSGG GSPFNQS GACNNT SWK
Subjt:  APHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGACNNTISWK

XP_022981864.1 AT-hook motif nuclear-localized protein 10-like isoform X1 [Cucurbita maxima]; e value: 1.7e-158; % identity: 87.11
Query:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS
        M+GSETGVMTSGEPF+IG QKSPVQSQQ VL G+HLPFGADGVYKP ++ SPTYQS GVGV+GNAG+DVS   AFV MNTQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA
        MA+  A PSAAATQS GGFSPPPT   PSGGSASPT LKKARGRPPGS KKQQLDALGSAG+GFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANGA
Subjt:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA

Query:  ISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPVT
        ISNVTLRQPAMSGGTVTYEGRFE+LSLSG YLL+ENGGQRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV D G  ELKQAN IEQ+PVT
Subjt:  ISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPVT

Query:  APHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGACNNTISWK
        APHKLAPIRAGM GASSPQSRG LSESSGG GSPFNQS GACNNT SWK
Subjt:  APHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGACNNTISWK

XP_038898092.1 AT-hook motif nuclear-localized protein 10-like [Benincasa hispida]; e value: 2.2e-161; % identity: 89.46
Query:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS
        MSGSETGVM+SGEPF+IGLQK+ VQSQQ V+QGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAG+D S   AFVNMN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSS-KKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+ PAV SAAATQ SGGFSPPPTAAPPSGGSASPT LKKARGRPPGSS KKQQLD  GSAG+GFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSS-KKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPV
        AISNVTLRQPAMSGGTVTYEGRFE+LSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSF+ D GH EL   N IEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPV

Query:  TAPHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGACNNT-ISWK
        TAPHKLAPIRAGM GASSP SRGTLSESSGGPGSPFNQS GACNN  I WK
Subjt:  TAPHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGACNNT-ISWK
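
Each alignment block above carries a middle line in which a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows one way to tally those symbols for a single block. The function name `identity_stats` is a hypothetical helper, and the strings are the query and middle line of the final block of the first GenBank hit (KAG7037975.1). The result is a per-block figure, so it is not expected to equal the % identity reported for the whole alignment.

```python
from typing import Tuple


def identity_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identical, positives, aligned_length) for one alignment block.

    Assumes BLAST-style conventions: letters in the middle line are
    identities, '+' is a conservative substitution, spaces are mismatches
    or gaps. The aligned length is taken from the query line, gaps included.
    """
    aligned = len(query)
    # Pad in case trailing spaces of the middle line were trimmed.
    mid = midline.ljust(aligned)[:aligned]
    identical = sum(ch.isalpha() for ch in mid)
    positives = identical + mid.count("+")
    return identical, positives, aligned


query = "APHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGACNNTISWK"
midline = "APHKLAPIRAGMAGASSP SRG LSESSGG GSPFNQS GACNNT SWK"

identical, positives, aligned = identity_stats(query, midline)
print(f"{identical}/{aligned} identical ({100 * identical / aligned:.1f}%)")
# 44/49 identical (89.8%)
```

Summing the three counts over every block of a hit and dividing identities by the total aligned length would give a whole-alignment percentage comparable to the value shown in each hit line.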

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L0T7 AT-hook motif nuclear-localized protein; e value: 1.0e-156; % identity: 87.18
Query:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS
        MSGSETGV++SGE F+IGLQK+ V SQQPV+Q MHLPFGADGVYKPVA ASPTYQSS VGVAGNAG+D S   AFVNMN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSS-KKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        MA+ PAV  AAATQSSGGFSP PTAAP SG SASPTSLKK RGRPPGSS KK  LD   SAG+GFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSS-KKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPV
        AISNVTLRQPAMSGGTVTYEGRFE+LSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV D GH EL+Q N IEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPV

Query:  TAPHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGAC-NNTISWK
        +APHKLAPIRAGM GASSP SRGTLSESSGGPGSPFNQSAGAC NNTI WK
Subjt:  TAPHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGAC-NNTISWK

A0A1S3BWC6 AT-hook motif nuclear-localized protein; e value: 1.4e-158; % identity: 87.46
Query:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS
        MSGSETGV++SGE F+IGLQK+ VQSQQPV+Q MHLPFGADGVYKPV AASPTYQSS VGVAGNAG+D S   AFVNMN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSS-KKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        M++ PAV  AAATQSSGGFSP PTAAP SGGS SPTSLKK RGRPPGSS KK QLD+  S G+GFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSS-KKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPV
        AISNVTLRQPAMSGGTVTYEGRFE+LSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DVGH EL+Q N IEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPV

Query:  TAPHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGAC-NNTISWK
        +APHKLAPIRAGM GASSP SRGTLSESSGGPGSPFNQSAGAC NNTI WK
Subjt:  TAPHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGAC-NNTISWK

A0A5A7VAQ2 AT-hook motif nuclear-localized protein; e value: 1.4e-158; % identity: 87.46
Query:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS
        MSGSETGV++SGE F+IGLQK+ VQSQQPV+Q MHLPFGADGVYKPV AASPTYQSS VGVAGNAG+D S   AFVNMN+QSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSS-KKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
        M++ PAV  AAATQSSGGFSP PTAAP SGGS SPTSLKK RGRPPGSS KK QLD+  S G+GFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG
Subjt:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSS-KKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANG

Query:  AISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPV
        AISNVTLRQPAMSGGTVTYEGRFE+LSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV DVGH EL+Q N IEQ PV
Subjt:  AISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPV

Query:  TAPHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGAC-NNTISWK
        +APHKLAPIRAGM GASSP SRGTLSESSGGPGSPFNQSAGAC NNTI WK
Subjt:  TAPHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGAC-NNTISWK

A0A6J1FR30 AT-hook motif nuclear-localized protein; e value: 3.4e-160; % identity: 87.68
Query:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS
        M+GSETGVMTSGEPF+IG QKSPVQSQQ VL G+HLPFGADGVYKP ++ SPTYQS  VGV+GNAG+DVS   AFV+MNTQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA
        MA+  A PSAAATQS GGFSPPPT   PSGGSASPT LKKARGRPPGS KKQQLDALGSAG+GFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANGA
Subjt:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA

Query:  ISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPVT
        ISNVTLRQPAMSGGTVTYEGRFE+LSLSG YLL+ENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV D GH ELKQAN IEQ+PVT
Subjt:  ISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPVT

Query:  APHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGACNNTISWK
        APHKLAPIRAGMAGASSPQSRG LSESSGG GSPFNQS GACNNT SWK
Subjt:  APHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGACNNTISWK

A0A6J1IXR2 AT-hook motif nuclear-localized protein; e value: 8.3e-159; % identity: 87.11
Query:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS
        M+GSETGVMTSGEPF+IG QKSPVQSQQ VL G+HLPFGADGVYKP ++ SPTYQS GVGV+GNAG+DVS   AFV MNTQSEPVKRKRGRPRKYGPDGS
Subjt:  MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVS---AFVNMNTQSEPVKRKRGRPRKYGPDGS

Query:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA
        MA+  A PSAAATQS GGFSPPPT   PSGGSASPT LKKARGRPPGS KKQQLDALGSAG+GFTPHVITVKAGEDVSSKIMS SQNGPRAVCIL+ANGA
Subjt:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGA

Query:  ISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPVT
        ISNVTLRQPAMSGGTVTYEGRFE+LSLSG YLL+ENGGQRSRTG LSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV D G  ELKQAN IEQ+PVT
Subjt:  ISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPVT

Query:  APHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGACNNTISWK
        APHKLAPIRAGM GASSPQSRG LSESSGG GSPFNQS GACNNT SWK
Subjt:  APHKLAPIRAGMAGASSPQSRGTLSESSGGPGSPFNQSAGACNNTISWK

SwissProt top hits (e value, % identity, alignment)
O22812 AT-hook motif nuclear-localized protein 10; e value: 6.9e-78; % identity: 54.09
Query:  MSGSETGVMTS---GEPFSIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDVSAFVNMN-----------TQ
        MSGSETG+M +      F++ L  Q+   Q+Q    Q   L FG D    +YK P+ + SP   YQ +  G         ++ +NMN           T 
Subjt:  MSGSETGVMTS---GEPFSIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDVSAFVNMN-----------TQ

Query:  SEPVKRKRGRPRKYGPDG---SMAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQ-QLDALGSAGIGFTPHVITVKAGEDV
        SEPVK++RGRPRKYGPD    S+ + P  PS   +Q            P SGG       +K RGRPPGSS K+ +L ALGS GIGFTPHV+TV AGEDV
Subjt:  SEPVKRKRGRPRKYGPDG---SMAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQ-QLDALGSAGIGFTPHVITVKAGEDV

Query:  SSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVG
        SSKIM+ + NGPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFE+LSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVG
Subjt:  SSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVG

Query:  SFVADVGHNELKQANHIEQQPVTAP--HKLAPIRAGMAGASSPQSRGTLSESS--GGPGSPFNQSAGA-CNNTIS--WK
        SF+ D G  E KQ  H+ Q  +++P   ++AP +  M   SSPQSRGT+SESS  GG GSP +QS G   NNTI+  WK
Subjt:  SFVADVGHNELKQANHIEQQPVTAP--HKLAPIRAGMAGASSPQSRGTLSESS--GGPGSPFNQSAGA-CNNTIS--WK

O80834 AT-hook motif nuclear-localized protein 9; e value: 8.8e-49; % identity: 44.91
Query:  GVMTSGEPFSIGLQKSPVQSQ-QPVLQGMHLPF--GADGVYKPVAAASPTYQSSGVGVAGNAGS-----DVSAFVNMNTQSE-PVKRKRGRPRKYGPDGS
        G+  SG P   G   SP Q Q    L   + PF  G+ G   P     P+  ++    AG AG+      V+        SE P+KRKRGRPRKYG DGS
Subjt:  GVMTSGEPFSIGLQKSPVQSQ-QPVLQGMHLPF--GADGVYKPVAAASPTYQSSGVGVAGNAGS-----DVSAFVNMNTQSE-PVKRKRGRPRKYGPDGS

Query:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQQLDALG-----SAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL
        +++      A ++ S    +P               S K+ RGRPPGS KKQ++ ++G     S+G+ FTPHVI V  GED++SK+++FSQ GPRA+C+L
Subjt:  MAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQQLDALG-----SAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCIL

Query:  TANGAISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV
        +A+GA+S  TL QP+ S G + YEGRFE+L+LS SY+++ +G  R+RTG LSVSL+ PDGRV+GG + G L AASPVQV+VGSF+
Subjt:  TANGAISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFV

Q8VYJ2 AT-hook motif nuclear-localized protein 1; e value: 1.7e-52; % identity: 51.06
Query:  VKRKRGRPRKYGPDGSMAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSA--SPTSLKKARGRPPGSSKK----QQLDALG-----SAGIGFTPHVITVKA
        +K+KRGRPRKYGPDG++  +   P ++A        P P+  PP          S K+++ +P  S  +     Q++ LG     S G  FTPH+ITV  
Subjt:  VKRKRGRPRKYGPDGSMAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSA--SPTSLKKARGRPPGSSKK----QQLDALG-----SAGIGFTPHVITVKA

Query:  GEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQ
        GEDV+ KI+SFSQ GPR++C+L+ANG IS+VTLRQP  SGGT+TYEGRFE+LSLSGS++ +++GG RSRTGG+SVSL+ PDGRV+GGG+AGLL AASPVQ
Subjt:  GEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQ

Query:  VVVGSFVADVGHNELKQANHIEQQPVTAPHKLAPI
        VVVGSF+A   H + K   +     +++P    PI
Subjt:  VVVGSFVADVGHNELKQANHIEQQPVTAPHKLAPI

Q940I0 AT-hook motif nuclear-localized protein 13; e value: 8.8e-49; % identity: 48.3
Query:  EPVKRKRGRPRKYGPDG----------SMAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSA--SPTSLKKARGRPPGSSKKQQLDAL-GSAGIGFTPHVI
        + VK+KRGRPRKY  DG          ++ + P  P  +A+ S GG +        +G +A  S    K+ RGRPPGS KK QLDAL G+ G+GFTPHVI
Subjt:  EPVKRKRGRPRKYGPDG----------SMAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSA--SPTSLKKARGRPPGSSKKQQLDAL-GSAGIGFTPHVI

Query:  TVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSG--GTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLT
         VK GED+++KI++F+  GPRA+CIL+A GA++NV LRQ   S   GTV YEGRFE++SLSGS+L SE+ G  ++TG LSVSL+G +GR++GG V G+L 
Subjt:  TVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSG--GTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLT

Query:  AASPVQVVVGSFVADVGHNELKQANHIEQ--QPVTAPHKLAPIRAGMAGASSPQSRGT--LSESS
        A S VQV+VGSFV D G  + + A   +   +P +AP  +     G+ G  SP+S+G    SESS
Subjt:  AASPVQVVVGSFVADVGHNELKQANHIEQ--QPVTAPHKLAPIRAGMAGASSPQSRGT--LSESS

Q9FIR1 AT-hook motif nuclear-localized protein 8; e value: 5.2e-49; % identity: 42.41
Query:  PVQSQQPVLQGMHLPFGAD------GVYKPVAAASPTYQSSGVGVAGNAGSDVSAFVNMNTQSEPVKRKRGRPRKYGPDGSMAMVPAVPSAAATQSSGGF
        P  + QP+ Q   LPFG           +       T +S G G    +   +   ++   Q   VK+KRGRPRKY PDGS+A+  A  S   + +S  +
Subjt:  PVQSQQPVLQGMHLPFGAD------GVYKPVAAASPTYQSSGVGVAGNAGSDVSAFVNMNTQSEPVKRKRGRPRKYGPDGSMAMVPAVPSAAATQSSGGF

Query:  SPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQQLDAL-GSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTY
                   G++    +K+ RGRPPGSSKK QLDAL G++G+GFTPHVI V  GED++SK+M+FS  G R +CIL+A+GA+S V LRQ + S G VTY
Subjt:  SPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQQLDAL-GSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTY

Query:  EGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQ---PVTAPHKLAPIRAGMAGA
        EGRFE+++LSGS L  E  G  +R+G LSV+L+GPDG ++GG V G L AA+ VQV+VGSFVA+    +    N    Q   P +AP  +    +   G 
Subjt:  EGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQ---PVTAPHKLAPIRAGMAGA

Query:  SSPQSRGTLSESSGGP
        SS  S       SG P
Subjt:  SSPQSRGTLSESSGGP

Arabidopsis top hits (e value, % identity, alignment)
AT2G33620.1 AT hook motif DNA-binding family protein; e value: 4.9e-79; % identity: 54.09
Query:  MSGSETGVMTS---GEPFSIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDVSAFVNMN-----------TQ
        MSGSETG+M +      F++ L  Q+   Q+Q    Q   L FG D    +YK P+ + SP   YQ +  G         ++ +NMN           T 
Subjt:  MSGSETGVMTS---GEPFSIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDVSAFVNMN-----------TQ

Query:  SEPVKRKRGRPRKYGPDG---SMAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQ-QLDALGSAGIGFTPHVITVKAGEDV
        SEPVK++RGRPRKYGPD    S+ + P  PS   +Q            P SGG       +K RGRPPGSS K+ +L ALGS GIGFTPHV+TV AGEDV
Subjt:  SEPVKRKRGRPRKYGPDG---SMAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQ-QLDALGSAGIGFTPHVITVKAGEDV

Query:  SSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVG
        SSKIM+ + NGPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFE+LSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVG
Subjt:  SSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVG

Query:  SFVADVGHNELKQANHIEQQPVTAP--HKLAPIRAGMAGASSPQSRGTLSESS--GGPGSPFNQSAGA-CNNTIS--WK
        SF+ D G  E KQ  H+ Q  +++P   ++AP +  M   SSPQSRGT+SESS  GG GSP +QS G   NNTI+  WK
Subjt:  SFVADVGHNELKQANHIEQQPVTAP--HKLAPIRAGMAGASSPQSRGTLSESS--GGPGSPFNQSAGA-CNNTIS--WK

AT2G33620.2 AT hook motif DNA-binding family protein; e value: 4.9e-79; % identity: 54.09
Query:  MSGSETGVMTS---GEPFSIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDVSAFVNMN-----------TQ
        MSGSETG+M +      F++ L  Q+   Q+Q    Q   L FG D    +YK P+ + SP   YQ +  G         ++ +NMN           T 
Subjt:  MSGSETGVMTS---GEPFSIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDVSAFVNMN-----------TQ

Query:  SEPVKRKRGRPRKYGPDG---SMAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQ-QLDALGSAGIGFTPHVITVKAGEDV
        SEPVK++RGRPRKYGPD    S+ + P  PS   +Q            P SGG       +K RGRPPGSS K+ +L ALGS GIGFTPHV+TV AGEDV
Subjt:  SEPVKRKRGRPRKYGPDG---SMAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQ-QLDALGSAGIGFTPHVITVKAGEDV

Query:  SSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVG
        SSKIM+ + NGPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFE+LSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVG
Subjt:  SSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVG

Query:  SFVADVGHNELKQANHIEQQPVTAP--HKLAPIRAGMAGASSPQSRGTLSESS--GGPGSPFNQSAGA-CNNTIS--WK
        SF+ D G  E KQ  H+ Q  +++P   ++AP +  M   SSPQSRGT+SESS  GG GSP +QS G   NNTI+  WK
Subjt:  SFVADVGHNELKQANHIEQQPVTAP--HKLAPIRAGMAGASSPQSRGTLSESS--GGPGSPFNQSAGA-CNNTIS--WK

AT2G33620.3 AT hook motif DNA-binding family protein; e value: 4.9e-79; % identity: 54.09
Query:  MSGSETGVMTS---GEPFSIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDVSAFVNMN-----------TQ
        MSGSETG+M +      F++ L  Q+   Q+Q    Q   L FG D    +YK P+ + SP   YQ +  G         ++ +NMN           T 
Subjt:  MSGSETGVMTS---GEPFSIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDVSAFVNMN-----------TQ

Query:  SEPVKRKRGRPRKYGPDG---SMAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQ-QLDALGSAGIGFTPHVITVKAGEDV
        SEPVK++RGRPRKYGPD    S+ + P  PS   +Q            P SGG       +K RGRPPGSS K+ +L ALGS GIGFTPHV+TV AGEDV
Subjt:  SEPVKRKRGRPRKYGPDG---SMAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQ-QLDALGSAGIGFTPHVITVKAGEDV

Query:  SSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVG
        SSKIM+ + NGPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFE+LSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVG
Subjt:  SSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVG

Query:  SFVADVGHNELKQANHIEQQPVTAP--HKLAPIRAGMAGASSPQSRGTLSESS--GGPGSPFNQSAGA-CNNTIS--WK
        SF+ D G  E KQ  H+ Q  +++P   ++AP +  M   SSPQSRGT+SESS  GG GSP +QS G   NNTI+  WK
Subjt:  SFVADVGHNELKQANHIEQQPVTAP--HKLAPIRAGMAGASSPQSRGTLSESS--GGPGSPFNQSAGA-CNNTIS--WK

AT2G33620.4 AT hook motif DNA-binding family protein; e value: 4.9e-79; % identity: 54.09
Query:  MSGSETGVMTS---GEPFSIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDVSAFVNMN-----------TQ
        MSGSETG+M +      F++ L  Q+   Q+Q    Q   L FG D    +YK P+ + SP   YQ +  G         ++ +NMN           T 
Subjt:  MSGSETGVMTS---GEPFSIGL--QKSPVQSQQPVLQGMHLPFGAD---GVYK-PVAAASP--TYQSSGVGVAGNAGSDVSAFVNMN-----------TQ

Query:  SEPVKRKRGRPRKYGPDG---SMAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQ-QLDALGSAGIGFTPHVITVKAGEDV
        SEPVK++RGRPRKYGPD    S+ + P  PS   +Q            P SGG       +K RGRPPGSS K+ +L ALGS GIGFTPHV+TV AGEDV
Subjt:  SEPVKRKRGRPRKYGPDG---SMAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQ-QLDALGSAGIGFTPHVITVKAGEDV

Query:  SSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVG
        SSKIM+ + NGPRAVC+L+ANGAISNVTLRQ A SGGTVTYEGRFE+LSLSGS+ L EN GQRSRTGGLSVSLS PDG VLGG VAGLL AASPVQ+VVG
Subjt:  SSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVG

Query:  SFVADVGHNELKQANHIEQQPVTAP--HKLAPIRAGMAGASSPQSRGTLSESS--GGPGSPFNQSAGA-CNNTIS--WK
        SF+ D G  E KQ  H+ Q  +++P   ++AP +  M   SSPQSRGT+SESS  GG GSP +QS G   NNTI+  WK
Subjt:  SFVADVGHNELKQANHIEQQPVTAP--HKLAPIRAGMAGASSPQSRGTLSESS--GGPGSPFNQSAGA-CNNTIS--WK

AT4G12080.1 AT-hook motif nuclear-localized protein 1; e value: 1.2e-53; % identity: 51.06
Query:  VKRKRGRPRKYGPDGSMAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSA--SPTSLKKARGRPPGSSKK----QQLDALG-----SAGIGFTPHVITVKA
        +K+KRGRPRKYGPDG++  +   P ++A        P P+  PP          S K+++ +P  S  +     Q++ LG     S G  FTPH+ITV  
Subjt:  VKRKRGRPRKYGPDGSMAMVPAVPSAAATQSSGGFSPPPTAAPPSGGSA--SPTSLKKARGRPPGSSKK----QQLDALG-----SAGIGFTPHVITVKA

Query:  GEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQ
        GEDV+ KI+SFSQ GPR++C+L+ANG IS+VTLRQP  SGGT+TYEGRFE+LSLSGS++ +++GG RSRTGG+SVSL+ PDGRV+GGG+AGLL AASPVQ
Subjt:  GEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFELLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQ

Query:  VVVGSFVADVGHNELKQANHIEQQPVTAPHKLAPI
        VVVGSF+A   H + K   +     +++P    PI
Subjt:  VVVGSFVADVGHNELKQANHIEQQPVTAPHKLAPI


Sequences
CDS sequence
ATGTCGGGATCTGAGACCGGAGTGATGACCAGCGGCGAACCCTTCTCCATCGGTCTCCAGAAGAGTCCAGTACAGTCACAGCAGCCAGTCTTGCAGGGCATGCATTTACC
CTTCGGCGCCGACGGCGTCTACAAGCCCGTCGCCGCCGCCTCGCCCACCTACCAATCCTCCGGCGTCGGAGTTGCTGGCAATGCCGGTTCCGATGTATCAGCTTTCGTTA
ACATGAATACGCAAAGCGAGCCAGTAAAGAGGAAGAGAGGGAGACCTAGGAAGTATGGGCCAGATGGCAGTATGGCAATGGTTCCTGCAGTCCCCTCCGCCGCCGCAACT
CAGTCAAGTGGAGGTTTTTCTCCTCCACCCACCGCCGCCCCTCCGTCGGGAGGATCAGCCTCTCCAACTTCTTTGAAGAAAGCAAGAGGCAGACCCCCTGGCTCTAGCAA
AAAGCAGCAGTTGGATGCTTTGGGGTCAGCAGGAATTGGGTTTACCCCACATGTCATCACCGTGAAAGCTGGAGAGGATGTATCTTCGAAAATAATGTCATTTTCACAGA
ATGGTCCTAGAGCTGTATGTATCCTTACAGCAAATGGAGCAATATCCAATGTGACACTACGTCAACCAGCCATGTCAGGTGGAACTGTGACTTATGAGGGGCGATTCGAG
CTTTTGTCACTATCTGGGTCGTATCTCCTCTCCGAGAATGGCGGTCAGCGGAGCCGAACTGGGGGGCTAAGTGTTTCATTGTCTGGACCAGATGGTAGAGTCTTAGGTGG
TGGGGTGGCTGGTCTTCTAACGGCAGCCTCTCCTGTTCAGGTGGTGGTGGGGAGCTTCGTCGCCGATGTGGGGCATAATGAATTGAAACAAGCAAACCATATAGAACAGC
AGCCTGTTACTGCACCACATAAACTTGCTCCGATCCGTGCTGGGATGGCGGGGGCGAGCAGTCCGCAATCACGTGGAACTCTCAGTGAATCCTCGGGAGGGCCAGGGAGT
CCGTTTAATCAGAGTGCTGGAGCCTGCAATAACACCATATCTTGGAAGTGA
mRNA sequence
ATGTCGGGATCTGAGACCGGAGTGATGACCAGCGGCGAACCCTTCTCCATCGGTCTCCAGAAGAGTCCAGTACAGTCACAGCAGCCAGTCTTGCAGGGCATGCATTTACC
CTTCGGCGCCGACGGCGTCTACAAGCCCGTCGCCGCCGCCTCGCCCACCTACCAATCCTCCGGCGTCGGAGTTGCTGGCAATGCCGGTTCCGATGTATCAGCTTTCGTTA
ACATGAATACGCAAAGCGAGCCAGTAAAGAGGAAGAGAGGGAGACCTAGGAAGTATGGGCCAGATGGCAGTATGGCAATGGTTCCTGCAGTCCCCTCCGCCGCCGCAACT
CAGTCAAGTGGAGGTTTTTCTCCTCCACCCACCGCCGCCCCTCCGTCGGGAGGATCAGCCTCTCCAACTTCTTTGAAGAAAGCAAGAGGCAGACCCCCTGGCTCTAGCAA
AAAGCAGCAGTTGGATGCTTTGGGGTCAGCAGGAATTGGGTTTACCCCACATGTCATCACCGTGAAAGCTGGAGAGGATGTATCTTCGAAAATAATGTCATTTTCACAGA
ATGGTCCTAGAGCTGTATGTATCCTTACAGCAAATGGAGCAATATCCAATGTGACACTACGTCAACCAGCCATGTCAGGTGGAACTGTGACTTATGAGGGGCGATTCGAG
CTTTTGTCACTATCTGGGTCGTATCTCCTCTCCGAGAATGGCGGTCAGCGGAGCCGAACTGGGGGGCTAAGTGTTTCATTGTCTGGACCAGATGGTAGAGTCTTAGGTGG
TGGGGTGGCTGGTCTTCTAACGGCAGCCTCTCCTGTTCAGGTGGTGGTGGGGAGCTTCGTCGCCGATGTGGGGCATAATGAATTGAAACAAGCAAACCATATAGAACAGC
AGCCTGTTACTGCACCACATAAACTTGCTCCGATCCGTGCTGGGATGGCGGGGGCGAGCAGTCCGCAATCACGTGGAACTCTCAGTGAATCCTCGGGAGGGCCAGGGAGT
CCGTTTAATCAGAGTGCTGGAGCCTGCAATAACACCATATCTTGGAAGTGA
Protein sequence
MSGSETGVMTSGEPFSIGLQKSPVQSQQPVLQGMHLPFGADGVYKPVAAASPTYQSSGVGVAGNAGSDVSAFVNMNTQSEPVKRKRGRPRKYGPDGSMAMVPAVPSAAAT
QSSGGFSPPPTAAPPSGGSASPTSLKKARGRPPGSSKKQQLDALGSAGIGFTPHVITVKAGEDVSSKIMSFSQNGPRAVCILTANGAISNVTLRQPAMSGGTVTYEGRFE
LLSLSGSYLLSENGGQRSRTGGLSVSLSGPDGRVLGGGVAGLLTAASPVQVVVGSFVADVGHNELKQANHIEQQPVTAPHKLAPIRAGMAGASSPQSRGTLSESSGGPGS
PFNQSAGACNNTISWK
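
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below is one way to do that with the standard genetic code (NCBI translation table 1), which is an assumption because this page does not state which table was used. `CODON_TABLE` and `translate` are hypothetical names introduced for illustration, and the two inputs are the first 30 nucleotides and the final line of the CDS as listed.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids.

    Whitespace and line breaks are ignored. Unknown codons become 'X'.
    With to_stop=True, translation ends before the first stop codon;
    otherwise stop codons are written as '*'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCGGGATCTGAGACCGGAGTGATGACC"))
# MSGSETGVMT
print(translate("CCGTTTAATCAGAGTGCTGGAGCCTGCAATAACACCATATCTTGGAAGTGA", to_stop=False))
# PFNQSAGACNNTISWK*
```

Both fragments agree with the start and end of the protein sequence shown above. Passing the full CDS as a single string should reproduce the whole protein in the same way.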