; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0014491 (gene) of Chayote v1 genome

Gene IDSed0014491
OrganismSechium edule (Chayote v1)
DescriptionAT-hook motif nuclear-localized protein
Genome locationLG05:1843743..1855447
RNA-Seq ExpressionSed0014491
SyntenySed0014491
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003680 - AT DNA binding (molecular function)
InterPro domainsIPR005175 - PPC domain
IPR017956 - AT hook, DNA-binding motif
IPR039605 - AT-hook motif nuclear-localized protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008465966.1 PREDICTED: AT-hook motif nuclear-localized protein 13-like [Cucumis melo]5.0e-11667.29Show/hide
Query:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA-------ASPYEGSHSGGFNVDSVKRKRGRPRK
        MDS+D+PPPPLSAPSNMA+G  TAYSP    AN NASST+ LN   + M+PPS+RF  N  + PPS+        SPY+GSHS  FNVDS K++RGRPRK
Subjt:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA-------ASPYEGSHSGGFNVDSVKRKRGRPRK

Query:  YAPE-GNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGT
        YAP+  NI+LGLA T T  +SV HGD  SA PDS +Q A+K RGRP GSGKKQ N++ SG  G  PHVLLAKPGEDVAAKILSFSQ   RTVFI+SANGT
Subjt:  YAPE-GNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGT

Query:  ISNVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKPNTSMLNSGFSNGVP
        +SN TLRH  +SGG VSYEG Y+II+LSGSFL+SENNGTRSRTGGLSVLLAG+ GQVLGGGVAGMLMA SQVQVI+GSF+EDDKK NTSMLNSG S+  P
Subjt:  ISNVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKPNTSMLNSGFSNGVP

Query:  SQMMNFGGGA----AVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ
        SQM+NFGGG     A AAASPP LG SSGESS ENG SPL+NRHP +FNN++     +Q      +Q+W A QTQQ
Subjt:  SQMMNFGGGA----AVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ

XP_022159894.1 AT-hook motif nuclear-localized protein 13-like [Momordica charantia]2.6e-11770Show/hide
Query:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA------ASPYEGSHSGGFNVDSVKRKRGRPRKY
        MDS+++PPPPLSA SNMAVG  TAYS AMSNAN NASST+GLNP  + MM P+ RF  N +IAP S        +PY+GSHSG FN+DS K+KRGRPRKY
Subjt:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA------ASPYEGSHSGGFNVDSVKRKRGRPRKY

Query:  APEGNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTIS
         P+GNI+LGLA T T A+SV HGD  +A PDS +Q AKKARGRP GSGKKQMNA  SG IG  PHV+L KPGEDVAAKI+SF+Q   R VFI+SANGT+S
Subjt:  APEGNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTIS

Query:  NVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKPNTSMLNSGFSNGVPSQ
        + TLRHP  SGG V+YEG YEII+LSGSFL+SENNGTRSRTGGLSVLLAG+ GQVLGGGVAGMLMA SQVQ+I+GSF+EDDKK N+SMLNS  S G P Q
Subjt:  NVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKPNTSMLNSGFSNGVPSQ

Query:  MMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ
        M+NFG  AA  AASPP LG SSGESSAENG SPL NRHP +F NN SQ I +   QMYHH +W A QTQQ
Subjt:  MMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ

XP_022931258.1 AT-hook motif nuclear-localized protein 13-like isoform X1 [Cucurbita moschata]1.1e-11868.73Show/hide
Query:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA------ASPYEGSHSGGFNVDSVKRKRGRPRKY
        MDS+D+ PP LSAPSNM VG PTAYSP MSNAN NASST+GLNP  + M+ PS RF  N +IAP S        SPY+GSHSG FN DS K+KRGRPRKY
Subjt:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA------ASPYEGSHSGGFNVDSVKRKRGRPRKY

Query:  APEGNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTIS
         P+GNI+LGLA T T A+SV HGD  S  PD  +Q AKKARGRP GSGKKQMNA+ S  +G  PHV+ AKPGEDVAAKIL+FSQ   RTVFI+SANG+IS
Subjt:  APEGNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTIS

Query:  NVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKP-NTSMLNSGFSNGVPS
        N TLRH   SGG V+YEG YEII+LSGSF++SENNGTRSRTGGLSVLLAG+ GQVLGGGVAGMLMA SQVQV++GSF+E+DKK  NT MLNSG S+  PS
Subjt:  NVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKP-NTSMLNSGFSNGVPS

Query:  QMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ
        QM+NFGG AA AAASPP LG SSGESSA+NGGSPL+NRHP +F+N++     MQ     +HQ+W A QTQQ
Subjt:  QMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ

XP_022995511.1 AT-hook motif nuclear-localized protein 13-like isoform X1 [Cucurbita maxima]1.3e-11969Show/hide
Query:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA------ASPYEGSHSGGFNVDSVKRKRGRPRKY
        MDS+D+ PP LSAPSNM VG PTAYSP MSNAN NASST+GLNP  + M+PPS RF  N +IAP S        SPY+GSHSG FN DS K+KRGRPRKY
Subjt:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA------ASPYEGSHSGGFNVDSVKRKRGRPRKY

Query:  APEGNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTIS
         P+GNI+LGLA T T A+SV HGD  S  PD  +Q AKKARGRP GSGKKQMNA+ S  +G  PHV+ AKPGEDVAAKIL+FSQ   RTVFI+SANG+IS
Subjt:  APEGNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTIS

Query:  NVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKP-NTSMLNSGFSNGVPS
        N TLRH   SGG V+YEG YEII+LSGSF++SENNGTRSRTGGLSVLLAG+ GQVLGGGVAGMLMA SQVQV++GSF+E+DKK  NT MLNSG S+  PS
Subjt:  NVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKP-NTSMLNSGFSNGVPS

Query:  QMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ
        QM+NFGG AA AAASPP LG SSGESSA+NGGSPL+NRHP +F+N++     MQ     +HQ+W A QTQQ
Subjt:  QMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ

XP_023532984.1 AT-hook motif nuclear-localized protein 13-like isoform X1 [Cucurbita pepo subsp. pepo]1.7e-11969Show/hide
Query:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA------ASPYEGSHSGGFNVDSVKRKRGRPRKY
        MDS+D+ PP LSAPSNM VG PTAYSP MSNAN NASST+GLNP  + M+PPS RF  N +IAP S        SPY+GSHSG FN DS K+KRGRPRKY
Subjt:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA------ASPYEGSHSGGFNVDSVKRKRGRPRKY

Query:  APEGNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTIS
         P+GNI+LGLA T T A+SV HGD  S  PD  +Q AKKARGRP GSGKKQMNA+ S  +G  PHV+ AKPGEDVAAKIL+FSQ   RTVFI+SANG+IS
Subjt:  APEGNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTIS

Query:  NVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKP-NTSMLNSGFSNGVPS
        N TLRH   SGG V+YEG YEII+LSGSF++SENNGTRSRTGGLSVLLAG+ GQVLGGGVAGMLMA SQVQV++GSF+E+DKK  NT MLNSG S+  PS
Subjt:  NVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKP-NTSMLNSGFSNGVPS

Query:  QMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ
        QM+NFGG AA AAASPP LG SSGESSA+NGGSPL+NRHP +F+N++     MQ     +HQ+W A QTQQ
Subjt:  QMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ

TrEMBL top hitse value%identityAlignment
A0A0A0LED1 AT-hook motif nuclear-localized protein9.2e-11667.02Show/hide
Query:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA-------ASPYEGSHSGGFNVDSVKRKRGRPRK
        MDS+D+PPPPLSAPSNMAVG   AYSP    AN NASST+ LN   + M+PPS+RF  N  + PPS+        SPY+GSHS  FNVDS K++RGRPRK
Subjt:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA-------ASPYEGSHSGGFNVDSVKRKRGRPRK

Query:  YAPE-GNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGT
        YAP+  NI+LGLA T T A+S+ HGD  +A PDS +Q A+K RGRP GSGKKQ N++ SG  G  PHVLLAKPGEDVAAKILSFSQ   RTVFI+SANGT
Subjt:  YAPE-GNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGT

Query:  ISNVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKPNTSMLNSGFSNGVP
        +SN TLRH  +SGG VSYEG Y+II+LSGSFL+SENNGTRSRTGGLSVLLAG+ GQVLGGGVAGMLMA SQVQVI+GSF+EDDKK NTSMLNSG S+  P
Subjt:  ISNVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKPNTSMLNSGFSNGVP

Query:  SQMMNFGGGA----AVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ
        SQM+NFGGG     A AAASPP LG SSGESS ENG SPL+NRHP +FNN++     +Q      +Q+W A QTQQ
Subjt:  SQMMNFGGGA----AVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ

A0A1S3CQG0 AT-hook motif nuclear-localized protein2.4e-11667.29Show/hide
Query:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA-------ASPYEGSHSGGFNVDSVKRKRGRPRK
        MDS+D+PPPPLSAPSNMA+G  TAYSP    AN NASST+ LN   + M+PPS+RF  N  + PPS+        SPY+GSHS  FNVDS K++RGRPRK
Subjt:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA-------ASPYEGSHSGGFNVDSVKRKRGRPRK

Query:  YAPE-GNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGT
        YAP+  NI+LGLA T T  +SV HGD  SA PDS +Q A+K RGRP GSGKKQ N++ SG  G  PHVLLAKPGEDVAAKILSFSQ   RTVFI+SANGT
Subjt:  YAPE-GNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGT

Query:  ISNVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKPNTSMLNSGFSNGVP
        +SN TLRH  +SGG VSYEG Y+II+LSGSFL+SENNGTRSRTGGLSVLLAG+ GQVLGGGVAGMLMA SQVQVI+GSF+EDDKK NTSMLNSG S+  P
Subjt:  ISNVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKPNTSMLNSGFSNGVP

Query:  SQMMNFGGGA----AVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ
        SQM+NFGGG     A AAASPP LG SSGESS ENG SPL+NRHP +FNN++     +Q      +Q+W A QTQQ
Subjt:  SQMMNFGGGA----AVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ

A0A6J1E3M0 AT-hook motif nuclear-localized protein1.3e-11770Show/hide
Query:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA------ASPYEGSHSGGFNVDSVKRKRGRPRKY
        MDS+++PPPPLSA SNMAVG  TAYS AMSNAN NASST+GLNP  + MM P+ RF  N +IAP S        +PY+GSHSG FN+DS K+KRGRPRKY
Subjt:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA------ASPYEGSHSGGFNVDSVKRKRGRPRKY

Query:  APEGNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTIS
         P+GNI+LGLA T T A+SV HGD  +A PDS +Q AKKARGRP GSGKKQMNA  SG IG  PHV+L KPGEDVAAKI+SF+Q   R VFI+SANGT+S
Subjt:  APEGNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTIS

Query:  NVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKPNTSMLNSGFSNGVPSQ
        + TLRHP  SGG V+YEG YEII+LSGSFL+SENNGTRSRTGGLSVLLAG+ GQVLGGGVAGMLMA SQVQ+I+GSF+EDDKK N+SMLNS  S G P Q
Subjt:  NVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKPNTSMLNSGFSNGVPSQ

Query:  MMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ
        M+NFG  AA  AASPP LG SSGESSAENG SPL NRHP +F NN SQ I +   QMYHH +W A QTQQ
Subjt:  MMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ

A0A6J1EXY5 AT-hook motif nuclear-localized protein5.2e-11968.73Show/hide
Query:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA------ASPYEGSHSGGFNVDSVKRKRGRPRKY
        MDS+D+ PP LSAPSNM VG PTAYSP MSNAN NASST+GLNP  + M+ PS RF  N +IAP S        SPY+GSHSG FN DS K+KRGRPRKY
Subjt:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA------ASPYEGSHSGGFNVDSVKRKRGRPRKY

Query:  APEGNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTIS
         P+GNI+LGLA T T A+SV HGD  S  PD  +Q AKKARGRP GSGKKQMNA+ S  +G  PHV+ AKPGEDVAAKIL+FSQ   RTVFI+SANG+IS
Subjt:  APEGNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTIS

Query:  NVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKP-NTSMLNSGFSNGVPS
        N TLRH   SGG V+YEG YEII+LSGSF++SENNGTRSRTGGLSVLLAG+ GQVLGGGVAGMLMA SQVQV++GSF+E+DKK  NT MLNSG S+  PS
Subjt:  NVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKP-NTSMLNSGFSNGVPS

Query:  QMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ
        QM+NFGG AA AAASPP LG SSGESSA+NGGSPL+NRHP +F+N++     MQ     +HQ+W A QTQQ
Subjt:  QMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ

A0A6J1K244 AT-hook motif nuclear-localized protein6.1e-12069Show/hide
Query:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA------ASPYEGSHSGGFNVDSVKRKRGRPRKY
        MDS+D+ PP LSAPSNM VG PTAYSP MSNAN NASST+GLNP  + M+PPS RF  N +IAP S        SPY+GSHSG FN DS K+KRGRPRKY
Subjt:  MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSA------ASPYEGSHSGGFNVDSVKRKRGRPRKY

Query:  APEGNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTIS
         P+GNI+LGLA T T A+SV HGD  S  PD  +Q AKKARGRP GSGKKQMNA+ S  +G  PHV+ AKPGEDVAAKIL+FSQ   RTVFI+SANG+IS
Subjt:  APEGNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTIS

Query:  NVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKP-NTSMLNSGFSNGVPS
        N TLRH   SGG V+YEG YEII+LSGSF++SENNGTRSRTGGLSVLLAG+ GQVLGGGVAGMLMA SQVQV++GSF+E+DKK  NT MLNSG S+  PS
Subjt:  NVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKP-NTSMLNSGFSNGVPS

Query:  QMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ
        QM+NFGG AA AAASPP LG SSGESSA+NGGSPL+NRHP +F+N++     MQ     +HQ+W A QTQQ
Subjt:  QMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ

SwissProt top hitse value%identityAlignment
O22812 AT-hook motif nuclear-localized protein 108.4e-4243.17Show/hide
Query:  GSHSGGF---NVDSVKRKRGRPRKYAPE-GNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKK--QMNALDSGDIGCVPHVLLAKPG
        G  SGG      + VK++RGRPRKY P+ G +SLGL   A           + + P S     +K RGRP GS  K  ++ AL S  IG  PHVL    G
Subjt:  GSHSGGF---NVDSVKRKRGRPRKYAPE-GNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKK--QMNALDSGDIGCVPHVLLAKPG

Query:  EDVAAKILSFSQLEQRTVFIISANGTISNVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQV
        EDV++KI++ +    R V ++SANG ISNVTLR    SGG V+YEG +EI++LSGSF + ENNG RSRTGGLSV L+   G VLGG VAG+L+A S VQ+
Subjt:  EDVAAKILSFSQLEQRTVFIISANGTISNVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQV

Query:  IMGSFVED-DKKPNTSMLNSGFSNGV-----PSQMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNN
        ++GSF+ D +K+P   +   G S+ V     P+Q++       +  +SP   G  S  S     GSP+       +NN
Subjt:  IMGSFVED-DKKPNTSMLNSGFSNGV-----PSQMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNN

Q8GXB3 AT-hook motif nuclear-localized protein 51.7e-3436.26Show/hide
Query:  IDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSAASPYEGSHSGGFNVDSVKRKRGRPRKYAPEGNISLG
        + +PPPP   P    +  P  + P  SN     S     +    H        S++  +A P+A  P             VK+KRGRPRKY P+G +SLG
Subjt:  IDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSAASPYEGSHSGGFNVDSVKRKRGRPRKYAPEGNISLG

Query:  LATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNA------LDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTISNVT
        L+       S    DSSS    S     K+ARGRP G+G+KQ  A        S  +   PHV+    GED+ +K+LSFSQ   R + I+S  GT+S+VT
Subjt:  LATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNA------LDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTISNVT

Query:  LRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKPNTSMLNSGFSNGVP-----
        LR P ++   +++EG +EI++L GS+LV+E  G++SRTGGLSV L+G  G V+GGG+ GML+A S VQV+  SFV      + +  N      +      
Subjt:  LRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKPNTSMLNSGFSNGVP-----

Query:  --SQMMNFGGGAAVAAASP--------PPLGPSSGESSAENGGSPLD-NRHPV
          S+M    G A  AAAS         P  G S    S    G  LD +R+P+
Subjt:  --SQMMNFGGGAAVAAASP--------PPLGPSSGESSAENGGSPLD-NRHPV

Q8VYJ2 AT-hook motif nuclear-localized protein 12.6e-3537.32Show/hide
Query:  APTAYSPAMSNANGNASSTVGLNPVPS----HMMPPSTRFSLNPMIAPPSAASPYEGSHSGGFNVDSVKRKRGRPRKYAPEGNISLGLATTATRAASVAH
        AP+ +  A  + + N S T    P P     H  PP  + S    +   +  +  EG  SGG     +K+KRGRPRKY P+G +        + A + +H
Subjt:  APTAYSPAMSNANGNASSTVGLNPVPS----HMMPPSTRFSLNPMIAPPSAASPYEGSHSGGFNVDSVKRKRGRPRKYAPEGNISLGLATTATRAASVAH

Query:  GDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIG----------CVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTISNVTLRHPEASGG
            S+         K+++ +P  S  +        ++G            PH++    GEDV  KI+SFSQ   R++ ++SANG IS+VTLR P++SGG
Subjt:  GDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIG----------CVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTISNVTLRHPEASGG

Query:  CVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFV----EDDKKP
         ++YEG +EI++LSGSF+ +++ GTRSRTGG+SV LA   G+V+GGG+AG+L+A S VQV++GSF+      D+KP
Subjt:  CVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFV----EDDKKP

Q940I0 AT-hook motif nuclear-localized protein 131.1e-4642.09Show/hide
Query:  NPVPSHMMPPSTRFSLNPMIAPPSAASPYEGSHSGGFNVDSVKRKRGRPRKYAPEG--------NISLGLA-TTATRAASVAH---------GDSSSAAP
        +P P   +   T  SL    +P S A+  + S   G +   VK+KRGRPRKYA +G        NI+LGLA T+   +AS ++         GDS+ A  
Subjt:  NPVPSHMMPPSTRFSLNPMIAPPSAASPYEGSHSGGFNVDSVKRKRGRPRKYAPEG--------NISLGLA-TTATRAASVAH---------GDSSSAAP

Query:  DSQQQQAKKARGRPRGSGKKQMNAL-DSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTISNVTLRHPEASG--GCVSYEGLYEIITLSG
        +S    AK+ RGRP GSGKKQ++AL  +G +G  PHV+  K GED+A KIL+F+    R + I+SA G ++NV LR    S   G V YEG +EII+LSG
Subjt:  DSQQQQAKKARGRPRGSGKKQMNAL-DSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTISNVTLRHPEASG--GCVSYEGLYEIITLSG

Query:  SFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKPNTS---MLNSGFSNGVPSQMMNFGGGAAVAAASPPPLGPS-SG
        SFL SE+NGT ++TG LSV LAG  G+++GG V GML+A SQVQVI+GSFV D +K   S     N+      P+ M++FGG       SP   G   S 
Subjt:  SFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKPNTS---MLNSGFSNGVPSQMMNFGGGAAVAAASPPPLGPS-SG

Query:  ESSAEN-GGSPLDNR-------HPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQ
        ESS EN   SPL  R       +  +F N+  Q +     QMY + +W  +  Q
Subjt:  ESSAEN-GGSPLDNR-------HPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQ

Q9FIR1 AT-hook motif nuclear-localized protein 82.5e-4644.6Show/hide
Query:  VKRKRGRPRKYAPEGNISLGLATTA--TRAASVAHGD----SSSAAPDSQQQQAKKARGRPRGSGKKQMNAL-DSGDIGCVPHVLLAKPGEDVAAKILSF
        VK+KRGRPRKY P+G+I+LGLA T+    AAS ++G+     S    +S     K+ RGRP GS KKQ++AL  +  +G  PHV+    GED+A+K+++F
Subjt:  VKRKRGRPRKYAPEGNISLGLATTA--TRAASVAHGD----SSSAAPDSQQQQAKKARGRPRGSGKKQMNAL-DSGDIGCVPHVLLAKPGEDVAAKILSF

Query:  SQLEQRTVFIISANGTISNVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDK
        S    RT+ I+SA+G +S V LR    S G V+YEG +EIITLSGS L  E NG+ +R+G LSV LAG  G ++GG V G L+A +QVQVI+GSFV + K
Subjt:  SQLEQRTVFIISANGTISNVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDK

Query:  KPNTSMLNSGFSN-----GVPSQMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQ--MYHHQI
        KP  S +N            P+ M+NFG  +          GPSS  S     GSP  +R     NNN     Q QQQQ  ++ HQ+
Subjt:  KPNTSMLNSGFSN-----GVPSQMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQ--MYHHQI

Arabidopsis top hitse value%identityAlignment
AT2G33620.1 AT hook motif DNA-binding family protein6.0e-4343.17Show/hide
Query:  GSHSGGF---NVDSVKRKRGRPRKYAPE-GNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKK--QMNALDSGDIGCVPHVLLAKPG
        G  SGG      + VK++RGRPRKY P+ G +SLGL   A           + + P S     +K RGRP GS  K  ++ AL S  IG  PHVL    G
Subjt:  GSHSGGF---NVDSVKRKRGRPRKYAPE-GNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKK--QMNALDSGDIGCVPHVLLAKPG

Query:  EDVAAKILSFSQLEQRTVFIISANGTISNVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQV
        EDV++KI++ +    R V ++SANG ISNVTLR    SGG V+YEG +EI++LSGSF + ENNG RSRTGGLSV L+   G VLGG VAG+L+A S VQ+
Subjt:  EDVAAKILSFSQLEQRTVFIISANGTISNVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQV

Query:  IMGSFVED-DKKPNTSMLNSGFSNGV-----PSQMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNN
        ++GSF+ D +K+P   +   G S+ V     P+Q++       +  +SP   G  S  S     GSP+       +NN
Subjt:  IMGSFVED-DKKPNTSMLNSGFSNGV-----PSQMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNN

AT2G33620.2 AT hook motif DNA-binding family protein6.0e-4343.17Show/hide
Query:  GSHSGGF---NVDSVKRKRGRPRKYAPE-GNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKK--QMNALDSGDIGCVPHVLLAKPG
        G  SGG      + VK++RGRPRKY P+ G +SLGL   A           + + P S     +K RGRP GS  K  ++ AL S  IG  PHVL    G
Subjt:  GSHSGGF---NVDSVKRKRGRPRKYAPE-GNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKK--QMNALDSGDIGCVPHVLLAKPG

Query:  EDVAAKILSFSQLEQRTVFIISANGTISNVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQV
        EDV++KI++ +    R V ++SANG ISNVTLR    SGG V+YEG +EI++LSGSF + ENNG RSRTGGLSV L+   G VLGG VAG+L+A S VQ+
Subjt:  EDVAAKILSFSQLEQRTVFIISANGTISNVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQV

Query:  IMGSFVED-DKKPNTSMLNSGFSNGV-----PSQMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNN
        ++GSF+ D +K+P   +   G S+ V     P+Q++       +  +SP   G  S  S     GSP+       +NN
Subjt:  IMGSFVED-DKKPNTSMLNSGFSNGV-----PSQMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNN

AT2G33620.3 AT hook motif DNA-binding family protein6.0e-4343.17Show/hide
Query:  GSHSGGF---NVDSVKRKRGRPRKYAPE-GNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKK--QMNALDSGDIGCVPHVLLAKPG
        G  SGG      + VK++RGRPRKY P+ G +SLGL   A           + + P S     +K RGRP GS  K  ++ AL S  IG  PHVL    G
Subjt:  GSHSGGF---NVDSVKRKRGRPRKYAPE-GNISLGLATTATRAASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKK--QMNALDSGDIGCVPHVLLAKPG

Query:  EDVAAKILSFSQLEQRTVFIISANGTISNVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQV
        EDV++KI++ +    R V ++SANG ISNVTLR    SGG V+YEG +EI++LSGSF + ENNG RSRTGGLSV L+   G VLGG VAG+L+A S VQ+
Subjt:  EDVAAKILSFSQLEQRTVFIISANGTISNVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQV

Query:  IMGSFVED-DKKPNTSMLNSGFSNGV-----PSQMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNN
        ++GSF+ D +K+P   +   G S+ V     P+Q++       +  +SP   G  S  S     GSP+       +NN
Subjt:  IMGSFVED-DKKPNTSMLNSGFSNGV-----PSQMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNN

AT4G17950.1 AT hook motif DNA-binding family protein8.1e-4842.09Show/hide
Query:  NPVPSHMMPPSTRFSLNPMIAPPSAASPYEGSHSGGFNVDSVKRKRGRPRKYAPEG--------NISLGLA-TTATRAASVAH---------GDSSSAAP
        +P P   +   T  SL    +P S A+  + S   G +   VK+KRGRPRKYA +G        NI+LGLA T+   +AS ++         GDS+ A  
Subjt:  NPVPSHMMPPSTRFSLNPMIAPPSAASPYEGSHSGGFNVDSVKRKRGRPRKYAPEG--------NISLGLA-TTATRAASVAH---------GDSSSAAP

Query:  DSQQQQAKKARGRPRGSGKKQMNAL-DSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTISNVTLRHPEASG--GCVSYEGLYEIITLSG
        +S    AK+ RGRP GSGKKQ++AL  +G +G  PHV+  K GED+A KIL+F+    R + I+SA G ++NV LR    S   G V YEG +EII+LSG
Subjt:  DSQQQQAKKARGRPRGSGKKQMNAL-DSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTISNVTLRHPEASG--GCVSYEGLYEIITLSG

Query:  SFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKPNTS---MLNSGFSNGVPSQMMNFGGGAAVAAASPPPLGPS-SG
        SFL SE+NGT ++TG LSV LAG  G+++GG V GML+A SQVQVI+GSFV D +K   S     N+      P+ M++FGG       SP   G   S 
Subjt:  SFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKPNTS---MLNSGFSNGVPSQMMNFGGGAAVAAASPPPLGPS-SG

Query:  ESSAEN-GGSPLDNR-------HPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQ
        ESS EN   SPL  R       +  +F N+  Q +     QMY + +W  +  Q
Subjt:  ESSAEN-GGSPLDNR-------HPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQ

AT5G46640.1 AT hook motif DNA-binding family protein1.8e-4744.6Show/hide
Query:  VKRKRGRPRKYAPEGNISLGLATTA--TRAASVAHGD----SSSAAPDSQQQQAKKARGRPRGSGKKQMNAL-DSGDIGCVPHVLLAKPGEDVAAKILSF
        VK+KRGRPRKY P+G+I+LGLA T+    AAS ++G+     S    +S     K+ RGRP GS KKQ++AL  +  +G  PHV+    GED+A+K+++F
Subjt:  VKRKRGRPRKYAPEGNISLGLATTA--TRAASVAHGD----SSSAAPDSQQQQAKKARGRPRGSGKKQMNAL-DSGDIGCVPHVLLAKPGEDVAAKILSF

Query:  SQLEQRTVFIISANGTISNVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDK
        S    RT+ I+SA+G +S V LR    S G V+YEG +EIITLSGS L  E NG+ +R+G LSV LAG  G ++GG V G L+A +QVQVI+GSFV + K
Subjt:  SQLEQRTVFIISANGTISNVTLRHPEASGGCVSYEGLYEIITLSGSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDK

Query:  KPNTSMLNSGFSN-----GVPSQMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQ--MYHHQI
        KP  S +N            P+ M+NFG  +          GPSS  S     GSP  +R     NNN     Q QQQQ  ++ HQ+
Subjt:  KPNTSMLNSGFSN-----GVPSQMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDNRHPVVFNNNNSQAIQMQQQQ--MYHHQI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCAATCGATTCTCCTCCGCCGCCGCTTTCGGCGCCGTCCAACATGGCCGTCGGAGCACCGACGGCGTATTCGCCGGCGATGTCCAACGCCAACGGCAACGCCTC
TTCGACGGTAGGTTTGAATCCGGTTCCATCTCATATGATGCCGCCTTCTACGCGATTTTCGCTTAACCCTATGATCGCCCCTCCATCCGCTGCTTCTCCGTACGAAGGAT
CGCATTCCGGAGGTTTTAACGTCGATTCCGTCAAGAGGAAGCGAGGCCGGCCGAGGAAGTACGCGCCTGAAGGCAACATTTCCTTGGGCTTGGCTACTACTGCTACTCGT
GCGGCTTCTGTTGCTCACGGCGATTCGAGCTCGGCCGCTCCCGATTCGCAGCAGCAGCAGGCGAAGAAGGCGAGGGGACGGCCGCGGGGCTCCGGAAAGAAACAGATGAA
TGCACTTGATTCTGGCGATATTGGTTGTGTTCCTCACGTTCTATTGGCAAAGCCTGGAGAGGATGTAGCAGCCAAAATTTTGTCTTTCTCACAGCTAGAACAACGAACAG
TCTTTATTATCTCTGCAAATGGGACCATCAGTAATGTTACCCTTCGGCACCCGGAAGCATCTGGTGGTTGTGTTTCATATGAGGGGCTGTATGAGATAATCACTCTGTCG
GGGTCGTTTTTGGTATCGGAGAATAATGGAACTCGAAGTAGAACAGGTGGTTTGAGTGTGTTGCTGGCTGGGACAGGCGGACAGGTTCTTGGTGGTGGAGTTGCAGGAAT
GCTAATGGCATGTTCCCAAGTACAGGTGATTATGGGAAGTTTTGTAGAGGATGATAAAAAACCCAACACAAGCATGCTAAATTCTGGGTTTTCGAATGGTGTGCCATCCC
AAATGATGAACTTCGGTGGTGGAGCGGCAGTAGCAGCAGCCAGCCCTCCGCCGTTAGGCCCATCGAGCGGGGAGTCGTCTGCTGAAAACGGAGGCAGCCCTCTTGATAAT
AGACATCCTGTTGTGTTCAATAATAACAACAGCCAGGCGATCCAAATGCAACAGCAGCAGATGTACCACCACCAAATATGGGCAGCAAGCCAAACACAGCAGTGA
mRNA sequenceShow/hide mRNA sequence
AATTTCCATAAATTGATTTTCGTTCTTCAGTTTTTGCAACTTCCTTTACGAACTCTTCACTTCTTCTTCTTCATTGGCGCTGAACTCTCACATTTCTCAGAATCTGAACA
ACATTTACGAGAATTTTCGATTCTGAACTCTCTCTGTTTCTCGATTCGGTGCTTTCATGGATTCAATCGATTCTCCTCCGCCGCCGCTTTCGGCGCCGTCCAACATGGCC
GTCGGAGCACCGACGGCGTATTCGCCGGCGATGTCCAACGCCAACGGCAACGCCTCTTCGACGGTAGGTTTGAATCCGGTTCCATCTCATATGATGCCGCCTTCTACGCG
ATTTTCGCTTAACCCTATGATCGCCCCTCCATCCGCTGCTTCTCCGTACGAAGGATCGCATTCCGGAGGTTTTAACGTCGATTCCGTCAAGAGGAAGCGAGGCCGGCCGA
GGAAGTACGCGCCTGAAGGCAACATTTCCTTGGGCTTGGCTACTACTGCTACTCGTGCGGCTTCTGTTGCTCACGGCGATTCGAGCTCGGCCGCTCCCGATTCGCAGCAG
CAGCAGGCGAAGAAGGCGAGGGGACGGCCGCGGGGCTCCGGAAAGAAACAGATGAATGCACTTGATTCTGGCGATATTGGTTGTGTTCCTCACGTTCTATTGGCAAAGCC
TGGAGAGGATGTAGCAGCCAAAATTTTGTCTTTCTCACAGCTAGAACAACGAACAGTCTTTATTATCTCTGCAAATGGGACCATCAGTAATGTTACCCTTCGGCACCCGG
AAGCATCTGGTGGTTGTGTTTCATATGAGGGGCTGTATGAGATAATCACTCTGTCGGGGTCGTTTTTGGTATCGGAGAATAATGGAACTCGAAGTAGAACAGGTGGTTTG
AGTGTGTTGCTGGCTGGGACAGGCGGACAGGTTCTTGGTGGTGGAGTTGCAGGAATGCTAATGGCATGTTCCCAAGTACAGGTGATTATGGGAAGTTTTGTAGAGGATGA
TAAAAAACCCAACACAAGCATGCTAAATTCTGGGTTTTCGAATGGTGTGCCATCCCAAATGATGAACTTCGGTGGTGGAGCGGCAGTAGCAGCAGCCAGCCCTCCGCCGT
TAGGCCCATCGAGCGGGGAGTCGTCTGCTGAAAACGGAGGCAGCCCTCTTGATAATAGACATCCTGTTGTGTTCAATAATAACAACAGCCAGGCGATCCAAATGCAACAG
CAGCAGATGTACCACCACCAAATATGGGCAGCAAGCCAAACACAGCAGTGAGGATTTTATGATTTTGCCAAAAGCATTTTAAATAGAAGGGCTTATGTTAATTTATTTAG
GTTTATTATTTTTAAGATTATTTCAATCTTGCCTTTTTCTGTTTTTCTCCTCCTTTCTACAGCATTGTAGTTCCTCATTTAATTTTATGATTAAATAATTTGAGATTTAT
GTTGCAGAATTTCTAAAGCTTGTCAATGTTCTGTGTATATTTTAACAAGATAGTGAAGACGCCACATTTTCCTTTCTCT
Protein sequenceShow/hide protein sequence
MDSIDSPPPPLSAPSNMAVGAPTAYSPAMSNANGNASSTVGLNPVPSHMMPPSTRFSLNPMIAPPSAASPYEGSHSGGFNVDSVKRKRGRPRKYAPEGNISLGLATTATR
AASVAHGDSSSAAPDSQQQQAKKARGRPRGSGKKQMNALDSGDIGCVPHVLLAKPGEDVAAKILSFSQLEQRTVFIISANGTISNVTLRHPEASGGCVSYEGLYEIITLS
GSFLVSENNGTRSRTGGLSVLLAGTGGQVLGGGVAGMLMACSQVQVIMGSFVEDDKKPNTSMLNSGFSNGVPSQMMNFGGGAAVAAASPPPLGPSSGESSAENGGSPLDN
RHPVVFNNNNSQAIQMQQQQMYHHQIWAASQTQQ