CuGenDBv2

MC08g2360 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC08g2360
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: AT-hook motif nuclear-localized protein
Genome location: MC08:32444426..32452052
RNA-Seq Expression: MC08g2360
Synteny: MC08g2360
Gene Ontology terms:
    GO:0005634 - nucleus (cellular component)
    GO:0003680 - AT DNA binding (molecular function)
InterPro domains:
    IPR005175 - PPC domain
    IPR039605 - AT-hook motif nuclear-localized protein


Homology
GenBank top hits (e value, % identity, alignment)
XP_008456410.1 PREDICTED: AT-hook motif nuclear-localized protein 14 isoform X1 [Cucumis melo] (e value: 4.76e-221; % identity: 88.86)
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQ+PTTT  SPTNGLLPPTHHLS+ AA++D G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
        SSSSKAK++   +SSLNA+SASSS FS PSKKSQLAALDVGNAGQ FAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA SGGNI 
Subjt:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT

Query:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDS
        YEGRFEIVSLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+ GGG KGD SAGKLPSP+GGTSMS LRYGS IDS
Subjt:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDS

Query:  GGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        GGNQ+RGNDEHQG+GE HFLLQPRGVNLTS R TDWR GLDATN+AYDL+GRTSHHSPENGDYDQIPD
Subjt:  GGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

XP_008456418.1 PREDICTED: AT-hook motif nuclear-localized protein 14 isoform X2 [Cucumis melo] (e value: 1.41e-217; % identity: 88.32)
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQ+PTTT  SPTNGLLPPTHHLS+ AA++D G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
        SSSSKAK++   +SSLNA+SASSS FS PSKKSQLAAL  GNAGQ FAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA SGGNI 
Subjt:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT

Query:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDS
        YEGRFEIVSLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+ GGG KGD SAGKLPSP+GGTSMS LRYGS IDS
Subjt:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDS

Query:  GGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        GGNQ+RGNDEHQG+GE HFLLQPRGVNLTS R TDWR GLDATN+AYDL+GRTSHHSPENGDYDQIPD
Subjt:  GGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

XP_022143141.1 AT-hook motif nuclear-localized protein 14-like isoform X1 [Momordica charantia] (e value: 1.38e-255; % identity: 100)
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITY
        SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITY
Subjt:  SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITY

Query:  EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSG
        EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSG
Subjt:  EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSG

Query:  GNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        GNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
Subjt:  GNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

XP_022143142.1 AT-hook motif nuclear-localized protein 14-like isoform X2 [Momordica charantia] (e value: 4.09e-252; % identity: 99.46)
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITY
        SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAAL  GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITY
Subjt:  SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITY

Query:  EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSG
        EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSG
Subjt:  EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSG

Query:  GNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        GNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
Subjt:  GNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

XP_038890429.1 AT-hook motif nuclear-localized protein 14 [Benincasa hispida] (e value: 2.87e-219; % identity: 89.67)
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQSPTTT  SPTNGLLPPTHHLSS AAA+D G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
        SSSSKAK++   +SSLNA+SASSS FSAPSKKSQLA L  GNAGQ FAPHVINVAAGEDVGQKIM+FMQQCKREICILSASGSISNASLRQPA SGGNI 
Subjt:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT

Query:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDS
        YEGRFEIVSLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+VGGG KGDASAGKLPSP+GGTSMS LRYGS IDS
Subjt:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDS

Query:  GGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        GGNQ+RGNDEHQG+GE HFLLQPRGVNLTS R TDWR GLDATN+AYDLTGRT HHSPENGDYDQIPD
Subjt:  GGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C2R6 AT-hook motif nuclear-localized protein (e value: 2.31e-221; % identity: 88.86)
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQ+PTTT  SPTNGLLPPTHHLS+ AA++D G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
        SSSSKAK++   +SSLNA+SASSS FS PSKKSQLAALDVGNAGQ FAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA SGGNI 
Subjt:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT

Query:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDS
        YEGRFEIVSLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+ GGG KGD SAGKLPSP+GGTSMS LRYGS IDS
Subjt:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDS

Query:  GGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        GGNQ+RGNDEHQG+GE HFLLQPRGVNLTS R TDWR GLDATN+AYDL+GRTSHHSPENGDYDQIPD
Subjt:  GGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

A0A1S3C3W0 AT-hook motif nuclear-localized protein (e value: 6.82e-218; % identity: 88.32)
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQ+PTTT  SPTNGLLPPTHHLS+ AA++D G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
        SSSSKAK++   +SSLNA+SASSS FS PSKKSQLAAL  GNAGQ FAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA SGGNI 
Subjt:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT

Query:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDS
        YEGRFEIVSLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+ GGG KGD SAGKLPSP+GGTSMS LRYGS IDS
Subjt:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDS

Query:  GGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        GGNQ+RGNDEHQG+GE HFLLQPRGVNLTS R TDWR GLDATN+AYDL+GRTSHHSPENGDYDQIPD
Subjt:  GGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

A0A5A7VCX4 AT-hook motif nuclear-localized protein (e value: 1.11e-216; % identity: 86.67)
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQ+PTTT  SPTNGLLPPTHHLS+ AA++D G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALD-------VGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPA
        SSSSKAK++   +SSLNA+SASSS FS PSKKSQLAAL         GNAGQ FAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA
Subjt:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALD-------VGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPA

Query:  TSGGNITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLR
         SGGNI YEGRFEIVSLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+ GGG KGD SAGKLPSP+GGTSMS LR
Subjt:  TSGGNITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLR

Query:  YGSTIDSGGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        YGS IDSGGNQ+RGNDEHQG+GE HFLLQPRGVNLTS R TDWR GLDATN+AYDL+GRTSHHSPENGDYDQIPD
Subjt:  YGSTIDSGGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

A0A6J1CPC9 AT-hook motif nuclear-localized protein (e value: 6.66e-256; % identity: 100)
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITY
        SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITY
Subjt:  SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITY

Query:  EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSG
        EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSG
Subjt:  EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSG

Query:  GNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        GNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
Subjt:  GNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

A0A6J1CPX7 AT-hook motif nuclear-localized protein (e value: 1.98e-252; % identity: 99.46)
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITY
        SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAAL  GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITY
Subjt:  SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITY

Query:  EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSG
        EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSG
Subjt:  EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSG

Query:  GNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        GNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
Subjt:  GNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

SwissProt top hits (e value, % identity, alignment)
A1L4X7 AT-hook motif nuclear-localized protein 14 (e value: 6.2e-85; % identity: 53.21)
Query:  SSYFHHH-QHHHQSPTTTTT---------SPTNGLLPPTHHLSSTAAAADGGSH-GVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA
        S YFHH  QHHH  PTT  T         S  NGL PP           DG S   VYPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK A++
Subjt:  SSYFHHH-QHHHQSPTTTTT---------SPTNGLLPPTHHLSSTAAAADGGSH-GVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA

Query:  SSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGN
        +S SSS+K +R+  A +   +S +S S    SKKSQL +  VG  GQ F PH++N+A GEDV QKIM+F  Q K E+C+LSASG+ISNASLRQPA SGGN
Subjt:  SSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGN

Query:  ITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKD-VGGGAKGDA--SAGKLPSPVGGTSMSGLRYG
        + YEG++EI+SL GSY+RT+ GGK+GGLSV LS+SDG IIGG +G  L AAGPVQVI+GTF +D KKD  G G KGDA  S  +L SPV    + G+ + 
Subjt:  ITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKD-VGGGAKGDA--SAGKLPSPVGGTSMSGLRYG

Query:  STIDS-GGNQVRGNDE-----HQGIGEG---HFLLQ-PRGVNLTSQRPTDWRMGLDATN-----SAYDLTGRTSHHSPENGDYD-QIPD
          ++S G N +RGNDE     H   G G   HF++Q P+G+++T  RP++WR G ++ +       YDL+GR  H S ENGDY+ QIPD
Subjt:  STIDS-GGNQVRGNDE-----HQGIGEG---HFLLQ-PRGVNLTSQRPTDWRMGLDATN-----SAYDLTGRTSHHSPENGDYD-QIPD

O22812 AT-hook motif nuclear-localized protein 10 (e value: 1.5e-30; % identity: 35.52)
Query:  QLSSYFHHHQHHHQSP----------------TTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVP---SAAVSSSPLEPARRKRGRPRKYGTPEE
        Q +   H  Q H Q+                 T     P   + PP  +  ++A     G + V   ++P   S  ++ +  EP +++RGRPRKYG    
Subjt:  QLSSYFHHHQHHHQSP----------------TTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVP---SAAVSSSPLEPARRKRGRPRKYGTPEE

Query:  ALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNAS
         ++      A S + S  +               SS     SK+ +L AL  G+ G  F PHV+ V AGEDV  KIM       R +C+LSA+G+ISN +
Subjt:  ALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNAS

Query:  LRQPATSGGNITYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVID----PKKDVGG-GAKGDASAGKL
        LRQ ATSGG +TYEGRFEI+SL GS+   +  G   +TGGLSV LSS DG+++GG V G L AA PVQ++VG+F+ D    PK+ VG  G          
Subjt:  LRQPATSGGNITYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVID----PKKDVGG-GAKGDASAGKL

Query:  PSPVGGTSMSGLRYGSTIDSGGNQVRGNDEHQGIG
        P+ V  T  S    G+  +S      G+  HQ  G
Subjt:  PSPVGGTSMSGLRYGSTIDSGGNQVRGNDEHQGIG

O80834 AT-hook motif nuclear-localized protein 9 (e value: 2.1e-32; % identity: 40.34)
Query:  SPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSP---LEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSS
        S + G   P+ H   + A A GG+ G  PH +    ++  P     P +RKRGRPRKYG       A  +++ S+ + ++  KR                
Subjt:  SPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSP---LEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSS

Query:  FSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIG---GK
             + + +  L   ++G SF PHVI V+ GED+  K++ F QQ  R IC+LSASG++S A+L QP+ S G I YEGRFEI++L  SY+    G    +
Subjt:  FSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIG---GK

Query:  TGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV
        TG LSV L+S DG +IGG +GGPL AA PVQVIVG+F+
Subjt:  TGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV

Q8VYJ2 AT-hook motif nuclear-localized protein 1 (e value: 6.8e-31; % identity: 40.15)
Query:  HQSPTTTTTSPTNGLLPPTHH------------LSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAK----KAATASSH-S
        +QSPT+ T  P     P +HH             ++T AA +G S G+                  ++KRGRPRKYG     +A       +A A SH  
Subjt:  HQSPTTTTTSPTNGLLPPTHH------------LSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAK----KAATASSH-S

Query:  SSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVG---NAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNI
          S    D  AS   +    ++SF+      Q+  L      + G +F PH+I V  GEDV  KI+ F QQ  R IC+LSA+G IS+ +LRQP +SGG +
Subjt:  SSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVG---NAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNI

Query:  TYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV
        TYEGRFEI+SL GS++  D GG   +TGG+SV L+S DG ++GGG+ G L AA PVQV+VG+F+
Subjt:  TYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV

Q9SB31 AT-hook motif nuclear-localized protein 3 (e value: 6.1e-32; % identity: 40.15)
Query:  NENQLSSYFHHHQH--------HHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAA
        N N  SS+    QH        +   P     +P   L+PPT   ++   AA    +   P S+     ++S  E  ++KRGRPRKY      +      
Subjt:  NENQLSSYFHHHQH--------HHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAA

Query:  TASSH----SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNA---GQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASL
          SS     S     KR       N     S  F             VG A   G +F PHV+ V AGEDV  KIM F QQ  R ICILSA+G ISN +L
Subjt:  TASSH----SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNA---GQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASL

Query:  RQPATSGGNITYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV
        RQ  TSGG +TYEGRFEI+SL GS+++ D GG   + GG+SVCL+  DG + GGG+ G   AAGPVQV+VGTF+
Subjt:  RQPATSGGNITYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV

Arabidopsis top hits (e value, % identity, alignment)
AT2G45850.1 AT hook motif DNA-binding family protein (e value: 1.5e-33; % identity: 40.34)
Query:  SPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSP---LEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSS
        S + G   P+ H   + A A GG+ G  PH +    ++  P     P +RKRGRPRKYG       A  +++ S+ + ++  KR                
Subjt:  SPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSP---LEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSS

Query:  FSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIG---GK
             + + +  L   ++G SF PHVI V+ GED+  K++ F QQ  R IC+LSASG++S A+L QP+ S G I YEGRFEI++L  SY+    G    +
Subjt:  FSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIG---GK

Query:  TGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV
        TG LSV L+S DG +IGG +GGPL AA PVQVIVG+F+
Subjt:  TGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV

AT2G45850.2 AT hook motif DNA-binding family protein (e value: 1.5e-33; % identity: 40.34)
Query:  SPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSP---LEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSS
        S + G   P+ H   + A A GG+ G  PH +    ++  P     P +RKRGRPRKYG       A  +++ S+ + ++  KR                
Subjt:  SPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSP---LEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSS

Query:  FSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIG---GK
             + + +  L   ++G SF PHVI V+ GED+  K++ F QQ  R IC+LSASG++S A+L QP+ S G I YEGRFEI++L  SY+    G    +
Subjt:  FSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIG---GK

Query:  TGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV
        TG LSV L+S DG +IGG +GGPL AA PVQVIVG+F+
Subjt:  TGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV

AT3G04590.1 AT hook motif DNA-binding family protein (e value: 2.9e-69; % identity: 58.45)
Query:  SSYFHHH-QHHHQSPTTTTT---------SPTNGLLPPTHHLSSTAAAADGGSH-GVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA
        S YFHH  QHHH  PTT  T         S  NGL PP           DG S   VYPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK A++
Subjt:  SSYFHHH-QHHHQSPTTTTT---------SPTNGLLPPTHHLSSTAAAADGGSH-GVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA

Query:  SSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGN
        +S SSS+K +R+  A +   +S +S S    SKKSQL +  VG  GQ F PH++N+A GEDV QKIM+F  Q K E+C+LSASG+ISNASLRQPA SGGN
Subjt:  SSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGN

Query:  ITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKD-VGGGAKGDAS-AGKL
        + YEG++EI+SL GSY+RT+ GGK+GGLSV LS+SDG IIGG +G  L AAGPVQVI+GTF +D KKD  G G KGDAS +GK+
Subjt:  ITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKD-VGGGAKGDAS-AGKL

AT3G04590.2 AT hook motif DNA-binding family protein (e value: 4.4e-86; % identity: 53.21)
Query:  SSYFHHH-QHHHQSPTTTTT---------SPTNGLLPPTHHLSSTAAAADGGSH-GVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA
        S YFHH  QHHH  PTT  T         S  NGL PP           DG S   VYPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK A++
Subjt:  SSYFHHH-QHHHQSPTTTTT---------SPTNGLLPPTHHLSSTAAAADGGSH-GVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA

Query:  SSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGN
        +S SSS+K +R+  A +   +S +S S    SKKSQL +  VG  GQ F PH++N+A GEDV QKIM+F  Q K E+C+LSASG+ISNASLRQPA SGGN
Subjt:  SSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGN

Query:  ITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKD-VGGGAKGDA--SAGKLPSPVGGTSMSGLRYG
        + YEG++EI+SL GSY+RT+ GGK+GGLSV LS+SDG IIGG +G  L AAGPVQVI+GTF +D KKD  G G KGDA  S  +L SPV    + G+ + 
Subjt:  ITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKD-VGGGAKGDA--SAGKLPSPVGGTSMSGLRYG

Query:  STIDS-GGNQVRGNDE-----HQGIGEG---HFLLQ-PRGVNLTSQRPTDWRMGLDATN-----SAYDLTGRTSHHSPENGDYD-QIPD
          ++S G N +RGNDE     H   G G   HF++Q P+G+++T  RP++WR G ++ +       YDL+GR  H S ENGDY+ QIPD
Subjt:  STIDS-GGNQVRGNDE-----HQGIGEG---HFLLQ-PRGVNLTSQRPTDWRMGLDATN-----SAYDLTGRTSHHSPENGDYD-QIPD

AT5G28590.1 DNA-binding family protein (e value: 3.3e-41; % identity: 44.18)
Query:  LNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYV
        LN L + +  FS    K+          GQ F PH++N+  GEDV +KI+LF QQ K ++C+LSASGSISNASL   A+                     
Subjt:  LNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYV

Query:  RTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGK---LPSPVGGTSMSGLRYGSTIDSGGNQVRGNDEHQ
         T  GGKTGGLSVCLS+SDG I GGGVGG LKAAGPVQV++GTF ++ KKD   GAKGD ++G    LPSP G  S+ G  Y   ++S G     NDEH 
Subjt:  RTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGK---LPSPVGGTSMSGLRYGSTIDSGGNQVRGNDEHQ

Query:  GI------GEGHFLLQ-PRGVNLTSQRPTDWRMGLDATNSAYDLTGRTS
         I      G  HF+++ P+G+++T  RP++W        + YDL+G++S
Subjt:  GI------GEGHFLLQ-PRGVNLTSQRPTDWRMGLDATNSAYDLTGRTS


Sequences
CDS sequence
ATGGAACCCAATGAAAACCAGCTCAGCTCCTACTTCCACCACCATCAGCACCACCATCAAAGTCCCACCACCACCACCACATCGCCGACCAATGGACTTTTACCCCCCAC
CCACCACCTCTCCTCCACCGCCGCCGCCGCAGACGGCGGCTCTCACGGCGTCTACCCTCACTCCGTGCCCTCCGCCGCGGTCTCCTCCTCGCCCCTCGAGCCCGCTCGCC
GGAAGAGGGGCCGCCCCAGGAAGTACGGCACTCCCGAGGAAGCTTTAGCTGCGAAGAAAGCTGCTACGGCGTCGTCTCACTCTTCCTCCTCCAAGGCCAAGAGGGACCAC
CACGCCTCTTCCCTTAATGCTCTCTCCGCTTCTTCTTCTTCCTTCTCTGCCCCTTCCAAGAAATCCCAGTTGGCAGCACTTGATGTAGGTAATGCAGGCCAAAGTTTTGC
ACCACATGTTATTAATGTGGCAGCTGGTGAGGATGTTGGCCAGAAAATTATGCTGTTTATGCAGCAATGCAAGCGGGAAATCTGTATCCTTTCTGCTTCTGGTTCGATCT
CCAATGCATCTCTCCGTCAGCCAGCCACATCCGGAGGCAATATTACGTATGAGGGTCGTTTTGAGATTGTTTCATTATGCGGATCTTATGTACGTACTGACATCGGAGGA
AAGACTGGTGGTCTTAGTGTATGTTTGTCTAGTTCTGATGGTCATATCATAGGAGGGGGAGTCGGTGGACCATTGAAGGCTGCTGGACCCGTGCAGGTTATTGTTGGTAC
TTTTGTGATCGACCCAAAGAAGGACGTTGGTGGTGGTGCAAAAGGTGACGCATCTGCTGGCAAGTTGCCCTCACCTGTTGGTGGGACGTCGATGTCGGGTCTACGCTATG
GCTCGACCATCGACTCAGGAGGTAATCAAGTCAGGGGAAATGATGAACACCAAGGCATTGGGGAGGGTCATTTCTTGCTTCAGCCCCGGGGCGTGAATCTGACATCTCAG
CGGCCCACAGACTGGAGGATGGGTCTGGATGCCACAAACAGTGCTTACGATTTGACAGGAAGAACAAGCCATCATTCTCCCGAAAACGGAGATTACGATCAGATTCCCGA
TTAA
mRNA sequence
GAAAAAGATTGAAAATGATATATACTTCAAGTACAAAATTATAAATTTCCTCCTTTTGGAGCTTCGCTTTCTACTTCGGTTGGTGTGTTGCAGCAATAAAAAAGGGGAAT
GAATTCATTTGGGTATCGAAGTTTTCGTTATTACCATGTATTTTCTCATGTATTATATTTATAATACTTGTTGAAATGCATAAATTGATTTCTAATTAAAAGACACTAAT
TGTGAAGTAGACTAAATACAAGCTTATTTTAAAAGAGGAATAAAATATTTTTAAACGGATAATCAAATTAAAAAATATCAAAAATGAATCACATTTATTTATTCAATATA
AATCTAAATTTAACATTTGTCATAAATTCAAAAAAAATATATACATTATTTCAATTTTTTCCCTCTAATTTGACTTCTCCAAAATCTTTGAACCCGAAAAGTTACACACA
TATATATAAAGTTCCAAGGTAAAATTTTACTCTTACAAAGAGAGGCAATGAATTAATGGTAATTTAAAAGAAAAGAAAAGGAAGCCAAAAAAGGTGAGAAGAAACCCTAA
AGAAGATGGAGTTAAGTCAAAGGGAAAAAACGAGAGTCAAATCTCATAATACTTAATAAATAAATAAATAAAAAGAATTGTTTATTTATGAGATATTATGGATAAATATA
AATCGGTAGTGTACAGATACACACACAAAAGAAAAAAAGAAAAAAAAAGGAAAGAAAGAAAAGGGCAAATCCTTTTCAAAAAAAGAAAAAAAAGGATTTATTTATTAATT
TTTATTTTTATTTTTCCAAAACACACGCTCTGTTTCTCTCTCTTTCTTTGTTTCTCTCTCTCATCTCAAAAATGGAACCCAATGAAAACCAGCTCAGCTCCTACTTCCAC
CACCATCAGCACCACCATCAAAGTCCCACCACCACCACCACATCGCCGACCAATGGACTTTTACCCCCCACCCACCACCTCTCCTCCACCGCCGCCGCCGCAGACGGCGG
CTCTCACGGCGTCTACCCTCACTCCGTGCCCTCCGCCGCGGTCTCCTCCTCGCCCCTCGAGCCCGCTCGCCGGAAGAGGGGCCGCCCCAGGAAGTACGGCACTCCCGAGG
AAGCTTTAGCTGCGAAGAAAGCTGCTACGGCGTCGTCTCACTCTTCCTCCTCCAAGGCCAAGAGGGACCACCACGCCTCTTCCCTTAATGCTCTCTCCGCTTCTTCTTCT
TCCTTCTCTGCCCCTTCCAAGAAATCCCAGTTGGCAGCACTTGATGTAGGTAATGCAGGCCAAAGTTTTGCACCACATGTTATTAATGTGGCAGCTGGTGAGGATGTTGG
CCAGAAAATTATGCTGTTTATGCAGCAATGCAAGCGGGAAATCTGTATCCTTTCTGCTTCTGGTTCGATCTCCAATGCATCTCTCCGTCAGCCAGCCACATCCGGAGGCA
ATATTACGTATGAGGGTCGTTTTGAGATTGTTTCATTATGCGGATCTTATGTACGTACTGACATCGGAGGAAAGACTGGTGGTCTTAGTGTATGTTTGTCTAGTTCTGAT
GGTCATATCATAGGAGGGGGAGTCGGTGGACCATTGAAGGCTGCTGGACCCGTGCAGGTTATTGTTGGTACTTTTGTGATCGACCCAAAGAAGGACGTTGGTGGTGGTGC
AAAAGGTGACGCATCTGCTGGCAAGTTGCCCTCACCTGTTGGTGGGACGTCGATGTCGGGTCTACGCTATGGCTCGACCATCGACTCAGGAGGTAATCAAGTCAGGGGAA
ATGATGAACACCAAGGCATTGGGGAGGGTCATTTCTTGCTTCAGCCCCGGGGCGTGAATCTGACATCTCAGCGGCCCACAGACTGGAGGATGGGTCTGGATGCCACAAAC
AGTGCTTACGATTTGACAGGAAGAACAAGCCATCATTCTCCCGAAAACGGAGATTACGATCAGATTCCCGATTAAGAGTTAACAAAAGGGACGGGATGGGACAGGACGGG
ACAGGACGGGACTGCAAAGTTTGTCGTAGATAAAATGTACAATATCAACAGTTGCCATGCCAGCAATCTTCTCTTTCACTCTGTCTTAGAGCTAATTCTGGTTGTAGTAT
GCACCATACTGTAAAGGTGATAACAAACTCTTTAGAAGTTCTTTTCTTTCTTTTTTTCTTTATTTTTTCTTTTTTAAATCTTTTTTATGCTCTCTTTGCATTTTATTTTC
CCCTTTCCAAGTTTTCATCCTTCCACCTTTCATGGTGTATTTGCATGTTGTTGCTGTCTAATTCTAATTCTCTCCTGTTTTTTGCTTTAGATGGAACAAAGGTTGTTGTA
GTTACAGTTTGCAATTTCCAATTTGATCATATAAACAAACACCTCTTTGGTGATTTAGATTTCTATGCACTAATCTTCTTGTGATGAACAATGTTATTTGATGAACTCAC
AAAGATTATGATTT
Protein sequence
MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDH
HASSLNALSASSSSFSAPSKKSQLAALDVGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIGG
KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQ
RPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD