CuGenDBv2

Moc08g43200 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g43200
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: AT-hook motif nuclear-localized protein
Genome location: chr8:33193050..33199346
RNA-Seq Expression: Moc08g43200
Synteny: Moc08g43200
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
                     GO:0003680 - AT DNA binding (molecular function)
InterPro domains: IPR005175 - PPC domain
                  IPR039605 - AT-hook motif nuclear-localized protein
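
The genome location above uses a `seqid:start..end` notation. As a minimal sketch, it can be split into its parts and the length of the interval computed as follows. The names `Locus` and `parse_locus` are illustrative only, and the sketch assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Inclusive interval length: both end points are counted.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a 'seqid:start..end' string such as 'chr8:33193050..33199346'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"not a seqid:start..end location: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Locus(seqid, start, end)


locus = parse_locus("chr8:33193050..33199346")
print(locus.seqid, locus.start, locus.end, locus.span)
```

If the database uses a different coordinate convention (for example 0-based, half-open), only the `span` property would need to change.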


Homology
GenBank top hits    e value    % identity
XP_004139392.1 AT-hook motif nuclear-localized protein 14 [Cucumis sativus]    1.6e-170    88.25
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQ+P  TTTSPTNGLLPPTHHLS+ AA++D G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYE
        SSSSKAK++   +SSLNA+SA SSSFS PSKKSQLAALGNAGQ FAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA SGGNI YE
Subjt:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYE

Query:  GRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGG
        GRFEIVSLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+ GGG KGD SA KLPSP+GGTSMS LRYGS IDSGG
Subjt:  GRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGG

Query:  NQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        NQ+RGNDEHQG+GE HFLLQPRGVNLTS R TDWR GLDATN+AYDL+GRT HHSPENGDYDQIPD
Subjt:  NQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

XP_008456418.1 PREDICTED: AT-hook motif nuclear-localized protein 14 isoform X2 [Cucumis melo]    4.9e-172    88.8
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQ+P  TTTSPTNGLLPPTHHLS+ AA++D G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYE
        SSSSKAK++   +SSLNA+SA SSSFS PSKKSQLAALGNAGQ FAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA SGGNI YE
Subjt:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYE

Query:  GRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGG
        GRFEIVSLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+ GGG KGD SAGKLPSP+GGTSMS LRYGS IDSGG
Subjt:  GRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGG

Query:  NQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        NQ+RGNDEHQG+GE HFLLQPRGVNLTS R TDWR GLDATN+AYDL+GRTSHHSPENGDYDQIPD
Subjt:  NQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

XP_022143141.1 AT-hook motif nuclear-localized protein 14-like isoform X1 [Momordica charantia]    7.5e-197    99.46
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAAL--GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITY
        SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAAL  GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITY
Subjt:  SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAAL--GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITY

Query:  EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSG
        EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSG
Subjt:  EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSG

Query:  GNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        GNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
Subjt:  GNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

XP_022143142.1 AT-hook motif nuclear-localized protein 14-like isoform X2 [Momordica charantia]    2.3e-198    100
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEG
        SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEG
Subjt:  SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEG

Query:  RFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGN
        RFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGN
Subjt:  RFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGN

Query:  QVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        QVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
Subjt:  QVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

XP_038890429.1 AT-hook motif nuclear-localized protein 14 [Benincasa hispida]    2.6e-173    90.16
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQSP  TTTSPTNGLLPPTHHLSS AAA+D G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYE
        SSSSKAK++   +SSLNA+SA SSSFSAPSKKSQLA LGNAGQ FAPHVINVAAGEDVGQKIM+FMQQCKREICILSASGSISNASLRQPA SGGNI YE
Subjt:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYE

Query:  GRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGG
        GRFEIVSLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+VGGG KGDASAGKLPSP+GGTSMS LRYGS IDSGG
Subjt:  GRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGG

Query:  NQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        NQ+RGNDEHQG+GE HFLLQPRGVNLTS R TDWR GLDATN+AYDLTGRT HHSPENGDYDQIPD
Subjt:  NQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

TrEMBL top hits    e value    % identity
A0A1S3C2R6 AT-hook motif nuclear-localized protein    7.7e-171    88.32
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQ+P  TTTSPTNGLLPPTHHLS+ AA++D G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAAL--GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
        SSSSKAK++   +SSLNA+SA SSSFS PSKKSQLAAL  GNAGQ FAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA SGGNI 
Subjt:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAAL--GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT

Query:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDS
        YEGRFEIVSLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+ GGG KGD SAGKLPSP+GGTSMS LRYGS IDS
Subjt:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDS

Query:  GGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        GGNQ+RGNDEHQG+GE HFLLQPRGVNLTS R TDWR GLDATN+AYDL+GRTSHHSPENGDYDQIPD
Subjt:  GGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

A0A1S3C3W0 AT-hook motif nuclear-localized protein    2.4e-172    88.8
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQ+P  TTTSPTNGLLPPTHHLS+ AA++D G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYE
        SSSSKAK++   +SSLNA+SA SSSFS PSKKSQLAALGNAGQ FAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA SGGNI YE
Subjt:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYE

Query:  GRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGG
        GRFEIVSLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+ GGG KGD SAGKLPSP+GGTSMS LRYGS IDSGG
Subjt:  GRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGG

Query:  NQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        NQ+RGNDEHQG+GE HFLLQPRGVNLTS R TDWR GLDATN+AYDL+GRTSHHSPENGDYDQIPD
Subjt:  NQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

A0A5A7VCX4 AT-hook motif nuclear-localized protein    5.0e-170    86.67
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQ+P  TTTSPTNGLLPPTHHLS+ AA++D G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAAL---------GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPA
        SSSSKAK++   +SSLNA+SA SSSFS PSKKSQLAAL         GNAGQ FAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA
Subjt:  SSSSKAKRD-HHASSLNALSASSSSFSAPSKKSQLAAL---------GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPA

Query:  TSGGNITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLR
         SGGNI YEGRFEIVSLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+ GGG KGD SAGKLPSP+GGTSMS LR
Subjt:  TSGGNITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLR

Query:  YGSTIDSGGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        YGS IDSGGNQ+RGNDEHQG+GE HFLLQPRGVNLTS R TDWR GLDATN+AYDL+GRTSHHSPENGDYDQIPD
Subjt:  YGSTIDSGGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

A0A6J1CPC9 AT-hook motif nuclear-localized protein    3.6e-197    99.46
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAAL--GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITY
        SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAAL  GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITY
Subjt:  SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAAL--GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITY

Query:  EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSG
        EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSG
Subjt:  EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSG

Query:  GNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        GNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
Subjt:  GNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

A0A6J1CPX7 AT-hook motif nuclear-localized protein    1.1e-198    100
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
        MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSH

Query:  SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEG
        SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEG
Subjt:  SSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEG

Query:  RFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGN
        RFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGN
Subjt:  RFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGN

Query:  QVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        QVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
Subjt:  QVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

SwissProt top hits    e value    % identity
A1L4X7 AT-hook motif nuclear-localized protein 14    4.3e-86    53.23
Query:  SSYFHHH-QHHHQSPTTTTT---------SPTNGLLPPTHHLSSTAAAADGGSH-GVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA
        S YFHH  QHHH  PTT  T         S  NGL PP           DG S   VYPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK A++
Subjt:  SSYFHHH-QHHHQSPTTTTT---------SPTNGLLPPTHHLSSTAAAADGGSH-GVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA

Query:  SSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
        +S SSS+K +R+  A +   +S +S S    SKKSQL ++G  GQ F PH++N+A GEDV QKIM+F  Q K E+C+LSASG+ISNASLRQPA SGGN+ 
Subjt:  SSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT

Query:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKD-VGGGAKGDA--SAGKLPSPVGGTSMSGLRYGST
        YEG++EI+SL GSY+RT+ GGK+GGLSV LS+SDG IIGG +G  L AAGPVQVI+GTF +D KKD  G G KGDA  S  +L SPV    + G+ +   
Subjt:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKD-VGGGAKGDA--SAGKLPSPVGGTSMSGLRYGST

Query:  IDS-GGNQVRGNDE-----HQGIGEG---HFLLQ-PRGVNLTSQRPTDWRMGLDATN-----SAYDLTGRTSHHSPENGDYD-QIPD
        ++S G N +RGNDE     H   G G   HF++Q P+G+++T  RP++WR G ++ +       YDL+GR  H S ENGDY+ QIPD
Subjt:  IDS-GGNQVRGNDE-----HQGIGEG---HFLLQ-PRGVNLTSQRPTDWRMGLDATN-----SAYDLTGRTSHHSPENGDYD-QIPD

O22812 AT-hook motif nuclear-localized protein 10    4.7e-32    35.74
Query:  QLSSYFHHHQHHHQSP----------------TTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVP---SAAVSSSPLEPARRKRGRPRKYGTPEE
        Q +   H  Q H Q+                 T     P   + PP  +  ++A     G + V   ++P   S  ++ +  EP +++RGRPRKYG    
Subjt:  QLSSYFHHHQHHHQSP----------------TTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVP---SAAVSSSPLEPARRKRGRPRKYGTPEE

Query:  ALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLR
         ++      A S + S  +               SS     SK+ +L ALG+ G  F PHV+ V AGEDV  KIM       R +C+LSA+G+ISN +LR
Subjt:  ALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLR

Query:  QPATSGGNITYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVID----PKKDVGG-GAKGDASAGKLPS
        Q ATSGG +TYEGRFEI+SL GS+   +  G   +TGGLSV LSS DG+++GG V G L AA PVQ++VG+F+ D    PK+ VG  G          P+
Subjt:  QPATSGGNITYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVID----PKKDVGG-GAKGDASAGKLPS

Query:  PVGGTSMSGLRYGSTIDSGGNQVRGNDEHQGIG
         V  T  S    G+  +S      G+  HQ  G
Subjt:  PVGGTSMSGLRYGSTIDSGGNQVRGNDEHQGIG

O80834 AT-hook motif nuclear-localized protein 9    2.5e-33    41.08
Query:  SPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSP---LEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSS
        S + G   P+ H   + A A GG+ G  PH +    ++  P     P +RKRGRPRKYG       A  +++ S+ + ++  KR                
Subjt:  SPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSP---LEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSS

Query:  FSAPSKKSQLAALG-----NAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIG--
             KK ++A++G     ++G SF PHVI V+ GED+  K++ F QQ  R IC+LSASG++S A+L QP+ S G I YEGRFEI++L  SY+    G  
Subjt:  FSAPSKKSQLAALG-----NAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIG--

Query:  -GKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV
          +TG LSV L+S DG +IGG +GGPL AA PVQVIVG+F+
Subjt:  -GKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV

Q8VYJ2 AT-hook motif nuclear-localized protein 1    7.9e-32    40.53
Query:  HQSPTTTTTSPTNGLLPPTHH------------LSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAK----KAATASSH-S
        +QSPT+ T  P     P +HH             ++T AA +G S G+                  ++KRGRPRKYG     +A       +A A SH  
Subjt:  HQSPTTTTTSPTNGLLPPTHH------------LSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAK----KAATASSH-S

Query:  SSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALG-----NAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNI
          S    D  AS   +    ++SF+      Q+  LG     + G +F PH+I V  GEDV  KI+ F QQ  R IC+LSA+G IS+ +LRQP +SGG +
Subjt:  SSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALG-----NAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNI

Query:  TYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV
        TYEGRFEI+SL GS++  D GG   +TGG+SV L+S DG ++GGG+ G L AA PVQV+VG+F+
Subjt:  TYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV

Q9SB31 AT-hook motif nuclear-localized protein 3    2.7e-32    40
Query:  NENQLSSYFHHHQH--------HHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAA----
        N N  SS+    QH        +   P     +P   L+PPT   ++   AA    +   P S+     ++S  E  ++KRGRPRKY  P+  L      
Subjt:  NENQLSSYFHHHQH--------HHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAA----

Query:  ---KKAATASSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNA---GQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNAS
             +   +S     K  R    S+     +    F      + LA +G A   G +F PHV+ V AGEDV  KIM F QQ  R ICILSA+G ISN +
Subjt:  ---KKAATASSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNA---GQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNAS

Query:  LRQPATSGGNITYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV
        LRQ  TSGG +TYEGRFEI+SL GS+++ D GG   + GG+SVCL+  DG + GGG+ G   AAGPVQV+VGTF+
Subjt:  LRQPATSGGNITYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV

Arabidopsis top hits    e value    % identity
AT2G45850.1 AT hook motif DNA-binding family protein    1.8e-34    41.08
Query:  SPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSP---LEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSS
        S + G   P+ H   + A A GG+ G  PH +    ++  P     P +RKRGRPRKYG       A  +++ S+ + ++  KR                
Subjt:  SPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSP---LEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSS

Query:  FSAPSKKSQLAALG-----NAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIG--
             KK ++A++G     ++G SF PHVI V+ GED+  K++ F QQ  R IC+LSASG++S A+L QP+ S G I YEGRFEI++L  SY+    G  
Subjt:  FSAPSKKSQLAALG-----NAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIG--

Query:  -GKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV
          +TG LSV L+S DG +IGG +GGPL AA PVQVIVG+F+
Subjt:  -GKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV

AT2G45850.2 AT hook motif DNA-binding family protein    1.8e-34    41.08
Query:  SPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSP---LEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSS
        S + G   P+ H   + A A GG+ G  PH +    ++  P     P +RKRGRPRKYG       A  +++ S+ + ++  KR                
Subjt:  SPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSP---LEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSS

Query:  FSAPSKKSQLAALG-----NAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIG--
             KK ++A++G     ++G SF PHVI V+ GED+  K++ F QQ  R IC+LSASG++S A+L QP+ S G I YEGRFEI++L  SY+    G  
Subjt:  FSAPSKKSQLAALG-----NAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIG--

Query:  -GKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV
          +TG LSV L+S DG +IGG +GGPL AA PVQVIVG+F+
Subjt:  -GKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV

AT3G04590.1 AT hook motif DNA-binding family protein    2.6e-70    58.51
Query:  SSYFHHH-QHHHQSPTTTTT---------SPTNGLLPPTHHLSSTAAAADGGSH-GVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA
        S YFHH  QHHH  PTT  T         S  NGL PP           DG S   VYPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK A++
Subjt:  SSYFHHH-QHHHQSPTTTTT---------SPTNGLLPPTHHLSSTAAAADGGSH-GVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA

Query:  SSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
        +S SSS+K +R+  A +   +S +S S    SKKSQL ++G  GQ F PH++N+A GEDV QKIM+F  Q K E+C+LSASG+ISNASLRQPA SGGN+ 
Subjt:  SSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT

Query:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKD-VGGGAKGDAS-AGKL
        YEG++EI+SL GSY+RT+ GGK+GGLSV LS+SDG IIGG +G  L AAGPVQVI+GTF +D KKD  G G KGDAS +GK+
Subjt:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKD-VGGGAKGDAS-AGKL

AT3G04590.2 AT hook motif DNA-binding family protein    3.0e-87    53.23
Query:  SSYFHHH-QHHHQSPTTTTT---------SPTNGLLPPTHHLSSTAAAADGGSH-GVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA
        S YFHH  QHHH  PTT  T         S  NGL PP           DG S   VYPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK A++
Subjt:  SSYFHHH-QHHHQSPTTTTT---------SPTNGLLPPTHHLSSTAAAADGGSH-GVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA

Query:  SSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
        +S SSS+K +R+  A +   +S +S S    SKKSQL ++G  GQ F PH++N+A GEDV QKIM+F  Q K E+C+LSASG+ISNASLRQPA SGGN+ 
Subjt:  SSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT

Query:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKD-VGGGAKGDA--SAGKLPSPVGGTSMSGLRYGST
        YEG++EI+SL GSY+RT+ GGK+GGLSV LS+SDG IIGG +G  L AAGPVQVI+GTF +D KKD  G G KGDA  S  +L SPV    + G+ +   
Subjt:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKD-VGGGAKGDA--SAGKLPSPVGGTSMSGLRYGST

Query:  IDS-GGNQVRGNDE-----HQGIGEG---HFLLQ-PRGVNLTSQRPTDWRMGLDATN-----SAYDLTGRTSHHSPENGDYD-QIPD
        ++S G N +RGNDE     H   G G   HF++Q P+G+++T  RP++WR G ++ +       YDL+GR  H S ENGDY+ QIPD
Subjt:  IDS-GGNQVRGNDE-----HQGIGEG---HFLLQ-PRGVNLTSQRPTDWRMGLDATN-----SAYDLTGRTSHHSPENGDYD-QIPD

AT5G28590.1 DNA-binding family protein    6.6e-42    44.94
Query:  LNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRT
        LN L + +  FS         AL   GQ F PH++N+  GEDV +KI+LF QQ K ++C+LSASGSISNASL   A+                      T
Subjt:  LNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRT

Query:  DIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGK---LPSPVGGTSMSGLRYGSTIDSGGNQVRGNDEHQGI
          GGKTGGLSVCLS+SDG I GGGVGG LKAAGPVQV++GTF ++ KKD   GAKGD ++G    LPSP G  S+ G  Y   ++S G     NDEH  I
Subjt:  DIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGK---LPSPVGGTSMSGLRYGSTIDSGGNQVRGNDEHQGI

Query:  ------GEGHFLLQ-PRGVNLTSQRPTDWRMGLDATNSAYDLTGRTS
              G  HF+++ P+G+++T  RP++W        + YDL+G++S
Subjt:  ------GEGHFLLQ-PRGVNLTSQRPTDWRMGLDATNSAYDLTGRTS
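
In the alignment blocks above, the unlabeled middle row marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The following is a minimal sketch of how identities and positives could be tallied from one Query/middle-row pair. The function name `midline_stats` is illustrative, and the example uses only the last segment of the AT2G45850.1 alignment, so its percentage is not expected to match the whole-alignment % identity reported in the table.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Tally one alignment segment from its query row and middle row.

    The 'Query:  ' label and the matching 8-space indent of the middle
    row are ASSUMED to have been removed before calling this function.
    """
    columns = len(query)
    # Trailing spaces of the middle row are often stripped; pad them back.
    midline = midline.ljust(columns)[:columns]
    identities = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return {
        "columns": columns,
        "identities": identities,
        "positives": identities + conservative,
        "percent_identity": round(100.0 * identities / columns, 2) if columns else 0.0,
    }


# Last segment of the AT2G45850.1 alignment, copied from the record.
query = "-GKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV"
midline = "  +TG LSV L+S DG +IGG +GGPL AA PVQVIVG+F+"
print(midline_stats(query, midline))
```

Counting from the middle row avoids having to compare the Query and Subjt rows character by character.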


Sequences
CDS sequence
ATGGAACCCAATGAAAACCAGCTCAGCTCCTACTTCCACCACCATCAGCACCACCATCAAAGTCCCACCACCACCACCACATCGCCGACCAATGGACTTTTACCCCCCAC
CCACCACCTCTCCTCCACCGCCGCCGCCGCAGACGGCGGCTCTCACGGCGTCTACCCTCACTCCGTGCCCTCCGCCGCGGTCTCCTCCTCGCCCCTCGAGCCCGCTCGCC
GGAAGAGGGGCCGCCCCAGGAAGTACGGCACTCCCGAGGAAGCTTTAGCTGCGAAGAAAGCTGCTACGGCGTCGTCTCACTCTTCCTCCTCCAAGGCCAAGAGGGACCAC
CACGCCTCTTCCCTTAATGCTCTCTCCGCTTCTTCTTCTTCCTTCTCTGCCCCTTCCAAGAAATCCCAGTTGGCAGCACTTGGTAATGCAGGCCAAAGTTTTGCACCACA
TGTTATTAATGTGGCAGCTGGTGAGGATGTTGGCCAGAAAATTATGCTGTTTATGCAGCAATGCAAGCGGGAAATCTGTATCCTTTCTGCTTCTGGTTCGATCTCCAATG
CATCTCTCCGTCAGCCAGCCACATCCGGAGGCAATATTACGTATGAGGGTCGTTTTGAGATTGTTTCATTATGCGGATCTTATGTACGTACTGACATCGGAGGAAAGACT
GGTGGTCTTAGTGTATGTTTGTCTAGTTCTGATGGTCATATCATAGGAGGGGGAGTCGGTGGACCATTGAAGGCTGCTGGACCCGTGCAGGTTATTGTTGGTACTTTTGT
GATCGACCCAAAGAAGGACGTTGGTGGTGGTGCAAAAGGTGACGCATCTGCTGGCAAGTTGCCCTCACCTGTTGGTGGGACGTCGATGTCGGGTCTACGCTATGGCTCGA
CCATCGACTCAGGAGGTAATCAAGTCAGGGGAAATGATGAACACCAAGGCATTGGGGAGGGTCATTTCTTGCTTCAGCCCCGGGGCGTGAATCTGACATCTCAGCGGCCC
ACAGACTGGAGGATGGGTCTGGATGCCACAAACAGTGCTTACGATTTGACAGGAAGAACAAGCCATCATTCTCCCGAAAACGGAGATTACGATCAGATTCCCGATTAA
mRNA sequence
ATGGAACCCAATGAAAACCAGCTCAGCTCCTACTTCCACCACCATCAGCACCACCATCAAAGTCCCACCACCACCACCACATCGCCGACCAATGGACTTTTACCCCCCAC
CCACCACCTCTCCTCCACCGCCGCCGCCGCAGACGGCGGCTCTCACGGCGTCTACCCTCACTCCGTGCCCTCCGCCGCGGTCTCCTCCTCGCCCCTCGAGCCCGCTCGCC
GGAAGAGGGGCCGCCCCAGGAAGTACGGCACTCCCGAGGAAGCTTTAGCTGCGAAGAAAGCTGCTACGGCGTCGTCTCACTCTTCCTCCTCCAAGGCCAAGAGGGACCAC
CACGCCTCTTCCCTTAATGCTCTCTCCGCTTCTTCTTCTTCCTTCTCTGCCCCTTCCAAGAAATCCCAGTTGGCAGCACTTGGTAATGCAGGCCAAAGTTTTGCACCACA
TGTTATTAATGTGGCAGCTGGTGAGGATGTTGGCCAGAAAATTATGCTGTTTATGCAGCAATGCAAGCGGGAAATCTGTATCCTTTCTGCTTCTGGTTCGATCTCCAATG
CATCTCTCCGTCAGCCAGCCACATCCGGAGGCAATATTACGTATGAGGGTCGTTTTGAGATTGTTTCATTATGCGGATCTTATGTACGTACTGACATCGGAGGAAAGACT
GGTGGTCTTAGTGTATGTTTGTCTAGTTCTGATGGTCATATCATAGGAGGGGGAGTCGGTGGACCATTGAAGGCTGCTGGACCCGTGCAGGTTATTGTTGGTACTTTTGT
GATCGACCCAAAGAAGGACGTTGGTGGTGGTGCAAAAGGTGACGCATCTGCTGGCAAGTTGCCCTCACCTGTTGGTGGGACGTCGATGTCGGGTCTACGCTATGGCTCGA
CCATCGACTCAGGAGGTAATCAAGTCAGGGGAAATGATGAACACCAAGGCATTGGGGAGGGTCATTTCTTGCTTCAGCCCCGGGGCGTGAATCTGACATCTCAGCGGCCC
ACAGACTGGAGGATGGGTCTGGATGCCACAAACAGTGCTTACGATTTGACAGGAAGAACAAGCCATCATTCTCCCGAAAACGGAGATTACGATCAGATTCCCGATTAA
Protein sequence
MEPNENQLSSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDH
HASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIGGKT
GGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRP
TDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
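
The protein sequence corresponds to the CDS read codon by codon up to the terminal `TAA` stop codon. As a minimal sketch, assuming the standard nuclear genetic code, the relationship can be checked on the first and last codons of the CDS shown above. The names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First ten codons and last five codons of the CDS in this record.
print(translate("ATGGAACCCAATGAAAACCAGCTCAGCTCC"))
print(translate("CAGATTCCCGATTAA"))
```

Passing the full CDS, with its line breaks left in, should reproduce the protein sequence listed above, since whitespace is removed before translation.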