; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS005259 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS005259
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionAT-hook motif nuclear-localized protein
Genome locationscaffold83:368506..374785
RNA-Seq ExpressionMS005259
SyntenyMS005259
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003680 - AT DNA binding (molecular function)
InterPro domainsIPR005175 - PPC domain
IPR039605 - AT-hook motif nuclear-localized protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139392.1 AT-hook motif nuclear-localized protein 14 [Cucumis sativus]1.5e-16587.99Show/hide
Query:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR
        SSYFHHHQHHHQ+P  TTTSPTNGLLPPTHHLS+ AA++D G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAK+
Subjt:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR

Query:  D-HHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSL
        +   +SSLNA+SA SSSFS PSKKSQLAALGNAGQ FAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA SGGNI YEGRFEIVSL
Subjt:  D-HHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSL

Query:  CGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDE
        CGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+ GGG KGD SA KLPSP+GGTSMS LRYGS IDSGGNQ+RGNDE
Subjt:  CGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDE

Query:  HQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        HQG+GE HFLLQPRGVNLTS R TDWR GLDATN+AYDL+GRT HHSPENGDYDQIPD
Subjt:  HQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

XP_008456418.1 PREDICTED: AT-hook motif nuclear-localized protein 14 isoform X2 [Cucumis melo]4.7e-16788.55Show/hide
Query:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR
        SSYFHHHQHHHQ+P  TTTSPTNGLLPPTHHLS+ AA++D G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAK+
Subjt:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR

Query:  D-HHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSL
        +   +SSLNA+SA SSSFS PSKKSQLAALGNAGQ FAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA SGGNI YEGRFEIVSL
Subjt:  D-HHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSL

Query:  CGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDE
        CGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+ GGG KGD SAGKLPSP+GGTSMS LRYGS IDSGGNQ+RGNDE
Subjt:  CGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDE

Query:  HQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        HQG+GE HFLLQPRGVNLTS R TDWR GLDATN+AYDL+GRTSHHSPENGDYDQIPD
Subjt:  HQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

XP_022143141.1 AT-hook motif nuclear-localized protein 14-like isoform X1 [Momordica charantia]7.1e-19299.44Show/hide
Query:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR
        SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR
Subjt:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR

Query:  DHHASSLNALSASSSSFSAPSKKSQLAAL--GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVS
        DHHASSLNALSASSSSFSAPSKKSQLAAL  GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVS
Subjt:  DHHASSLNALSASSSSFSAPSKKSQLAAL--GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVS

Query:  LCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGND
        LCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGND
Subjt:  LCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGND

Query:  EHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        EHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
Subjt:  EHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

XP_022143142.1 AT-hook motif nuclear-localized protein 14-like isoform X2 [Momordica charantia]2.2e-193100Show/hide
Query:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR
        SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR
Subjt:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR

Query:  DHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLC
        DHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLC
Subjt:  DHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLC

Query:  GSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDEH
        GSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDEH
Subjt:  GSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDEH

Query:  QGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        QGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
Subjt:  QGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

XP_038890429.1 AT-hook motif nuclear-localized protein 14 [Benincasa hispida]2.5e-16889.94Show/hide
Query:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR
        SSYFHHHQHHHQSP  TTTSPTNGLLPPTHHLSS AAA+D G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAK+
Subjt:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR

Query:  D-HHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSL
        +   +SSLNA+SA SSSFSAPSKKSQLA LGNAGQ FAPHVINVAAGEDVGQKIM+FMQQCKREICILSASGSISNASLRQPA SGGNI YEGRFEIVSL
Subjt:  D-HHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSL

Query:  CGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDE
        CGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+VGGG KGDASAGKLPSP+GGTSMS LRYGS IDSGGNQ+RGNDE
Subjt:  CGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDE

Query:  HQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        HQG+GE HFLLQPRGVNLTS R TDWR GLDATN+AYDLTGRT HHSPENGDYDQIPD
Subjt:  HQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

TrEMBL top hitse value%identityAlignment
A0A0A0LJ73 AT-hook motif nuclear-localized protein7.3e-16687.99Show/hide
Query:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR
        SSYFHHHQHHHQ+P  TTTSPTNGLLPPTHHLS+ AA++D G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAK+
Subjt:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR

Query:  D-HHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSL
        +   +SSLNA+SA SSSFS PSKKSQLAALGNAGQ FAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA SGGNI YEGRFEIVSL
Subjt:  D-HHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSL

Query:  CGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDE
        CGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+ GGG KGD SA KLPSP+GGTSMS LRYGS IDSGGNQ+RGNDE
Subjt:  CGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDE

Query:  HQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        HQG+GE HFLLQPRGVNLTS R TDWR GLDATN+AYDL+GRT HHSPENGDYDQIPD
Subjt:  HQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

A0A1S3C2R6 AT-hook motif nuclear-localized protein7.3e-16688.06Show/hide
Query:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR
        SSYFHHHQHHHQ+P  TTTSPTNGLLPPTHHLS+ AA++D G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAK+
Subjt:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR

Query:  D-HHASSLNALSASSSSFSAPSKKSQLAAL--GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIV
        +   +SSLNA+SA SSSFS PSKKSQLAAL  GNAGQ FAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA SGGNI YEGRFEIV
Subjt:  D-HHASSLNALSASSSSFSAPSKKSQLAAL--GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIV

Query:  SLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGN
        SLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+ GGG KGD SAGKLPSP+GGTSMS LRYGS IDSGGNQ+RGN
Subjt:  SLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGN

Query:  DEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        DEHQG+GE HFLLQPRGVNLTS R TDWR GLDATN+AYDL+GRTSHHSPENGDYDQIPD
Subjt:  DEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

A0A1S3C3W0 AT-hook motif nuclear-localized protein2.3e-16788.55Show/hide
Query:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR
        SSYFHHHQHHHQ+P  TTTSPTNGLLPPTHHLS+ AA++D G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAK+
Subjt:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR

Query:  D-HHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSL
        +   +SSLNA+SA SSSFS PSKKSQLAALGNAGQ FAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA SGGNI YEGRFEIVSL
Subjt:  D-HHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSL

Query:  CGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDE
        CGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+ GGG KGD SAGKLPSP+GGTSMS LRYGS IDSGGNQ+RGNDE
Subjt:  CGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDE

Query:  HQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        HQG+GE HFLLQPRGVNLTS R TDWR GLDATN+AYDL+GRTSHHSPENGDYDQIPD
Subjt:  HQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

A0A6J1CPC9 AT-hook motif nuclear-localized protein3.5e-19299.44Show/hide
Query:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR
        SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR
Subjt:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR

Query:  DHHASSLNALSASSSSFSAPSKKSQLAAL--GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVS
        DHHASSLNALSASSSSFSAPSKKSQLAAL  GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVS
Subjt:  DHHASSLNALSASSSSFSAPSKKSQLAAL--GNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVS

Query:  LCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGND
        LCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGND
Subjt:  LCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGND

Query:  EHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        EHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
Subjt:  EHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

A0A6J1CPX7 AT-hook motif nuclear-localized protein1.1e-193100Show/hide
Query:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR
        SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR
Subjt:  SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKR

Query:  DHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLC
        DHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLC
Subjt:  DHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLC

Query:  GSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDEH
        GSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDEH
Subjt:  GSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDEH

Query:  QGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
        QGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD
Subjt:  QGIGEGHFLLQPRGVNLTSQRPTDWRMGLDATNSAYDLTGRTSHHSPENGDYDQIPD

SwissProt top hitse value%identityAlignment
A1L4X7 AT-hook motif nuclear-localized protein 147.1e-8653.23Show/hide
Query:  SSYFHHH-QHHHQSPTTTTT---------SPTNGLLPPTHHLSSTAAAADGGSH-GVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA
        S YFHH  QHHH  PTT  T         S  NGL PP           DG S   VYPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK A++
Subjt:  SSYFHHH-QHHHQSPTTTTT---------SPTNGLLPPTHHLSSTAAAADGGSH-GVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA

Query:  SSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
        +S SSS+K +R+  A +   +S +S S    SKKSQL ++G  GQ F PH++N+A GEDV QKIM+F  Q K E+C+LSASG+ISNASLRQPA SGGN+ 
Subjt:  SSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT

Query:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKD-VGGGAKGDA--SAGKLPSPVGGTSMSGLRYGST
        YEG++EI+SL GSY+RT+ GGK+GGLSV LS+SDG IIGG +G  L AAGPVQVI+GTF +D KKD  G G KGDA  S  +L SPV    + G+ +   
Subjt:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKD-VGGGAKGDA--SAGKLPSPVGGTSMSGLRYGST

Query:  IDS-GGNQVRGNDE-----HQGIGEG---HFLLQ-PRGVNLTSQRPTDWRMGLDATN-----SAYDLTGRTSHHSPENGDYD-QIPD
        ++S G N +RGNDE     H   G G   HF++Q P+G+++T  RP++WR G ++ +       YDL+GR  H S ENGDY+ QIPD
Subjt:  IDS-GGNQVRGNDE-----HQGIGEG---HFLLQ-PRGVNLTSQRPTDWRMGLDATN-----SAYDLTGRTSHHSPENGDYD-QIPD

O22812 AT-hook motif nuclear-localized protein 104.1e-3337.87Show/hide
Query:  TTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVP---SAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALS
        T     P   + PP  +  ++A     G + V   ++P   S  ++ +  EP +++RGRPRKYG     ++      A S + S  +             
Subjt:  TTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVP---SAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALS

Query:  ASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIGG-
          SS     SK+ +L ALG+ G  F PHV+ V AGEDV  KIM       R +C+LSA+G+ISN +LRQ ATSGG +TYEGRFEI+SL GS+   +  G 
Subjt:  ASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIGG-

Query:  --KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVID----PKKDVGG-GAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDEHQGI
          +TGGLSV LSS DG+++GG V G L AA PVQ++VG+F+ D    PK+ VG  G          P+ V  T  S    G+  +S      G+  HQ  
Subjt:  --KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVID----PKKDVGG-GAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDEHQGI

Query:  G
        G
Subjt:  G

O80834 AT-hook motif nuclear-localized protein 92.4e-3341.08Show/hide
Query:  SPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSP---LEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSS
        S + G   P+ H   + A A GG+ G  PH +    ++  P     P +RKRGRPRKYG       A  +++ S+ + ++  KR                
Subjt:  SPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSP---LEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSS

Query:  FSAPSKKSQLAALG-----NAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIG--
             KK ++A++G     ++G SF PHVI V+ GED+  K++ F QQ  R IC+LSASG++S A+L QP+ S G I YEGRFEI++L  SY+    G  
Subjt:  FSAPSKKSQLAALG-----NAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIG--

Query:  -GKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV
          +TG LSV L+S DG +IGG +GGPL AA PVQVIVG+F+
Subjt:  -GKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV

Q8VYJ2 AT-hook motif nuclear-localized protein 11.0e-3140.53Show/hide
Query:  HQSPTTTTTSPTNGLLPPTHH------------LSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAK----KAATASSH-S
        +QSPT+ T  P     P +HH             ++T AA +G S G+                  ++KRGRPRKYG     +A       +A A SH  
Subjt:  HQSPTTTTTSPTNGLLPPTHH------------LSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAK----KAATASSH-S

Query:  SSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALG-----NAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNI
          S    D  AS   +    ++SF+      Q+  LG     + G +F PH+I V  GEDV  KI+ F QQ  R IC+LSA+G IS+ +LRQP +SGG +
Subjt:  SSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALG-----NAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNI

Query:  TYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV
        TYEGRFEI+SL GS++  D GG   +TGG+SV L+S DG ++GGG+ G L AA PVQV+VG+F+
Subjt:  TYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV

Q9SB31 AT-hook motif nuclear-localized protein 31.6e-3241.11Show/hide
Query:  HHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAA-------KKAATASSHSSSSKAKRDH
        +   P     +P   L+PPT   ++   AA    +   P S+     ++S  E  ++KRGRPRKY  P+  L           +   +S     K  R  
Subjt:  HHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAA-------KKAATASSHSSSSKAKRDH

Query:  HASSLNALSASSSSFSAPSKKSQLAALGNA---GQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSL
          S+     +    F      + LA +G A   G +F PHV+ V AGEDV  KIM F QQ  R ICILSA+G ISN +LRQ  TSGG +TYEGRFEI+SL
Subjt:  HASSLNALSASSSSFSAPSKKSQLAALGNA---GQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSL

Query:  CGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV
         GS+++ D GG   + GG+SVCL+  DG + GGG+ G   AAGPVQV+VGTF+
Subjt:  CGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV

Arabidopsis top hitse value%identityAlignment
AT2G45850.1 AT hook motif DNA-binding family protein1.7e-3441.08Show/hide
Query:  SPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSP---LEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSS
        S + G   P+ H   + A A GG+ G  PH +    ++  P     P +RKRGRPRKYG       A  +++ S+ + ++  KR                
Subjt:  SPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSP---LEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSS

Query:  FSAPSKKSQLAALG-----NAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIG--
             KK ++A++G     ++G SF PHVI V+ GED+  K++ F QQ  R IC+LSASG++S A+L QP+ S G I YEGRFEI++L  SY+    G  
Subjt:  FSAPSKKSQLAALG-----NAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIG--

Query:  -GKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV
          +TG LSV L+S DG +IGG +GGPL AA PVQVIVG+F+
Subjt:  -GKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV

AT2G45850.2 AT hook motif DNA-binding family protein1.7e-3441.08Show/hide
Query:  SPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSP---LEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSS
        S + G   P+ H   + A A GG+ G  PH +    ++  P     P +RKRGRPRKYG       A  +++ S+ + ++  KR                
Subjt:  SPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSP---LEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNALSASSSS

Query:  FSAPSKKSQLAALG-----NAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIG--
             KK ++A++G     ++G SF PHVI V+ GED+  K++ F QQ  R IC+LSASG++S A+L QP+ S G I YEGRFEI++L  SY+    G  
Subjt:  FSAPSKKSQLAALG-----NAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIG--

Query:  -GKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV
          +TG LSV L+S DG +IGG +GGPL AA PVQVIVG+F+
Subjt:  -GKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV

AT3G04590.1 AT hook motif DNA-binding family protein4.3e-7058.51Show/hide
Query:  SSYFHHH-QHHHQSPTTTTT---------SPTNGLLPPTHHLSSTAAAADGGSH-GVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA
        S YFHH  QHHH  PTT  T         S  NGL PP           DG S   VYPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK A++
Subjt:  SSYFHHH-QHHHQSPTTTTT---------SPTNGLLPPTHHLSSTAAAADGGSH-GVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA

Query:  SSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
        +S SSS+K +R+  A +   +S +S S    SKKSQL ++G  GQ F PH++N+A GEDV QKIM+F  Q K E+C+LSASG+ISNASLRQPA SGGN+ 
Subjt:  SSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT

Query:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKD-VGGGAKGDAS-AGKL
        YEG++EI+SL GSY+RT+ GGK+GGLSV LS+SDG IIGG +G  L AAGPVQVI+GTF +D KKD  G G KGDAS +GK+
Subjt:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKD-VGGGAKGDAS-AGKL

AT3G04590.2 AT hook motif DNA-binding family protein5.1e-8753.23Show/hide
Query:  SSYFHHH-QHHHQSPTTTTT---------SPTNGLLPPTHHLSSTAAAADGGSH-GVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA
        S YFHH  QHHH  PTT  T         S  NGL PP           DG S   VYPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK A++
Subjt:  SSYFHHH-QHHHQSPTTTTT---------SPTNGLLPPTHHLSSTAAAADGGSH-GVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA

Query:  SSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
        +S SSS+K +R+  A +   +S +S S    SKKSQL ++G  GQ F PH++N+A GEDV QKIM+F  Q K E+C+LSASG+ISNASLRQPA SGGN+ 
Subjt:  SSHSSSSKAKRDHHASSLNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT

Query:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKD-VGGGAKGDA--SAGKLPSPVGGTSMSGLRYGST
        YEG++EI+SL GSY+RT+ GGK+GGLSV LS+SDG IIGG +G  L AAGPVQVI+GTF +D KKD  G G KGDA  S  +L SPV    + G+ +   
Subjt:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKD-VGGGAKGDA--SAGKLPSPVGGTSMSGLRYGST

Query:  IDS-GGNQVRGNDE-----HQGIGEG---HFLLQ-PRGVNLTSQRPTDWRMGLDATN-----SAYDLTGRTSHHSPENGDYD-QIPD
        ++S G N +RGNDE     H   G G   HF++Q P+G+++T  RP++WR G ++ +       YDL+GR  H S ENGDY+ QIPD
Subjt:  IDS-GGNQVRGNDE-----HQGIGEG---HFLLQ-PRGVNLTSQRPTDWRMGLDATN-----SAYDLTGRTSHHSPENGDYD-QIPD

AT5G28590.1 DNA-binding family protein6.5e-4244.94Show/hide
Query:  LNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRT
        LN L + +  FS         AL   GQ F PH++N+  GEDV +KI+LF QQ K ++C+LSASGSISNASL   A+                      T
Subjt:  LNALSASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRT

Query:  DIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGK---LPSPVGGTSMSGLRYGSTIDSGGNQVRGNDEHQGI
          GGKTGGLSVCLS+SDG I GGGVGG LKAAGPVQV++GTF ++ KKD   GAKGD ++G    LPSP G  S+ G  Y   ++S G     NDEH  I
Subjt:  DIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGK---LPSPVGGTSMSGLRYGSTIDSGGNQVRGNDEHQGI

Query:  ------GEGHFLLQ-PRGVNLTSQRPTDWRMGLDATNSAYDLTGRTS
              G  HF+++ P+G+++T  RP++W        + YDL+G++S
Subjt:  ------GEGHFLLQ-PRGVNLTSQRPTDWRMGLDATNSAYDLTGRTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AGCTCCTACTTCCACCACCATCAGCACCACCATCAAAGTCCCACCACCACCACCACATCGCCGACCAATGGACTTTTACCCCCCACCCACCACCTCTCCTCCACCGCCGC
CGCCGCAGACGGCGGCTCTCACGGCGTCTACCCTCACTCCGTGCCCTCCGCCGCGGTCTCCTCCTCGCCCCTCGAGCCCGCTCGCCGGAAGAGGGGCCGCCCCAGGAAGT
ACGGCACTCCCGAGGAAGCTTTAGCTGCGAAGAAAGCTGCTACGGCGTCGTCTCACTCTTCCTCCTCCAAGGCCAAGAGGGACCACCACGCCTCTTCCCTTAATGCTCTC
TCCGCTTCTTCTTCTTCCTTCTCTGCCCCTTCCAAGAAATCCCAGTTGGCAGCACTTGGTAATGCAGGCCAAAGTTTTGCACCACATGTTATTAATGTGGCAGCTGGTGA
GGATGTTGGCCAGAAAATTATGCTGTTTATGCAGCAATGCAAGCGGGAAATCTGTATCCTTTCTGCTTCTGGTTCGATCTCCAATGCATCTCTCCGTCAGCCAGCCACAT
CCGGAGGCAATATTACGTATGAGGGTCGTTTTGAGATTGTTTCATTATGCGGATCTTATGTACGTACTGACATCGGAGGAAAGACTGGTGGTCTTAGTGTATGTTTGTCT
AGTTCTGATGGCCATATCATAGGAGGGGGAGTCGGTGGACCATTGAAGGCTGCTGGACCCGTGCAGGTTATTGTTGGTACTTTTGTGATCGACCCAAAGAAGGACGTTGG
TGGTGGTGCAAAAGGTGACGCATCTGCTGGCAAGTTGCCCTCACCTGTTGGTGGGACGTCGATGTCGGGTCTACGCTATGGCTCGACCATCGACTCAGGAGGTAATCAAG
TCAGGGGAAATGATGAACACCAAGGCATTGGGGAGGGTCATTTCTTGCTTCAGCCCCGGGGCGTGAATCTGACATCTCAGCGGCCCACAGACTGGAGGATGGGTCTGGAT
GCCACAAACAGTGCTTACGATTTGACAGGAAGAACAAGCCATCATTCTCCCGAAAACGGAGATTACGATCAGATTCCCGAT
mRNA sequenceShow/hide mRNA sequence
AGCTCCTACTTCCACCACCATCAGCACCACCATCAAAGTCCCACCACCACCACCACATCGCCGACCAATGGACTTTTACCCCCCACCCACCACCTCTCCTCCACCGCCGC
CGCCGCAGACGGCGGCTCTCACGGCGTCTACCCTCACTCCGTGCCCTCCGCCGCGGTCTCCTCCTCGCCCCTCGAGCCCGCTCGCCGGAAGAGGGGCCGCCCCAGGAAGT
ACGGCACTCCCGAGGAAGCTTTAGCTGCGAAGAAAGCTGCTACGGCGTCGTCTCACTCTTCCTCCTCCAAGGCCAAGAGGGACCACCACGCCTCTTCCCTTAATGCTCTC
TCCGCTTCTTCTTCTTCCTTCTCTGCCCCTTCCAAGAAATCCCAGTTGGCAGCACTTGGTAATGCAGGCCAAAGTTTTGCACCACATGTTATTAATGTGGCAGCTGGTGA
GGATGTTGGCCAGAAAATTATGCTGTTTATGCAGCAATGCAAGCGGGAAATCTGTATCCTTTCTGCTTCTGGTTCGATCTCCAATGCATCTCTCCGTCAGCCAGCCACAT
CCGGAGGCAATATTACGTATGAGGGTCGTTTTGAGATTGTTTCATTATGCGGATCTTATGTACGTACTGACATCGGAGGAAAGACTGGTGGTCTTAGTGTATGTTTGTCT
AGTTCTGATGGCCATATCATAGGAGGGGGAGTCGGTGGACCATTGAAGGCTGCTGGACCCGTGCAGGTTATTGTTGGTACTTTTGTGATCGACCCAAAGAAGGACGTTGG
TGGTGGTGCAAAAGGTGACGCATCTGCTGGCAAGTTGCCCTCACCTGTTGGTGGGACGTCGATGTCGGGTCTACGCTATGGCTCGACCATCGACTCAGGAGGTAATCAAG
TCAGGGGAAATGATGAACACCAAGGCATTGGGGAGGGTCATTTCTTGCTTCAGCCCCGGGGCGTGAATCTGACATCTCAGCGGCCCACAGACTGGAGGATGGGTCTGGAT
GCCACAAACAGTGCTTACGATTTGACAGGAAGAACAAGCCATCATTCTCCCGAAAACGGAGATTACGATCAGATTCCCGAT
Protein sequenceShow/hide protein sequence
SSYFHHHQHHHQSPTTTTTSPTNGLLPPTHHLSSTAAAADGGSHGVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKRDHHASSLNAL
SASSSSFSAPSKKSQLAALGNAGQSFAPHVINVAAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLS
SSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKDVGGGAKGDASAGKLPSPVGGTSMSGLRYGSTIDSGGNQVRGNDEHQGIGEGHFLLQPRGVNLTSQRPTDWRMGLD
ATNSAYDLTGRTSHHSPENGDYDQIPD