CuGenDBv2

Sgr015865 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr015865
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: AT-hook motif nuclear-localized protein
Genome location: tig00006144:514111..520707
RNA-Seq Expression: Sgr015865
Synteny: Sgr015865
Gene Ontology terms:
  GO:0005634 - nucleus (cellular component)
  GO:0003680 - AT DNA binding (molecular function)
InterPro domains:
  IPR005175 - PPC domain
  IPR039605 - AT-hook motif nuclear-localized protein
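The genome location field above packs a contig name and a coordinate pair into one string. The following is a minimal Python sketch of how such a string could be split and the locus span computed. The helper name parse_location is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple:
    """Split a 'contig:start..end' string into (contig, start, end).

    Hypothetical helper for illustration; not part of any database API.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


contig, start, end = parse_location("tig00006144:514111..520707")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the locus spans 6597 bp, which is longer than the coding sequence listed further down.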


Homology
GenBank top hits (e value, % identity, alignment)
XP_004139392.1 AT-hook motif nuclear-localized protein 14 [Cucumis sativus] | e value: 1.6e-157 | % identity: 88
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATAS
        MEPNENQLSSYFHHHQHHHQ+P TTTSPTNGLLP THH+S+ AAS   +DAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATAS
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATAS

Query:  SHSSSAKAKKDHA---SLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
        SHSSS+KAKK+ A   SLNAV+A SSSFS PSKKSQLAALGNAGQGFAPHVINV AGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA SGGNI 
Subjt:  SHSSSAKAKKDHA---SLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT

Query:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE---AKGDASAGKLPSPVGGTSMSSLRYGSTIDSG
        YEGRFEIVSLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE    KGD SA KLPSP+GGTSMS+LRYGS IDSG
Subjt:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE---AKGDASAGKLPSPVGGTSMSSLRYGSTIDSG

Query:  GNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT
        GNQ+RGNDEHQG+G+SHFLLQ RGVNLTSPRSTDWRTGLDATN+ AYDL+
Subjt:  GNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT

XP_008456418.1 PREDICTED: AT-hook motif nuclear-localized protein 14 isoform X2 [Cucumis melo] | e value: 4.8e-159 | % identity: 88.57
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATAS
        MEPNENQLSSYFHHHQHHHQ+P TTTSPTNGLLP THH+S+ AAS   +DAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATAS
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATAS

Query:  SHSSSAKAKKDHA---SLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
        SHSSS+KAKK+ A   SLNAV+A SSSFS PSKKSQLAALGNAGQGFAPHVINV AGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA SGGNI 
Subjt:  SHSSSAKAKKDHA---SLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT

Query:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE---AKGDASAGKLPSPVGGTSMSSLRYGSTIDSG
        YEGRFEIVSLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE    KGD SAGKLPSP+GGTSMS+LRYGS IDSG
Subjt:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE---AKGDASAGKLPSPVGGTSMSSLRYGSTIDSG

Query:  GNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT
        GNQ+RGNDEHQG+G+SHFLLQ RGVNLTSPRSTDWRTGLDATN AAYDL+
Subjt:  GNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT

XP_022143141.1 AT-hook motif nuclear-localized protein 14-like isoform X1 [Momordica charantia] | e value: 4.0e-161 | % identity: 90.08
Query:  MEPNENQLSSYFHHHQHHHQSP-TTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA
        MEPNENQLSSYFHHHQHHHQSP TTTTSPTNGLLP THH+SS AA   AAD G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA
Subjt:  MEPNENQLSSYFHHHQHHHQSP-TTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA

Query:  SSHSSSAKAKKDH--ASLNAVAASSSSFSGPSKKSQLAAL--GNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGN
        SSHSSS+KAK+DH  +SLNA++ASSSSFS PSKKSQLAAL  GNAGQ FAPHVINV AGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGN
Subjt:  SSHSSSAKAKKDH--ASLNAVAASSSSFSGPSKKSQLAAL--GNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGN

Query:  ITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE----AKGDASAGKLPSPVGGTSMSSLRYGSTI
        ITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+    AKGDASAGKLPSPVGGTSMS LRYGSTI
Subjt:  ITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE----AKGDASAGKLPSPVGGTSMSSLRYGSTI

Query:  DSGGNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT
        DSGGNQVRGNDEHQGIG+ HFLLQ RGVNLTS R TDWR GLDATNS AYDLT
Subjt:  DSGGNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT

XP_022143142.1 AT-hook motif nuclear-localized protein 14-like isoform X2 [Momordica charantia] | e value: 1.2e-162 | % identity: 90.6
Query:  MEPNENQLSSYFHHHQHHHQSP-TTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA
        MEPNENQLSSYFHHHQHHHQSP TTTTSPTNGLLP THH+SS AA   AAD G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA
Subjt:  MEPNENQLSSYFHHHQHHHQSP-TTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA

Query:  SSHSSSAKAKKDH--ASLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
        SSHSSS+KAK+DH  +SLNA++ASSSSFS PSKKSQLAALGNAGQ FAPHVINV AGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
Subjt:  SSHSSSAKAKKDH--ASLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT

Query:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE----AKGDASAGKLPSPVGGTSMSSLRYGSTIDS
        YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+    AKGDASAGKLPSPVGGTSMS LRYGSTIDS
Subjt:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE----AKGDASAGKLPSPVGGTSMSSLRYGSTIDS

Query:  GGNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT
        GGNQVRGNDEHQGIG+ HFLLQ RGVNLTS R TDWR GLDATNS AYDLT
Subjt:  GGNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT

XP_038890429.1 AT-hook motif nuclear-localized protein 14 [Benincasa hispida] | e value: 2.6e-160 | % identity: 89.14
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATAS
        MEPNENQLSSYFHHHQHHHQSP TTTSPTNGLLP THH+SS AAS    DAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATAS
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATAS

Query:  SHSSSAKAKKDHA---SLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
        SHSSS+KAKK+ A   SLNAV+A SSSFS PSKKSQLA LGNAGQGFAPHVINV AGEDVGQKIM+FMQQCKREICILSASGSISNASLRQPA SGGNI 
Subjt:  SHSSSAKAKKDHA---SLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT

Query:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE---AKGDASAGKLPSPVGGTSMSSLRYGSTIDSG
        YEGRFEIVSLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE    KGDASAGKLPSP+GGTSMS+LRYGS IDSG
Subjt:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE---AKGDASAGKLPSPVGGTSMSSLRYGSTIDSG

Query:  GNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT
        GNQ+RGNDEHQG+G+SHFLLQ RGVNLTSPRSTDWRTGLDATN+ AYDLT
Subjt:  GNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C2R6 AT-hook motif nuclear-localized protein | e value: 7.6e-158 | % identity: 88.07
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATAS
        MEPNENQLSSYFHHHQHHHQ+P TTTSPTNGLLP THH+S+ AAS   +DAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATAS
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATAS

Query:  SHSSSAKAKKDHA---SLNAVAASSSSFSGPSKKSQLAAL--GNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGN
        SHSSS+KAKK+ A   SLNAV+A SSSFS PSKKSQLAAL  GNAGQGFAPHVINV AGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA SGGN
Subjt:  SHSSSAKAKKDHA---SLNAVAASSSSFSGPSKKSQLAAL--GNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGN

Query:  ITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE---AKGDASAGKLPSPVGGTSMSSLRYGSTID
        I YEGRFEIVSLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE    KGD SAGKLPSP+GGTSMS+LRYGS ID
Subjt:  ITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE---AKGDASAGKLPSPVGGTSMSSLRYGSTID

Query:  SGGNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT
        SGGNQ+RGNDEHQG+G+SHFLLQ RGVNLTSPRSTDWRTGLDATN AAYDL+
Subjt:  SGGNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT

A0A1S3C3W0 AT-hook motif nuclear-localized protein | e value: 2.3e-159 | % identity: 88.57
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATAS
        MEPNENQLSSYFHHHQHHHQ+P TTTSPTNGLLP THH+S+ AAS   +DAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATAS
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATAS

Query:  SHSSSAKAKKDHA---SLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
        SHSSS+KAKK+ A   SLNAV+A SSSFS PSKKSQLAALGNAGQGFAPHVINV AGEDVGQKIM FMQQCKREICILSASGSISNASLRQPA SGGNI 
Subjt:  SHSSSAKAKKDHA---SLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT

Query:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE---AKGDASAGKLPSPVGGTSMSSLRYGSTIDSG
        YEGRFEIVSLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE    KGD SAGKLPSP+GGTSMS+LRYGS IDSG
Subjt:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE---AKGDASAGKLPSPVGGTSMSSLRYGSTIDSG

Query:  GNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT
        GNQ+RGNDEHQG+G+SHFLLQ RGVNLTSPRSTDWRTGLDATN AAYDL+
Subjt:  GNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT

A0A5A7VCX4 AT-hook motif nuclear-localized protein | e value: 4.9e-157 | % identity: 86.35
Query:  MEPNENQLSSYFHHHQHHHQSPTTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATAS
        MEPNENQLSSYFHHHQHHHQ+P TTTSPTNGLLP THH+S+ AAS   +DAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATAS
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATAS

Query:  SHSSSAKAKKDHA---SLNAVAASSSSFSGPSKKSQLAAL---------GNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQ
        SHSSS+KAKK+ A   SLNAV+A SSSFS PSKKSQLAAL         GNAGQGFAPHVINV AGEDVGQKIM FMQQCKREICILSASGSISNASLRQ
Subjt:  SHSSSAKAKKDHA---SLNAVAASSSSFSGPSKKSQLAAL---------GNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQ

Query:  PATSGGNITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE---AKGDASAGKLPSPVGGTSMSSL
        PA SGGNI YEGRFEIVSLCGSYVRTD+GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE    KGD SAGKLPSP+GGTSMS+L
Subjt:  PATSGGNITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE---AKGDASAGKLPSPVGGTSMSSL

Query:  RYGSTIDSGGNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT
        RYGS IDSGGNQ+RGNDEHQG+G+SHFLLQ RGVNLTSPRSTDWRTGLDATN AAYDL+
Subjt:  RYGSTIDSGGNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT

A0A6J1CPC9 AT-hook motif nuclear-localized protein | e value: 1.9e-161 | % identity: 90.08
Query:  MEPNENQLSSYFHHHQHHHQSP-TTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA
        MEPNENQLSSYFHHHQHHHQSP TTTTSPTNGLLP THH+SS AA   AAD G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA
Subjt:  MEPNENQLSSYFHHHQHHHQSP-TTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA

Query:  SSHSSSAKAKKDH--ASLNAVAASSSSFSGPSKKSQLAAL--GNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGN
        SSHSSS+KAK+DH  +SLNA++ASSSSFS PSKKSQLAAL  GNAGQ FAPHVINV AGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGN
Subjt:  SSHSSSAKAKKDH--ASLNAVAASSSSFSGPSKKSQLAAL--GNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGN

Query:  ITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE----AKGDASAGKLPSPVGGTSMSSLRYGSTI
        ITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+    AKGDASAGKLPSPVGGTSMS LRYGSTI
Subjt:  ITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE----AKGDASAGKLPSPVGGTSMSSLRYGSTI

Query:  DSGGNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT
        DSGGNQVRGNDEHQGIG+ HFLLQ RGVNLTS R TDWR GLDATNS AYDLT
Subjt:  DSGGNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT

A0A6J1CPX7 AT-hook motif nuclear-localized protein | e value: 6.0e-163 | % identity: 90.6
Query:  MEPNENQLSSYFHHHQHHHQSP-TTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA
        MEPNENQLSSYFHHHQHHHQSP TTTTSPTNGLLP THH+SS AA   AAD G H VYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA
Subjt:  MEPNENQLSSYFHHHQHHHQSP-TTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATA

Query:  SSHSSSAKAKKDH--ASLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
        SSHSSS+KAK+DH  +SLNA++ASSSSFS PSKKSQLAALGNAGQ FAPHVINV AGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT
Subjt:  SSHSSSAKAKKDH--ASLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNIT

Query:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE----AKGDASAGKLPSPVGGTSMSSLRYGSTIDS
        YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKK+    AKGDASAGKLPSPVGGTSMS LRYGSTIDS
Subjt:  YEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE----AKGDASAGKLPSPVGGTSMSSLRYGSTIDS

Query:  GGNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT
        GGNQVRGNDEHQGIG+ HFLLQ RGVNLTS R TDWR GLDATNS AYDLT
Subjt:  GGNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT

SwissProt top hits (e value, % identity, alignment)
A1L4X7 AT-hook motif nuclear-localized protein 14 | e value: 2.4e-76 | % identity: 50.94
Query:  SSYFHHH-QHHHQSPTT--TTSPT--------NGLL---PATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAK
        S YFHH  QHHH  PTT  TT+ T        NGL    P   H  ++ +S+ A       VYPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAK
Subjt:  SSYFHHH-QHHHQSPTT--TTSPT--------NGLL---PATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAK

Query:  KAATASSHSSSAKAKKDHASLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGG
        K A+++S SSSAK +++ A++     S++  SG SKKSQL ++G  GQ F PH++N+  GEDV QKIM+F  Q K E+C+LSASG+ISNASLRQPA SGG
Subjt:  KAATASSHSSSAKAKKDHASLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGG

Query:  NITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEA-----KGDA--SAGKLPSPVGGTSMSSLRY
        N+ YEG++EI+SL GSY+RT+ GGK+GGLSV LS+SDG IIGG +G  L AAGPVQVI+GTF +D KK+A     KGDA  S  +L SPV    +  + +
Subjt:  NITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEA-----KGDA--SAGKLPSPVGGTSMSSLRY

Query:  GSTIDS-GGNQVRGNDE------HQ-GI-GDSHFLLQS-RGVNLTSPRSTDWR----TGLDATNSAAYDLT
           ++S G N +RGNDE      HQ G+ G  HF++Q+ +G+++T  R ++WR    +G D      YDL+
Subjt:  GSTIDS-GGNQVRGNDE------HQ-GI-GDSHFLLQS-RGVNLTSPRSTDWR----TGLDATNSAAYDLT

O22812 AT-hook motif nuclear-localized protein 10 | e value: 1.9e-36 | % identity: 44.26
Query:  AGPHVVYPHSVP---SAAVSSSPLEPARRKRGRPRKYG--TPEEALAAKKAATASSHSSSAKAKKDHASLNAVAASSSSFSGPSKKSQLAALGNAGQGFA
        AG + V   ++P   S  ++ +  EP +++RGRPRKYG  + E +L     A + + S  +               SS     SK+ +L ALG+ G GF 
Subjt:  AGPHVVYPHSVP---SAAVSSSPLEPARRKRGRPRKYG--TPEEALAAKKAATASSHSSSAKAKKDHASLNAVAASSSSFSGPSKKSQLAALGNAGQGFA

Query:  PHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGP
        PHV+ V+AGEDV  KIM       R +C+LSA+G+ISN +LRQ ATSGG +TYEGRFEI+SL GS+   +  G   +TGGLSV LSS DG+++GG V G 
Subjt:  PHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGP

Query:  LKAAGPVQVIVGTFVIDPKKEAKGDASAGKLPSPV
        L AA PVQ++VG+F+ D +KE K       L SPV
Subjt:  LKAAGPVQVIVGTFVIDPKKEAKGDASAGKLPSPV

O49658 AT-hook motif nuclear-localized protein 2 | e value: 1.2e-30 | % identity: 39
Query:  TTTSPTNGL------LPATHHM-----SSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALA------AKKAATASSHSSSA
        TTT    G+       P+  HM     +SN    + A   P        PSAA+      P +++RGRPRKYG    A+       +  A T S     +
Subjt:  TTTSPTNGL------LPATHHM-----SSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALA------AKKAATASSHSSSA

Query:  KAKKDHASLNAVAASSSSFSGPSKKSQLAALG-----NAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGR
           +    +     + SSF  P  K Q+  LG     +A   F PH+I V AGEDV ++I+ F QQ    IC+L A+G +S+ +LRQP +SGG +TYEGR
Subjt:  KAKKDHASLNAVAASSSSFSGPSKKSQLAALG-----NAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGR

Query:  FEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV
        FEI+SL G+++ +D  G   +TGG+SV L+S DG ++GGGV G L AA P+QV+VGTF+
Subjt:  FEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV

O80834 AT-hook motif nuclear-localized protein 9 | e value: 5.2e-31 | % identity: 40.91
Query:  HQHHHQSPTTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLE-PARRKRGRPRKY---GTPEEALAAKKAATASSHSSSAKAKK
        H  +  SP  + S   G  P+ H   S A +A  A A PH +    V   A    P E P +RKRGRPRKY   G+   AL++   +T + ++S+ + + 
Subjt:  HQHHHQSPTTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLE-PARRKRGRPRKY---GTPEEALAAKKAATASSHSSSAKAKK

Query:  DHASLNAVAASSSSFSGPSKKSQLAALG-----NAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIV
                        G  KK ++A++G     ++G  F PHVI V  GED+  K++ F QQ  R IC+LSASG++S A+L QP+ S G I YEGRFEI+
Subjt:  DHASLNAVAASSSSFSGPSKKSQLAALG-----NAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIV

Query:  SLCGSYVRTDIG---GKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV-IDPKKEAK
        +L  SY+    G    +TG LSV L+S DG +IGG +GGPL AA PVQVIVG+F+   PK ++K
Subjt:  SLCGSYVRTDIG---GKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV-IDPKKEAK

Q9SB31 AT-hook motif nuclear-localized protein 3 | e value: 1.5e-30 | % identity: 39.93
Query:  NENQLSSYFHHHQHHHQS---------PTTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAA-
        N N  SS+    QH   +         P    +P   L+P T   +  AA+  AA    +   P S+     ++S  E  ++KRGRPRKY  P+  L   
Subjt:  NENQLSSYFHHHQHHHQS---------PTTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAA-

Query:  ------KKAATASSHSSSAKAKKDHASLNAVAASSS--SFSGPSKKSQLAALGNA---GQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSIS
                +   +S     K  +     N     S    F      + LA +G A   G  F PHV+ V AGEDV  KIM F QQ  R ICILSA+G IS
Subjt:  ------KKAATASSHSSSAKAKKDHASLNAVAASSS--SFSGPSKKSQLAALGNA---GQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSIS

Query:  NASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV
        N +LRQ  TSGG +TYEGRFEI+SL GS+++ D GG   + GG+SVCL+  DG + GGG+ G   AAGPVQV+VGTF+
Subjt:  NASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFV

Arabidopsis top hits (e value, % identity, alignment)
AT2G33620.1 AT hook motif DNA-binding family protein | e value: 1.3e-37 | % identity: 44.26
Query:  AGPHVVYPHSVP---SAAVSSSPLEPARRKRGRPRKYG--TPEEALAAKKAATASSHSSSAKAKKDHASLNAVAASSSSFSGPSKKSQLAALGNAGQGFA
        AG + V   ++P   S  ++ +  EP +++RGRPRKYG  + E +L     A + + S  +               SS     SK+ +L ALG+ G GF 
Subjt:  AGPHVVYPHSVP---SAAVSSSPLEPARRKRGRPRKYG--TPEEALAAKKAATASSHSSSAKAKKDHASLNAVAASSSSFSGPSKKSQLAALGNAGQGFA

Query:  PHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGP
        PHV+ V+AGEDV  KIM       R +C+LSA+G+ISN +LRQ ATSGG +TYEGRFEI+SL GS+   +  G   +TGGLSV LSS DG+++GG V G 
Subjt:  PHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGP

Query:  LKAAGPVQVIVGTFVIDPKKEAKGDASAGKLPSPV
        L AA PVQ++VG+F+ D +KE K       L SPV
Subjt:  LKAAGPVQVIVGTFVIDPKKEAKGDASAGKLPSPV

AT2G33620.2 AT hook motif DNA-binding family protein | e value: 1.3e-37 | % identity: 44.26
Query:  AGPHVVYPHSVP---SAAVSSSPLEPARRKRGRPRKYG--TPEEALAAKKAATASSHSSSAKAKKDHASLNAVAASSSSFSGPSKKSQLAALGNAGQGFA
        AG + V   ++P   S  ++ +  EP +++RGRPRKYG  + E +L     A + + S  +               SS     SK+ +L ALG+ G GF 
Subjt:  AGPHVVYPHSVP---SAAVSSSPLEPARRKRGRPRKYG--TPEEALAAKKAATASSHSSSAKAKKDHASLNAVAASSSSFSGPSKKSQLAALGNAGQGFA

Query:  PHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGP
        PHV+ V+AGEDV  KIM       R +C+LSA+G+ISN +LRQ ATSGG +TYEGRFEI+SL GS+   +  G   +TGGLSV LSS DG+++GG V G 
Subjt:  PHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGP

Query:  LKAAGPVQVIVGTFVIDPKKEAKGDASAGKLPSPV
        L AA PVQ++VG+F+ D +KE K       L SPV
Subjt:  LKAAGPVQVIVGTFVIDPKKEAKGDASAGKLPSPV

AT2G33620.3 AT hook motif DNA-binding family protein | e value: 1.3e-37 | % identity: 44.26
Query:  AGPHVVYPHSVP---SAAVSSSPLEPARRKRGRPRKYG--TPEEALAAKKAATASSHSSSAKAKKDHASLNAVAASSSSFSGPSKKSQLAALGNAGQGFA
        AG + V   ++P   S  ++ +  EP +++RGRPRKYG  + E +L     A + + S  +               SS     SK+ +L ALG+ G GF 
Subjt:  AGPHVVYPHSVP---SAAVSSSPLEPARRKRGRPRKYG--TPEEALAAKKAATASSHSSSAKAKKDHASLNAVAASSSSFSGPSKKSQLAALGNAGQGFA

Query:  PHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGP
        PHV+ V+AGEDV  KIM       R +C+LSA+G+ISN +LRQ ATSGG +TYEGRFEI+SL GS+   +  G   +TGGLSV LSS DG+++GG V G 
Subjt:  PHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSDGHIIGGGVGGP

Query:  LKAAGPVQVIVGTFVIDPKKEAKGDASAGKLPSPV
        L AA PVQ++VG+F+ D +KE K       L SPV
Subjt:  LKAAGPVQVIVGTFVIDPKKEAKGDASAGKLPSPV

AT3G04590.1 AT hook motif DNA-binding family protein | e value: 1.1e-68 | % identity: 56.68
Query:  SSYFHHH-QHHHQSPTT--TTSPT--------NGLL---PATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAK
        S YFHH  QHHH  PTT  TT+ T        NGL    P   H  ++ +S+ A       VYPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAK
Subjt:  SSYFHHH-QHHHQSPTT--TTSPT--------NGLL---PATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAK

Query:  KAATASSHSSSAKAKKDHASLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGG
        K A+++S SSSAK +++ A++     S++  SG SKKSQL ++G  GQ F PH++N+  GEDV QKIM+F  Q K E+C+LSASG+ISNASLRQPA SGG
Subjt:  KAATASSHSSSAKAKKDHASLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGG

Query:  NITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEAKGDASAG
        N+ YEG++EI+SL GSY+RT+ GGK+GGLSV LS+SDG IIGG +G  L AAGPVQVI+GTF +D KK+A G    G
Subjt:  NITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEAKGDASAG

AT3G04590.2 AT hook motif DNA-binding family protein | e value: 1.7e-77 | % identity: 50.94
Query:  SSYFHHH-QHHHQSPTT--TTSPT--------NGLL---PATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAK
        S YFHH  QHHH  PTT  TT+ T        NGL    P   H  ++ +S+ A       VYPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAK
Subjt:  SSYFHHH-QHHHQSPTT--TTSPT--------NGLL---PATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAK

Query:  KAATASSHSSSAKAKKDHASLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGG
        K A+++S SSSAK +++ A++     S++  SG SKKSQL ++G  GQ F PH++N+  GEDV QKIM+F  Q K E+C+LSASG+ISNASLRQPA SGG
Subjt:  KAATASSHSSSAKAKKDHASLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGG

Query:  NITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEA-----KGDA--SAGKLPSPVGGTSMSSLRY
        N+ YEG++EI+SL GSY+RT+ GGK+GGLSV LS+SDG IIGG +G  L AAGPVQVI+GTF +D KK+A     KGDA  S  +L SPV    +  + +
Subjt:  NITYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEA-----KGDA--SAGKLPSPVGGTSMSSLRY

Query:  GSTIDS-GGNQVRGNDE------HQ-GI-GDSHFLLQS-RGVNLTSPRSTDWR----TGLDATNSAAYDLT
           ++S G N +RGNDE      HQ G+ G  HF++Q+ +G+++T  R ++WR    +G D      YDL+
Subjt:  GSTIDS-GGNQVRGNDE------HQ-GI-GDSHFLLQS-RGVNLTSPRSTDWR----TGLDATNSAAYDLT
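
The middle line of each alignment block above follows the usual BLAST pairwise convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The following Python sketch shows how identities and positives could be tallied from one block. The function name midline_stats is hypothetical, and the example uses only the final block of the first GenBank hit, so its percentage is not expected to match the whole-alignment % identity reported for that hit.

```python
def midline_stats(query_line: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block.

    Hypothetical helper for illustration. The alignment length is taken from
    the query line (gap characters included), because trailing spaces on the
    midline are easily lost when a page is copied as text.
    """
    length = len(query_line)
    identities = sum(ch.isalpha() for ch in midline)  # letters = identical residues
    positives = identities + midline.count("+")       # "+" = conservative substitution
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": 100.0 * identities / length,
    }


# Final block of the XP_004139392.1 alignment, copied from the record above.
query = "GNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWRTGLDATNSAAYDLT"
middle = "GNQ+RGNDEHQG+G+SHFLLQ RGVNLTSPRSTDWRTGLDATN+ AYDL+"

stats = midline_stats(query, middle)
print(stats)
```

For a whole-alignment figure, the same counts would be summed over every block of a hit before dividing.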


Sequences
CDS sequence
ATGGAACCCAATGAAAACCAGCTCAGCTCCTACTTCCACCACCATCAGCACCACCATCAAAGTCCCACCACCACCACATCGCCGACCAATGGACTTTTACCGGCCACCCA
CCACATGTCTTCCAACGCCGCCTCCGCCACCGCCGCAGACGCTGGCCCCCATGTCGTCTATCCTCACTCCGTGCCCTCTGCGGCGGTGTCGTCCTCGCCGCTCGAGCCTG
CTCGGCGGAAGAGGGGACGGCCCAGGAAGTATGGTACTCCGGAAGAAGCTTTAGCTGCCAAGAAAGCTGCTACGGCGTCGTCTCACTCTTCGTCCGCCAAGGCTAAGAAG
GACCACGCTTCCCTTAATGCTGTCGCCGCTTCTTCTTCTTCCTTCTCCGGGCCTTCCAAGAAATCCCAGTTGGCTGCACTTGGTAATGCAGGCCAAGGTTTTGCGCCACA
TGTTATTAATGTGGTAGCTGGTGAGGACGTTGGCCAGAAAATTATGCTGTTTATGCAACAATGTAAGCGGGAAATCTGTATCCTTTCTGCGTCTGGTTCAATCTCCAATG
CATCTCTCCGTCAGCCAGCCACATCTGGAGGCAATATTACGTATGAGGGTCGTTTTGAGATTGTTTCATTATGCGGATCTTATGTACGTACTGACATTGGCGGAAAGACT
GGTGGTCTTAGTGTATGTTTGTCGAGTTCTGATGGCCATATCATAGGAGGGGGAGTTGGTGGACCGTTGAAGGCTGCGGGACCCGTACAGGTTATTGTTGGTACCTTTGT
AATTGACCCAAAGAAGGAAGCTAAAGGCGATGCATCTGCTGGCAAGTTGCCCTCACCTGTTGGTGGGACGTCGATGTCAAGTCTACGCTATGGCTCGACCATTGACTCGG
GAGGTAATCAAGTCAGGGGCAATGATGAGCACCAAGGTATTGGGGACAGTCATTTCTTGCTTCAGTCCCGGGGAGTGAATCTGACGTCTCCACGGTCCACTGACTGGAGG
ACAGGTCTGGATGCCACAAACAGTGCTGCTTATGATTTGACAGATGATGAGCTTTTTTGGCAGGAAGAACAGGCCATCAATCTCCTGAAAACGGAGATTACGATCAGATT
CCCGATTAAGAGCTAA
mRNA sequence
ATGGAACCCAATGAAAACCAGCTCAGCTCCTACTTCCACCACCATCAGCACCACCATCAAAGTCCCACCACCACCACATCGCCGACCAATGGACTTTTACCGGCCACCCA
CCACATGTCTTCCAACGCCGCCTCCGCCACCGCCGCAGACGCTGGCCCCCATGTCGTCTATCCTCACTCCGTGCCCTCTGCGGCGGTGTCGTCCTCGCCGCTCGAGCCTG
CTCGGCGGAAGAGGGGACGGCCCAGGAAGTATGGTACTCCGGAAGAAGCTTTAGCTGCCAAGAAAGCTGCTACGGCGTCGTCTCACTCTTCGTCCGCCAAGGCTAAGAAG
GACCACGCTTCCCTTAATGCTGTCGCCGCTTCTTCTTCTTCCTTCTCCGGGCCTTCCAAGAAATCCCAGTTGGCTGCACTTGGTAATGCAGGCCAAGGTTTTGCGCCACA
TGTTATTAATGTGGTAGCTGGTGAGGACGTTGGCCAGAAAATTATGCTGTTTATGCAACAATGTAAGCGGGAAATCTGTATCCTTTCTGCGTCTGGTTCAATCTCCAATG
CATCTCTCCGTCAGCCAGCCACATCTGGAGGCAATATTACGTATGAGGGTCGTTTTGAGATTGTTTCATTATGCGGATCTTATGTACGTACTGACATTGGCGGAAAGACT
GGTGGTCTTAGTGTATGTTTGTCGAGTTCTGATGGCCATATCATAGGAGGGGGAGTTGGTGGACCGTTGAAGGCTGCGGGACCCGTACAGGTTATTGTTGGTACCTTTGT
AATTGACCCAAAGAAGGAAGCTAAAGGCGATGCATCTGCTGGCAAGTTGCCCTCACCTGTTGGTGGGACGTCGATGTCAAGTCTACGCTATGGCTCGACCATTGACTCGG
GAGGTAATCAAGTCAGGGGCAATGATGAGCACCAAGGTATTGGGGACAGTCATTTCTTGCTTCAGTCCCGGGGAGTGAATCTGACGTCTCCACGGTCCACTGACTGGAGG
ACAGGTCTGGATGCCACAAACAGTGCTGCTTATGATTTGACAGATGATGAGCTTTTTTGGCAGGAAGAACAGGCCATCAATCTCCTGAAAACGGAGATTACGATCAGATT
CCCGATTAAGAGCTAA
Protein sequence
MEPNENQLSSYFHHHQHHHQSPTTTTSPTNGLLPATHHMSSNAASATAADAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSAKAKK
DHASLNAVAASSSSFSGPSKKSQLAALGNAGQGFAPHVINVVAGEDVGQKIMLFMQQCKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLCGSYVRTDIGGKT
GGLSVCLSSSDGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEAKGDASAGKLPSPVGGTSMSSLRYGSTIDSGGNQVRGNDEHQGIGDSHFLLQSRGVNLTSPRSTDWR
TGLDATNSAAYDLTDDELFWQEEQAINLLKTEITIRFPIKS
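
The protein sequence above should correspond to a translation of the CDS sequence. The following Python sketch shows one way to check that correspondence, assuming the standard genetic code (NCBI translation table 1). The function name translate is hypothetical, and only the first ten codons and the last five codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Hypothetical helper for illustration. Whitespace is ignored, so a
    line-wrapped sequence can be passed in directly. Unknown codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons and last five in-frame codons of the CDS listed above.
print(translate("ATGGAACCCAATGAAAACCAGCTCAGCTCC"))
print(translate("CCGATTAAGAGCTAA"))
```

The two fragments translate to MEPNENQLSS and PIKS, matching the start and end of the protein sequence in this record.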