CuGenDBv2

Sed0020420 (gene) of Chayote v1 genome

Gene ID: Sed0020420
Organism: Sechium edule (Chayote v1)
Description: AT-hook motif nuclear-localized protein
Genome location: LG08:30909944..30919910
RNA-Seq Expression: Sed0020420
Synteny: Sed0020420
Gene Ontology terms:
  GO:0005634 - nucleus (cellular component)
  GO:0003680 - AT DNA binding (molecular function)
InterPro domains:
  IPR005175 - PPC domain
  IPR039605 - AT-hook motif nuclear-localized protein
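The genome location in this record follows a sequence:start..end pattern. A minimal sketch of splitting it into its parts and computing the span, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention). The function name parse_location is hypothetical and not part of CuGenDB:

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a string such as 'LG08:30909944..30919910' into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seq_id, start, end = parse_location("LG08:30909944..30919910")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seq_id, start, end, span)
```

Under that assumption the gene spans 9967 bp on LG08, which is longer than the mRNA listed further down and would be consistent with the presence of introns.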


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6599338.1 AT-hook motif nuclear-localized protein 14, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 4.2e-152, identity: 83.93%)
Query:  MEPHENQLNSYFHHHHHQHHHQSPTASPTNGLLPSTHHLPSSSADA-VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAK
        MEP+ENQL+SYF   HHQHHHQSPT SPTNGLLP THHL SS A   VVYPHSVPSAAVS   LEPARRKRGRPRKYGTPEEALAAKKA+TASSHSSS+K
Subjt:  MEPHENQLNSYFHHHHHQHHHQSPTASPTNGLLPSTHHLPSSSADA-VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAK

Query:  AKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIV
        AKKDLAS SSLNAVSASSSFSA SKKS L A GNAGQGF+PHVINVAAGEDVGQKIMLFMQQCK+EICILSASGSISNASLRQPA SGGNI YEGRFEIV
Subjt:  AKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIV

Query:  SLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSGGNQIRGI
        SLCGSY+RTD GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTF+I  KKEV GGV GDASA KLPSP GGT MSNLRYG N+D+GGNQ+RG 
Subjt:  SLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSGGNQIRGI

Query:  DEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD
        DEHQGIGESHFLLQPRG         DWRM LD TN AYDLTGRT  HHSPENGDYDQI D
Subjt:  DEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD

XP_004139392.1 AT-hook motif nuclear-localized protein 14 [Cucumis sativus] (e-value: 6.4e-153, identity: 83.88%)
Query:  MEPHENQLNSYFHHHHHQHHHQSP-TASPTNGLLPSTHHLPSSSADA-----VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSH
        MEP+ENQL+SYF  HHHQHHHQ+P T SPTNGLLP THHL +++A +     VVYPHSVPSAAVS   LEPARRKRGRPRKYGTPEEALAAKKA+TASSH
Subjt:  MEPHENQLNSYFHHHHHQHHHQSP-TASPTNGLLPSTHHLPSSSADA-----VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSH

Query:  SSSAKAKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEG
        SSS+KAKK+LAS SSLNAVSASSSFS PSKKS L A GNAGQGFAPHVINVAAGEDVGQKIM FMQQCK+EICILSASGSISNASLRQPAASGGNIAYEG
Subjt:  SSSAKAKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEG

Query:  RFEIVSLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSGGN
        RFEIVSLCGSYVRTD+GGKTGGLSVCLSS+ GHIIGGGVGGPLKAAGPVQVIVGTFVI  KKE GGG  GD SA KLPSPIGGTSMSNLRYG NIDSGGN
Subjt:  RFEIVSLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSGGN

Query:  QIRGIDEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD
        QIRG DEHQG+GESHFLLQPRG         DWR GLD TNTAYDL+GRTG HHSPENGDYDQI D
Subjt:  QIRGIDEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD

XP_022946672.1 AT-hook motif nuclear-localized protein 14-like [Cucurbita moschata] (e-value: 3.2e-152, identity: 84.21%)
Query:  MEPHENQLNSYFHHHHHQHHHQSPTASPTNGLLPSTHHLPSSSADA-VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAK
        MEP+ENQL+SYF   HHQHHHQSPT SPTNGLLP THHL SS A   VVYPHSVPSAAVS   LEPARRKRGRPRKYGTPEEALAAKKA+TASSHSSS+K
Subjt:  MEPHENQLNSYFHHHHHQHHHQSPTASPTNGLLPSTHHLPSSSADA-VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAK

Query:  AKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIV
        AKKDLAS SSLNAVSASSSFSA SKKS L A GNAGQGF+PHVINVAAGEDVGQKIMLFMQQCK+EICILSASGSISNASLRQPA SGGNI YEGRFEIV
Subjt:  AKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIV

Query:  SLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSGGNQIRGI
        SLCGSY+RTD GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVI  KKEV GGV GDASA KLPSP GGT MSNLRYG N+D+GGNQ+RG 
Subjt:  SLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSGGNQIRGI

Query:  DEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD
        DEHQGIGESHFLLQPRG         DWRM LD TN AYDLTGRT  HHSPENGDYDQI D
Subjt:  DEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD

XP_023546506.1 AT-hook motif nuclear-localized protein 14-like [Cucurbita pepo subsp. pepo] (e-value: 1.4e-152, identity: 84.49%)
Query:  MEPHENQLNSYFHHHHHQHHHQSPTASPTNGLLPSTHHLPSSSADA-VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAK
        MEP+ENQL+SYF   HHQHHHQSPT SPTNGLLP THHL SS A   VVYPHSVPSAAVS   LEPARRKRGRPRKYGTPEEALAAKKA+TASSHSSS+K
Subjt:  MEPHENQLNSYFHHHHHQHHHQSPTASPTNGLLPSTHHLPSSSADA-VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAK

Query:  AKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIV
        AKKDLAS SSLNAVSASSSFSA SKKS L A GNAGQGF+PHVINVAAGEDVGQKIMLFMQQCK+EICILSASGSISNASLRQPA SGGNI YEGRFEIV
Subjt:  AKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIV

Query:  SLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSGGNQIRGI
        SLCGSY+RTD GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVI  KKEV GGV GDASA KLPSP GGT MSNLRYG N+DSGGNQ+RG 
Subjt:  SLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSGGNQIRGI

Query:  DEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD
        DEHQGIGESHFLLQPRG         DWRM LD TN AYDLTGRT  HHSPENGDYDQI D
Subjt:  DEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD

XP_038890429.1 AT-hook motif nuclear-localized protein 14 [Benincasa hispida] (e-value: 1.1e-155, identity: 86.03%)
Query:  MEPHENQLNSYFHHHHHQHHHQSP-TASPTNGLLPSTHHLPSSSA-DA---VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSHS
        MEP+ENQL+SYF  HHHQHHHQSP T SPTNGLLP THHL S++A DA   VVYPHSVPSAAVS   LEPARRKRGRPRKYGTPEEALAAKKA+TASSHS
Subjt:  MEPHENQLNSYFHHHHHQHHHQSP-TASPTNGLLPSTHHLPSSSA-DA---VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSHS

Query:  SSAKAKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGR
        SS+KAKK+LAS SSLNAVSASSSFSAPSKKS L   GNAGQGFAPHVINVAAGEDVGQKIM+FMQQCK+EICILSASGSISNASLRQPAASGGNIAYEGR
Subjt:  SSAKAKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGR

Query:  FEIVSLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSGGNQ
        FEIVSLCGSYVRTD+GGKTGGLSVCLSS+ GHIIGGGVGGPLKAAGPVQVIVGTFVI  KKEVGGG  GDASA KLPSPIGGTSMSNLRYG NIDSGGNQ
Subjt:  FEIVSLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSGGNQ

Query:  IRGIDEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD
        IRG DEHQG+GESHFLLQPRG         DWR GLD TNTAYDLTGRTG HHSPENGDYDQI D
Subjt:  IRGIDEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3C2R6 AT-hook motif nuclear-localized protein (e-value: 8.5e-151, identity: 82.88%)
Query:  MEPHENQLNSYFHHHHHQHHHQSP-TASPTNGLLPSTHHLPSSSADA-----VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSH
        MEP+ENQL+SYF  HHHQHHHQ+P T SPTNGLLP THHL +++A +     VVYPHSVPSAAVS   LEPARRKRGRPRKYGTPEEALAAKKA+TASSH
Subjt:  MEPHENQLNSYFHHHHHQHHHQSP-TASPTNGLLPSTHHLPSSSADA-----VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSH

Query:  SSSAKAKKDLASPSSLNAVSASSSFSAPSKKSHLPA--PGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAY
        SSS+KAKK+LAS SSLNAVSASSSFS PSKKS L A   GNAGQGFAPHVINVAAGEDVGQKIM FMQQCK+EICILSASGSISNASLRQPAASGGNIAY
Subjt:  SSSAKAKKDLASPSSLNAVSASSSFSAPSKKSHLPA--PGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAY

Query:  EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSG
        EGRFEIVSLCGSYVRTD+GGKTGGLSVCLSS+ GHIIGGGVGGPLKAAGPVQVIVGTFVI  KKE GGG  GD SA KLPSPIGGTSMSNLRYG NIDSG
Subjt:  EGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSG

Query:  GNQIRGIDEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD
        GNQIRG DEHQG+GESHFLLQPRG         DWR GLD TN AYDL+GRT  HHSPENGDYDQI D
Subjt:  GNQIRGIDEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD

A0A1S3C3W0 AT-hook motif nuclear-localized protein (e-value: 3.4e-152, identity: 83.33%)
Query:  MEPHENQLNSYFHHHHHQHHHQSP-TASPTNGLLPSTHHLPSSSADA-----VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSH
        MEP+ENQL+SYF  HHHQHHHQ+P T SPTNGLLP THHL +++A +     VVYPHSVPSAAVS   LEPARRKRGRPRKYGTPEEALAAKKA+TASSH
Subjt:  MEPHENQLNSYFHHHHHQHHHQSP-TASPTNGLLPSTHHLPSSSADA-----VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSH

Query:  SSSAKAKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEG
        SSS+KAKK+LAS SSLNAVSASSSFS PSKKS L A GNAGQGFAPHVINVAAGEDVGQKIM FMQQCK+EICILSASGSISNASLRQPAASGGNIAYEG
Subjt:  SSSAKAKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEG

Query:  RFEIVSLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSGGN
        RFEIVSLCGSYVRTD+GGKTGGLSVCLSS+ GHIIGGGVGGPLKAAGPVQVIVGTFVI  KKE GGG  GD SA KLPSPIGGTSMSNLRYG NIDSGGN
Subjt:  RFEIVSLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSGGN

Query:  QIRGIDEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD
        QIRG DEHQG+GESHFLLQPRG         DWR GLD TN AYDL+GRT  HHSPENGDYDQI D
Subjt:  QIRGIDEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD

A0A5A7VCX4 AT-hook motif nuclear-localized protein (e-value: 7.2e-150, identity: 81.33%)
Query:  MEPHENQLNSYFHHHHHQHHHQSP-TASPTNGLLPSTHHLPSSSADA-----VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSH
        MEP+ENQL+SYF  HHHQHHHQ+P T SPTNGLLP THHL +++A +     VVYPHSVPSAAVS   LEPARRKRGRPRKYGTPEEALAAKKA+TASSH
Subjt:  MEPHENQLNSYFHHHHHQHHHQSP-TASPTNGLLPSTHHLPSSSADA-----VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSH

Query:  SSSAKAKKDLASPSSLNAVSASSSFSAPSKKSHLPAP---------GNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAA
        SSS+KAKK+LAS SSLNAVSASSSFS PSKKS L A          GNAGQGFAPHVINVAAGEDVGQKIM FMQQCK+EICILSASGSISNASLRQPAA
Subjt:  SSSAKAKKDLASPSSLNAVSASSSFSAPSKKSHLPAP---------GNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAA

Query:  SGGNIAYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRY
        SGGNIAYEGRFEIVSLCGSYVRTD+GGKTGGLSVCLSS+ GHIIGGGVGGPLKAAGPVQVIVGTFVI  KKE GGG  GD SA KLPSPIGGTSMSNLRY
Subjt:  SGGNIAYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRY

Query:  GLNIDSGGNQIRGIDEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD
        G NIDSGGNQIRG DEHQG+GESHFLLQPRG         DWR GLD TN AYDL+GRT  HHSPENGDYDQI D
Subjt:  GLNIDSGGNQIRGIDEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD

A0A6J1G4C2 AT-hook motif nuclear-localized protein (e-value: 1.5e-152, identity: 84.21%)
Query:  MEPHENQLNSYFHHHHHQHHHQSPTASPTNGLLPSTHHLPSSSADA-VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAK
        MEP+ENQL+SYF   HHQHHHQSPT SPTNGLLP THHL SS A   VVYPHSVPSAAVS   LEPARRKRGRPRKYGTPEEALAAKKA+TASSHSSS+K
Subjt:  MEPHENQLNSYFHHHHHQHHHQSPTASPTNGLLPSTHHLPSSSADA-VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAK

Query:  AKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIV
        AKKDLAS SSLNAVSASSSFSA SKKS L A GNAGQGF+PHVINVAAGEDVGQKIMLFMQQCK+EICILSASGSISNASLRQPA SGGNI YEGRFEIV
Subjt:  AKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIV

Query:  SLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSGGNQIRGI
        SLCGSY+RTD GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVI  KKEV GGV GDASA KLPSP GGT MSNLRYG N+D+GGNQ+RG 
Subjt:  SLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSGGNQIRGI

Query:  DEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD
        DEHQGIGESHFLLQPRG         DWRM LD TN AYDLTGRT  HHSPENGDYDQI D
Subjt:  DEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD

A0A6J1K9V5 AT-hook motif nuclear-localized protein (e-value: 2.2e-151, identity: 83.66%)
Query:  MEPHENQLNSYFHHHHHQHHHQSPTASPTNGLLPSTHHLPSSSADA-VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAK
        MEP+ENQL+SYF   HHQHHHQSPT SPTNGLLP THHL SS A   VVYPHSVPSAAVS   LEPARRKRGRPRKYGTPEEALAAKKA+TASSHSSS+K
Subjt:  MEPHENQLNSYFHHHHHQHHHQSPTASPTNGLLPSTHHLPSSSADA-VVYPHSVPSAAVS---LEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAK

Query:  AKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIV
        AKKDLAS SSLNAVSASSSFSA SKKS L A GNAGQGF+PHVINVAAGEDVGQKIMLFMQQCK+EICILSASGSISNASLRQPA SGGNI YEGRFEIV
Subjt:  AKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIV

Query:  SLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSGGNQIRGI
        SLCGSY+RTD GGKTGGLSVCLSS++GHIIGGGVGGPLKAAGPVQVIVGTFVI  KKEV GGV GDAS  KLPSP GGT MSNLRYG  +D+GGNQ+RG 
Subjt:  SLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSGGNQIRGI

Query:  DEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD
        DEHQGIGESHFLLQPRG         DWRM LD TN AYDLTGRT  HHSPENGDYDQI D
Subjt:  DEHQGIGESHFLLQPRG------PQQDWRMGLDGTNTAYDLTGRTGHHHSPENGDYDQIAD

SwissProt top hits (e-value, % identity, alignment)
A1L4X7 AT-hook motif nuclear-localized protein 14 (e-value: 9.9e-72, identity: 49.1%)
Query:  EPHENQLNSYFHHHHHQHHHQSPTA------------SPTNGLL---PSTHHLPS-SSADAVVYPHSVPSAAVS--LEPARRKRGRPRKYGTPEEALAAK
        +  + +L S + HH  QHHH  PT             S  NGL    P   H P+  S+   VYPHSVPS+AV+  +EP +RKRGRPRKY TPE+ALAAK
Subjt:  EPHENQLNSYFHHHHHQHHHQSPTA------------SPTNGLL---PSTHHLPS-SSADAVVYPHSVPSAAVS--LEPARRKRGRPRKYGTPEEALAAK

Query:  KASTASSHSSSAKAKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAAS
        K ++++S SSSAK +++LA+ +        S+ S  SKKS L + G  GQ F PH++N+A GEDV QKIM+F  Q K E+C+LSASG+ISNASLRQPA S
Subjt:  KASTASSHSSSAKAKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAAS

Query:  GGNIAYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKE-VGGGVIGDA--SASKLPSPIGGTSMSNL
        GGN+ YEG++EI+SL GSY+RT+ GGK+GGLSV LS+S+G IIGG +G  L AAGPVQVI+GTF +  KK+  G G  GDA  S S+L SP+    +  +
Subjt:  GGNIAYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKE-VGGGVIGDA--SASKLPSPIGGTSMSNL

Query:  RYGLNIDS-GGNQIRGIDE------HQ-GI-GESHFLLQ-PRG------PQQDWR----MGLDGT-NTAYDLTGRTGHHHSPENGDYDQ
         +   ++S G N +RG DE      HQ G+ G  HF++Q P+G         +WR     G DG     YDL+GR G H S ENGDY+Q
Subjt:  RYGLNIDS-GGNQIRGIDE------HQ-GI-GESHFLLQ-PRG------PQQDWR----MGLDGT-NTAYDLTGRTGHHHSPENGDYDQ

O22812 AT-hook motif nuclear-localized protein 10 (e-value: 6.5e-31, identity: 39.41%)
Query:  PTNGLLPSTHHLPSSSADAVVYPHSVPS------AAVSLEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAKAKKDLASPSSLNAVSASSSFSAPSK
        P   + P   + P+S+ +  V   ++P            EP +++RGRPRKYG     ++      A S + S  +               SS     SK
Subjt:  PTNGLLPSTHHLPSSSADAVVYPHSVPS------AAVSLEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAKAKKDLASPSSLNAVSASSSFSAPSK

Query:  KSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCL
        +  L A G+ G GF PHV+ V AGEDV  KIM       + +C+LSA+G+ISN +LRQ A SGG + YEGRFEI+SL GS+   +  G   +TGGLSV L
Subjt:  KSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCL

Query:  SSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKE
        SS +G+++GG V G L AA PVQ++VG+F+   +KE
Subjt:  SSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKE

O80834 AT-hook motif nuclear-localized protein 9 (e-value: 2.5e-30, identity: 40%)
Query:  HQHHHQSPTASPTNGL-LPSTHHLPS---SSADAVVYPHSV------PSAAVSLEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAKAKKDLASPSS
        H  +  SP  S + G   PS H  PS   ++  A   PH +      P    S  P +RKRGRPRKYG       A  +S+ S+ + +   K+    P  
Subjt:  HQHHHQSPTASPTNGL-LPSTHHLPS---SSADAVVYPHSV------PSAAVSLEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAKAKKDLASPSS

Query:  LNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTD
               +S            P ++G  F PHVI V+ GED+  K++ F QQ  + IC+LSASG++S A+L QP+AS G I YEGRFEI++L  SY+   
Subjt:  LNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTD

Query:  IG---GKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKK
         G    +TG LSV L+S +G +IGG +GGPL AA PVQVIVG+F+  + K
Subjt:  IG---GKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKK

Q8VYJ2 AT-hook motif nuclear-localized protein 1 (e-value: 4.9e-31, identity: 40.08%)
Query:  HQSPTASPTNGLLPSTHHLPSSSADAVVYPHSVPSAA---VSLEPARRKRGRPRKYGTPEE--ALAAKKASTASSHSSSAKAKK---DLASPSSLNAVSA
        +QSPT+       PS+HH             +  +AA   +S    ++KRGRPRKYG      AL+ K  S+A + S          D ++    + V  
Subjt:  HQSPTASPTNGLLPSTHHLPSSSADAVVYPHSVPSAA---VSLEPARRKRGRPRKYGTPEE--ALAAKKASTASSHSSSAKAKK---DLASPSSLNAVSA

Query:  SSSFSAPSKKSHLP-----APGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDI
        ++SF+       +      AP + G  F PH+I V  GEDV  KI+ F QQ  + IC+LSA+G IS+ +LRQP +SGG + YEGRFEI+SL GS++  D 
Subjt:  SSSFSAPSKKSHLP-----APGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDI

Query:  GG---KTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGS
        GG   +TGG+SV L+S +G ++GGG+ G L AA PVQV+VG+F+ G+
Subjt:  GG---KTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGS

Q9SB31 AT-hook motif nuclear-localized protein 3 (e-value: 1.4e-30, identity: 42.73%)
Query:  AVVYPHSVPSAAVSLEPARRKRGRPRKY---GTPEEALAAKKASTASSHSSSAKAKKDLASPSSLN---AVSASSSFSAPSKKSHLPAPGNA---GQGFA
        A  +  ++P+   S E  ++KRGRPRKY   GT    L+    S++   +S    +K        N     S    F      ++L   G A   G  F 
Subjt:  AVVYPHSVPSAAVSLEPARRKRGRPRKY---GTPEEALAAKKASTASSHSSSAKAKKDLASPSSLN---AVSASSSFSAPSKKSHLPAPGNA---GQGFA

Query:  PHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSNGHIIGGGVGGP
        PHV+ V AGEDV  KIM F QQ  + ICILSA+G ISN +LRQ   SGG + YEGRFEI+SL GS+++ D GG   + GG+SVCL+  +G + GGG+ G 
Subjt:  PHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCLSSSNGHIIGGGVGGP

Query:  LKAAGPVQVIVGTFVIGSKK
          AAGPVQV+VGTF+ G ++
Subjt:  LKAAGPVQVIVGTFVIGSKK

Arabidopsis top hits (e-value, % identity, alignment)
AT2G33620.1 AT hook motif DNA-binding family protein (e-value: 4.6e-32, identity: 39.41%)
Query:  PTNGLLPSTHHLPSSSADAVVYPHSVPS------AAVSLEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAKAKKDLASPSSLNAVSASSSFSAPSK
        P   + P   + P+S+ +  V   ++P            EP +++RGRPRKYG     ++      A S + S  +               SS     SK
Subjt:  PTNGLLPSTHHLPSSSADAVVYPHSVPS------AAVSLEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAKAKKDLASPSSLNAVSASSSFSAPSK

Query:  KSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCL
        +  L A G+ G GF PHV+ V AGEDV  KIM       + +C+LSA+G+ISN +LRQ A SGG + YEGRFEI+SL GS+   +  G   +TGGLSV L
Subjt:  KSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCL

Query:  SSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKE
        SS +G+++GG V G L AA PVQ++VG+F+   +KE
Subjt:  SSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKE

AT2G33620.2 AT hook motif DNA-binding family protein (e-value: 4.6e-32, identity: 39.41%)
Query:  PTNGLLPSTHHLPSSSADAVVYPHSVPS------AAVSLEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAKAKKDLASPSSLNAVSASSSFSAPSK
        P   + P   + P+S+ +  V   ++P            EP +++RGRPRKYG     ++      A S + S  +               SS     SK
Subjt:  PTNGLLPSTHHLPSSSADAVVYPHSVPS------AAVSLEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAKAKKDLASPSSLNAVSASSSFSAPSK

Query:  KSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCL
        +  L A G+ G GF PHV+ V AGEDV  KIM       + +C+LSA+G+ISN +LRQ A SGG + YEGRFEI+SL GS+   +  G   +TGGLSV L
Subjt:  KSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDIGG---KTGGLSVCL

Query:  SSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKE
        SS +G+++GG V G L AA PVQ++VG+F+   +KE
Subjt:  SSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKE

AT3G04590.1 AT hook motif DNA-binding family protein (e-value: 3.8e-63, identity: 53.87%)
Query:  EPHENQLNSYFHHHHHQHHHQSPTA------------SPTNGLL---PSTHHLPS-SSADAVVYPHSVPSAAVS--LEPARRKRGRPRKYGTPEEALAAK
        +  + +L S + HH  QHHH  PT             S  NGL    P   H P+  S+   VYPHSVPS+AV+  +EP +RKRGRPRKY TPE+ALAAK
Subjt:  EPHENQLNSYFHHHHHQHHHQSPTA------------SPTNGLL---PSTHHLPS-SSADAVVYPHSVPSAAVS--LEPARRKRGRPRKYGTPEEALAAK

Query:  KASTASSHSSSAKAKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAAS
        K ++++S SSSAK +++LA+ +        S+ S  SKKS L + G  GQ F PH++N+A GEDV QKIM+F  Q K E+C+LSASG+ISNASLRQPA S
Subjt:  KASTASSHSSSAKAKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAAS

Query:  GGNIAYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKE-VGGGVIGDASAS
        GGN+ YEG++EI+SL GSY+RT+ GGK+GGLSV LS+S+G IIGG +G  L AAGPVQVI+GTF +  KK+  G G  GDAS S
Subjt:  GGNIAYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKE-VGGGVIGDASAS

AT3G04590.2 AT hook motif DNA-binding family protein (e-value: 7.0e-73, identity: 49.1%)
Query:  EPHENQLNSYFHHHHHQHHHQSPTA------------SPTNGLL---PSTHHLPS-SSADAVVYPHSVPSAAVS--LEPARRKRGRPRKYGTPEEALAAK
        +  + +L S + HH  QHHH  PT             S  NGL    P   H P+  S+   VYPHSVPS+AV+  +EP +RKRGRPRKY TPE+ALAAK
Subjt:  EPHENQLNSYFHHHHHQHHHQSPTA------------SPTNGLL---PSTHHLPS-SSADAVVYPHSVPSAAVS--LEPARRKRGRPRKYGTPEEALAAK

Query:  KASTASSHSSSAKAKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAAS
        K ++++S SSSAK +++LA+ +        S+ S  SKKS L + G  GQ F PH++N+A GEDV QKIM+F  Q K E+C+LSASG+ISNASLRQPA S
Subjt:  KASTASSHSSSAKAKKDLASPSSLNAVSASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAAS

Query:  GGNIAYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKE-VGGGVIGDA--SASKLPSPIGGTSMSNL
        GGN+ YEG++EI+SL GSY+RT+ GGK+GGLSV LS+S+G IIGG +G  L AAGPVQVI+GTF +  KK+  G G  GDA  S S+L SP+    +  +
Subjt:  GGNIAYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKE-VGGGVIGDA--SASKLPSPIGGTSMSNL

Query:  RYGLNIDS-GGNQIRGIDE------HQ-GI-GESHFLLQ-PRG------PQQDWR----MGLDGT-NTAYDLTGRTGHHHSPENGDYDQ
         +   ++S G N +RG DE      HQ G+ G  HF++Q P+G         +WR     G DG     YDL+GR G H S ENGDY+Q
Subjt:  RYGLNIDS-GGNQIRGIDE------HQ-GI-GESHFLLQ-PRG------PQQDWR----MGLDGT-NTAYDLTGRTGHHHSPENGDYDQ

AT4G12080.1 AT-hook motif nuclear-localized protein 1 (e-value: 3.5e-32, identity: 40.08%)
Query:  HQSPTASPTNGLLPSTHHLPSSSADAVVYPHSVPSAA---VSLEPARRKRGRPRKYGTPEE--ALAAKKASTASSHSSSAKAKK---DLASPSSLNAVSA
        +QSPT+       PS+HH             +  +AA   +S    ++KRGRPRKYG      AL+ K  S+A + S          D ++    + V  
Subjt:  HQSPTASPTNGLLPSTHHLPSSSADAVVYPHSVPSAA---VSLEPARRKRGRPRKYGTPEE--ALAAKKASTASSHSSSAKAKK---DLASPSSLNAVSA

Query:  SSSFSAPSKKSHLP-----APGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDI
        ++SF+       +      AP + G  F PH+I V  GEDV  KI+ F QQ  + IC+LSA+G IS+ +LRQP +SGG + YEGRFEI+SL GS++  D 
Subjt:  SSSFSAPSKKSHLP-----APGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDI

Query:  GG---KTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGS
        GG   +TGG+SV L+S +G ++GGG+ G L AA PVQV+VG+F+ G+
Subjt:  GG---KTGGLSVCLSSSNGHIIGGGVGGPLKAAGPVQVIVGTFVIGS


Sequences
CDS sequence
ATGGAACCCCATGAAAACCAGCTCAATTCCTACTTCCACCACCACCATCACCAACACCACCATCAAAGTCCCACCGCCTCCCCCACCAATGGCCTTTTACCCTCCACCCA
CCACCTCCCCTCCTCCTCCGCCGACGCCGTCGTTTACCCTCACTCCGTCCCCTCCGCCGCCGTCTCCCTCGAGCCCGCCCGCCGGAAGAGAGGCCGGCCTAGAAAGTACG
GCACGCCGGAGGAGGCGTTAGCGGCGAAGAAAGCTTCCACGGCTTCCTCTCACTCCTCCTCCGCCAAGGCCAAGAAGGACCTCGCCTCCCCCTCTTCCCTTAACGCCGTT
TCCGCTTCTTCTTCCTTCTCTGCCCCTTCCAAGAAATCTCACTTACCTGCACCTGGTAATGCAGGCCAAGGTTTTGCACCACATGTTATTAATGTGGCAGCTGGTGAGGA
TGTGGGCCAGAAGATTATGCTGTTTATGCAACAATGTAAGCAGGAAATTTGCATCCTTTCTGCATCTGGTTCGATCTCCAATGCATCTCTCCGTCAGCCAGCCGCATCCG
GTGGCAACATTGCCTATGAGGGTCGTTTTGAGATTGTTTCGTTATGTGGATCTTATGTACGAACTGACATTGGAGGAAAGACTGGTGGTCTTAGTGTGTGTTTGTCGAGC
TCTAATGGCCATATCATAGGAGGGGGAGTTGGTGGACCATTGAAGGCTGCTGGACCCGTGCAGGTTATTGTTGGTACCTTTGTAATCGGCTCAAAGAAGGAAGTTGGCGG
AGGTGTTATTGGCGATGCATCTGCTAGCAAGTTGCCCTCACCTATTGGTGGGACATCGATGTCAAATCTACGCTATGGCTTGAACATCGACTCGGGAGGTAATCAAATAA
GGGGAATTGATGAGCACCAAGGTATCGGGGAGAGTCATTTCTTGCTTCAGCCCCGAGGTCCACAACAGGACTGGAGGATGGGTCTGGATGGCACAAACACTGCATATGAT
TTGACAGGAAGAACAGGCCACCATCATTCTCCCGAAAATGGAGATTACGATCAGATTGCTGATTGA
mRNA sequence
AAAAAAGGAGAGGCAAATCCATCTCAGAAAAAAAAAAAAGATTTTTATTGATTTAATAATTTTATTTTCTCAGTTTTTCTCTCTCTCATCTAAAAAATGGAACCCCATGA
AAACCAGCTCAATTCCTACTTCCACCACCACCATCACCAACACCACCATCAAAGTCCCACCGCCTCCCCCACCAATGGCCTTTTACCCTCCACCCACCACCTCCCCTCCT
CCTCCGCCGACGCCGTCGTTTACCCTCACTCCGTCCCCTCCGCCGCCGTCTCCCTCGAGCCCGCCCGCCGGAAGAGAGGCCGGCCTAGAAAGTACGGCACGCCGGAGGAG
GCGTTAGCGGCGAAGAAAGCTTCCACGGCTTCCTCTCACTCCTCCTCCGCCAAGGCCAAGAAGGACCTCGCCTCCCCCTCTTCCCTTAACGCCGTTTCCGCTTCTTCTTC
CTTCTCTGCCCCTTCCAAGAAATCTCACTTACCTGCACCTGGTAATGCAGGCCAAGGTTTTGCACCACATGTTATTAATGTGGCAGCTGGTGAGGATGTGGGCCAGAAGA
TTATGCTGTTTATGCAACAATGTAAGCAGGAAATTTGCATCCTTTCTGCATCTGGTTCGATCTCCAATGCATCTCTCCGTCAGCCAGCCGCATCCGGTGGCAACATTGCC
TATGAGGGTCGTTTTGAGATTGTTTCGTTATGTGGATCTTATGTACGAACTGACATTGGAGGAAAGACTGGTGGTCTTAGTGTGTGTTTGTCGAGCTCTAATGGCCATAT
CATAGGAGGGGGAGTTGGTGGACCATTGAAGGCTGCTGGACCCGTGCAGGTTATTGTTGGTACCTTTGTAATCGGCTCAAAGAAGGAAGTTGGCGGAGGTGTTATTGGCG
ATGCATCTGCTAGCAAGTTGCCCTCACCTATTGGTGGGACATCGATGTCAAATCTACGCTATGGCTTGAACATCGACTCGGGAGGTAATCAAATAAGGGGAATTGATGAG
CACCAAGGTATCGGGGAGAGTCATTTCTTGCTTCAGCCCCGAGGTCCACAACAGGACTGGAGGATGGGTCTGGATGGCACAAACACTGCATATGATTTGACAGGAAGAAC
AGGCCACCATCATTCTCCCGAAAATGGAGATTACGATCAGATTGCTGATTGAGAGCTAACATATGGGATGGACAGGACGAGACTGCAAAGTTTGTCGTAGATAAATGTAC
AATATCAAGAGTTGCATTGCCAGCAATCTTCGCTTCACTCTCTGTTAGCTAATTCTCTGGCTGTAGTATGTGCACCATACTGTAAAGGTGGTAACAAGCTCTTTAGAAGT
TTTTATGTTCTTGCATTTTATTTTCCCATTTTCTAAGTTCTTATGTTCTTGTATCCATCCATCCACCTTTCATGGTTTCATGGTGTATTTGCATGAACTAATTCTAATTC
TCCTGTTTTTTTTTTCCCTCTTAGATGAACAAGTTTGCTGTAATTTTTACAGTCTGCAATTTGATCGTATAAACAAACACCTCTCTGGTGATTTTGAATTCCTTAATTAA
TCTTCAGGTGATGAACCATGTCATTTGATTCATTCATAAAATTTATTGTTAAGAGAGGTAAAAGATCAACACTTTCCCTTCATAACTTTTATTTACTCTTTGACGCTGTT
TATTTATATATATTGTTCTATATAGTTTTTGTTTCAACCATATTTAGAGTTGGATGATTTGGATGCTAGTTTAGTTTAATTTTTATTGGCCTTTTGTTCTATAGTTCTC
Protein sequence
MEPHENQLNSYFHHHHHQHHHQSPTASPTNGLLPSTHHLPSSSADAVVYPHSVPSAAVSLEPARRKRGRPRKYGTPEEALAAKKASTASSHSSSAKAKKDLASPSSLNAV
SASSSFSAPSKKSHLPAPGNAGQGFAPHVINVAAGEDVGQKIMLFMQQCKQEICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSS
SNGHIIGGGVGGPLKAAGPVQVIVGTFVIGSKKEVGGGVIGDASASKLPSPIGGTSMSNLRYGLNIDSGGNQIRGIDEHQGIGESHFLLQPRGPQQDWRMGLDGTNTAYD
LTGRTGHHHSPENGDYDQIAD
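
As a consistency check, the CDS sequence can be translated and compared with the protein sequence. A minimal sketch using the standard genetic code, applied only to the first 30 and last 12 nucleotides of the CDS listed in this record. The helper name translate is hypothetical and not part of CuGenDB, and the choice of the standard code (rather than an organelle code) is an assumption appropriate for a nuclear gene:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 12 nucleotides of the CDS sequence in this record.
cds_start = "ATGGAACCCCATGAAAACCAGCTCAATTCC"
cds_end = "ATTGCTGATTGA"

print(translate(cds_start))  # expected to match the start of the protein: MEPHENQLNS
print(translate(cds_end))    # expected to match the end of the protein plus stop: IAD*
```

To check the whole record, the wrapped CDS lines would be joined into one string before calling translate, and the trailing stop symbol dropped before comparing with the protein sequence.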