; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi11G004120 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi11G004120
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionAT-hook motif nuclear-localized protein
Genome locationchr11:4296550..4303328
RNA-Seq ExpressionLsi11G004120
SyntenyLsi11G004120
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003680 - AT DNA binding (molecular function)
InterPro domainsIPR005175 - PPC domain
IPR039605 - AT-hook motif nuclear-localized protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0063571.1 AT-hook motif nuclear-localized protein 14 [Cucumis melo var. makuwa]3.8e-18594.32Show/hide
Query:  MEPNENQLSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS
        MEPNENQLSSYFHHHQHHHQ+P TTSPTNGLLPPTHHLS+AAA SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS
Subjt:  MEPNENQLSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS

Query:  SSKAKKELASSSSLNAVSASSSFSAPSKKSQLAAL---------GNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASG
        SSKAKKELASSSSLNAVSASSSFS PSKKSQLAAL         GNAGQGFAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPAASG
Subjt:  SSKAKKELASSSSLNAVSASSSFSAPSKKSQLAAL---------GNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASG

Query:  GNIAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSN
        GNIAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SAGKLPSPIGGTSMSNLRYGSN
Subjt:  GNIAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSN

Query:  IDSGGNQVRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP
        IDSGGNQ+RGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATN AYDL+GRT HHSPENGDYDQIP
Subjt:  IDSGGNQVRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP

XP_004139392.1 AT-hook motif nuclear-localized protein 14 [Cucumis sativus]1.1e-18796.95Show/hide
Query:  MEPNENQLSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS
        MEPNENQLSSYFHHHQHHHQ+P TTSPTNGLLPPTHHLS+AAA SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS
Subjt:  MEPNENQLSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS

Query:  SSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRF
        SSKAKKELASSSSLNAVSASSSFS PSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRF
Subjt:  SSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRF

Query:  EIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQVR
        EIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SA KLPSPIGGTSMSNLRYGSNIDSGGNQ+R
Subjt:  EIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQVR

Query:  GNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP
        GNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDL+GRTGHHSPENGDYDQIP
Subjt:  GNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP

XP_008456410.1 PREDICTED: AT-hook motif nuclear-localized protein 14 isoform X1 [Cucumis melo]5.9e-18696.14Show/hide
Query:  MEPNENQLSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS
        MEPNENQLSSYFHHHQHHHQ+P TTSPTNGLLPPTHHLS+AAA SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS
Subjt:  MEPNENQLSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS

Query:  SSKAKKELASSSSLNAVSASSSFSAPSKKSQLAAL--GNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEG
        SSKAKKELASSSSLNAVSASSSFS PSKKSQLAAL  GNAGQGFAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPAASGGNIAYEG
Subjt:  SSKAKKELASSSSLNAVSASSSFSAPSKKSQLAAL--GNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEG

Query:  RFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQ
        RFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SAGKLPSPIGGTSMSNLRYGSNIDSGGNQ
Subjt:  RFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQ

Query:  VRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP
        +RGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATN AYDL+GRT HHSPENGDYDQIP
Subjt:  VRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP

XP_008456418.1 PREDICTED: AT-hook motif nuclear-localized protein 14 isoform X2 [Cucumis melo]1.8e-18796.68Show/hide
Query:  MEPNENQLSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS
        MEPNENQLSSYFHHHQHHHQ+P TTSPTNGLLPPTHHLS+AAA SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS
Subjt:  MEPNENQLSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS

Query:  SSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRF
        SSKAKKELASSSSLNAVSASSSFS PSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRF
Subjt:  SSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRF

Query:  EIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQVR
        EIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SAGKLPSPIGGTSMSNLRYGSNIDSGGNQ+R
Subjt:  EIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQVR

Query:  GNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP
        GNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATN AYDL+GRT HHSPENGDYDQIP
Subjt:  GNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP

XP_038890429.1 AT-hook motif nuclear-localized protein 14 [Benincasa hispida]4.0e-19098.89Show/hide
Query:  MEPNENQLSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS
        MEPNENQLSSYFHHHQHHHQSP TTSPTNGLLPPTHHLSSAAA SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS
Subjt:  MEPNENQLSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS

Query:  SSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRF
        SSKAKKELASSSSLNAVSASSSFSAPSKKSQLA LGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRF
Subjt:  SSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRF

Query:  EIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQVR
        EIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQ+R
Subjt:  EIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQVR

Query:  GNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP
        GNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP
Subjt:  GNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP

TrEMBL top hitse value%identityAlignment
A0A0A0LJ73 AT-hook motif nuclear-localized protein3.0e-18396.88Show/hide
Query:  SSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKEL
        SSYFHHHQHHHQ+P TTSPTNGLLPPTHHLS+AAA SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKEL
Subjt:  SSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKEL

Query:  ASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGS
        ASSSSLNAVSASSSFS PSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGS
Subjt:  ASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGS

Query:  YVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQVRGNDEHQGL
        YVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SA KLPSPIGGTSMSNLRYGSNIDSGGNQ+RGNDEHQGL
Subjt:  YVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQVRGNDEHQGL

Query:  GESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP
        GESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDL+GRTGHHSPENGDYDQIP
Subjt:  GESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP

A0A1S3C2R6 AT-hook motif nuclear-localized protein2.9e-18696.14Show/hide
Query:  MEPNENQLSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS
        MEPNENQLSSYFHHHQHHHQ+P TTSPTNGLLPPTHHLS+AAA SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS
Subjt:  MEPNENQLSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS

Query:  SSKAKKELASSSSLNAVSASSSFSAPSKKSQLAAL--GNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEG
        SSKAKKELASSSSLNAVSASSSFS PSKKSQLAAL  GNAGQGFAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPAASGGNIAYEG
Subjt:  SSKAKKELASSSSLNAVSASSSFSAPSKKSQLAAL--GNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEG

Query:  RFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQ
        RFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SAGKLPSPIGGTSMSNLRYGSNIDSGGNQ
Subjt:  RFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQ

Query:  VRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP
        +RGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATN AYDL+GRT HHSPENGDYDQIP
Subjt:  VRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP

A0A1S3C3W0 AT-hook motif nuclear-localized protein8.9e-18896.68Show/hide
Query:  MEPNENQLSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS
        MEPNENQLSSYFHHHQHHHQ+P TTSPTNGLLPPTHHLS+AAA SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS
Subjt:  MEPNENQLSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS

Query:  SSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRF
        SSKAKKELASSSSLNAVSASSSFS PSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRF
Subjt:  SSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRF

Query:  EIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQVR
        EIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SAGKLPSPIGGTSMSNLRYGSNIDSGGNQ+R
Subjt:  EIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQVR

Query:  GNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP
        GNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATN AYDL+GRT HHSPENGDYDQIP
Subjt:  GNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP

A0A5A7VCX4 AT-hook motif nuclear-localized protein1.9e-18594.32Show/hide
Query:  MEPNENQLSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS
        MEPNENQLSSYFHHHQHHHQ+P TTSPTNGLLPPTHHLS+AAA SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS
Subjt:  MEPNENQLSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSS

Query:  SSKAKKELASSSSLNAVSASSSFSAPSKKSQLAAL---------GNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASG
        SSKAKKELASSSSLNAVSASSSFS PSKKSQLAAL         GNAGQGFAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPAASG
Subjt:  SSKAKKELASSSSLNAVSASSSFSAPSKKSQLAAL---------GNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASG

Query:  GNIAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSN
        GNIAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SAGKLPSPIGGTSMSNLRYGSN
Subjt:  GNIAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSN

Query:  IDSGGNQVRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP
        IDSGGNQ+RGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATN AYDL+GRT HHSPENGDYDQIP
Subjt:  IDSGGNQVRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP

A0A6J1G4C2 AT-hook motif nuclear-localized protein6.4e-17893.06Show/hide
Query:  MEPNENQLSSYFHHHQHHHQSPTTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSS
        MEPNENQLSSYF HHQHHHQSPTTSPTNGLLPPTHHLSS    SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSS
Subjt:  MEPNENQLSSYFHHHQHHHQSPTTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSS

Query:  SKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFE
        SKAKK+LASSSSLNAVSASSSFSA SKKSQLAALGNAGQGF+PHVINVAAGEDVGQKIM+FMQQCKREICILSASGSISNASLRQPA SGGNI YEGRFE
Subjt:  SKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFE

Query:  IVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQVRG
        IVSLCGSY+RTD GGKTGGLSVCLSSA+GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGG KGDASAGKLPSP GGT MSNLRYGSN+D+GGNQVRG
Subjt:  IVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQVRG

Query:  NDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP
        NDEHQG+GESHFLLQPRGVNLTS RSTDWR  LDATN AYDLTGRT HHSPENGDYDQIP
Subjt:  NDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLTGRTGHHSPENGDYDQIP

SwissProt top hitse value%identityAlignment
A1L4X7 AT-hook motif nuclear-localized protein 141.7e-8251.95Show/hide
Query:  SSYFHHH-QHHHQSPTT------------SPTNGLLPP---THHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAA
        S YFHH  QHHH  PTT            S  NGL PP     H  +  + S A    VYPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK A
Subjt:  SSYFHHH-QHHHQSPTT------------SPTNGLLPP---THHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAA

Query:  TASSHSSSSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGN
        +++S SSS+K ++ELA+ +        S+ S  SKKSQL ++G  GQ F PH++N+A GEDV QKIMMF  Q K E+C+LSASG+ISNASLRQPA SGGN
Subjt:  TASSHSSSSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGN

Query:  IAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEV--GGGKGDA--SAGKLPSPIGGTSMSNLRYG
        + YEG++EI+SL GSY+RT+ GGK+GGLSV LS+++G IIGG +G  L AAGPVQVI+GTF +D KK+    GGKGDA  S  +L SP+    +  + + 
Subjt:  IAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEV--GGGKGDA--SAGKLPSPIGGTSMSNLRYG

Query:  SNIDS-GGNQVRGNDE------HQ-GL-GESHFLLQ-PRGVNLTSPRSTDWRTGLDATNT-----AYDLTGRTGHHSPENGDYDQ
          ++S G N +RGNDE      HQ GL G  HF++Q P+G+++T  R ++WR G ++ +       YDL+GR GH S ENGDY+Q
Subjt:  SNIDS-GGNQVRGNDE------HQ-GL-GESHFLLQ-PRGVNLTSPRSTDWRTGLDATNT-----AYDLTGRTGHHSPENGDYDQ

O22812 AT-hook motif nuclear-localized protein 103.9e-3137.29Show/hide
Query:  PTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVP---SAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKELASSSSLNAVSASSSF
        P   + PP  +  ++     AG + V   ++P   S  ++ +  EP +++RGRPRKYG     ++      A S + S  +               SS  
Subjt:  PTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVP---SAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKELASSSSLNAVSASSSF

Query:  SAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGG---KTGG
           SK+ +L ALG+ G GF PHV+ V AGEDV  KIM       R +C+LSA+G+ISN +LRQ A SGG + YEGRFEI+SL GS+   +  G   +TGG
Subjt:  SAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGG---KTGG

Query:  LSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVID----PKKEVG--GGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQVRGNDEHQGLG
        LSV LSS +G+++GG V G L AA PVQ++VG+F+ D    PK+ VG  G          P+ +  T  S    G+  +S      G+  HQ  G
Subjt:  LSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVID----PKKEVG--GGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQVRGNDEHQGLG

O49658 AT-hook motif nuclear-localized protein 24.8e-2939.68Show/hide
Query:  TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHS--VPSAAVSSSPLEPARRKRGRPRKYGTPEEALA-----AKKAATASSHSSSSKAKKELASSSSLNA
        + +P++  + P    S+    S A P    P +   PSAA+      P +++RGRPRKYG    A+         AA  +SH        E        A
Subjt:  TTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHS--VPSAAVSSSPLEPARRKRGRPRKYGTPEEALA-----AKKAATASSHSSSSKAKKELASSSSLNA

Query:  VSASSSFSAPSKKSQLAALG-----NAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVR
            SSF  P  K Q+  LG     +A   F PH+I V AGEDV ++I+ F QQ    IC+L A+G +S+ +LRQP +SGG + YEGRFEI+SL G+++ 
Subjt:  VSASSSFSAPSKKSQLAALG-----NAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVR

Query:  TDLGG---KTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFV
        +D  G   +TGG+SV L+S +G ++GGGV G L AA P+QV+VGTF+
Subjt:  TDLGG---KTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFV

O80834 AT-hook motif nuclear-localized protein 92.7e-3241.8Show/hide
Query:  HQHHHQSPTTSPTNGLLPPTHH----LSSAAAGSDAGPHVVYPHSVPSAAVSSSPLE-PARRKRGRPRKY---GTPEEALAAKKAATASSHSSSSKAKKE
        H  +  SP  S + G   P+ H    L++AA G+ A PH +    V   A    P E P +RKRGRPRKY   G+   AL++   +T + ++S+ + +  
Subjt:  HQHHHQSPTTSPTNGLLPPTHH----LSSAAAGSDAGPHVVYPHSVPSAAVSSSPLE-PARRKRGRPRKY---GTPEEALAAKKAATASSHSSSSKAKKE

Query:  LASSSSLNAVSASSSFSAPSKKSQLAALG-----NAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEI
           S                KK ++A++G     ++G  F PHVI V+ GED+  K++ F QQ  R IC+LSASG++S A+L QP+AS G I YEGRFEI
Subjt:  LASSSSLNAVSASSSFSAPSKKSQLAALG-----NAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEI

Query:  VSLCGSY-VRTD--LGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFV
        ++L  SY V TD     +TG LSV L+S +G +IGG +GGPL AA PVQVIVG+F+
Subjt:  VSLCGSY-VRTD--LGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFV

Q9SB31 AT-hook motif nuclear-localized protein 32.1e-2939.42Show/hide
Query:  NENQLSSYFHHHQH--------HHQSPTTSPTNG---LLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKY---GTPEEALAAK
        N N  SS+    QH        +   P   P N    L+PPT   ++A   +    +   P S+     ++S  E  ++KRGRPRKY   GT    L+  
Subjt:  NENQLSSYFHHHQH--------HHQSPTTSPTNG---LLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKY---GTPEEALAAK

Query:  KAATASSHSSSSKAKKELASSSSLN---AVSASSSFSAPSKKSQLAALGNA---GQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASL
          +++   +S    +K        N     S    F      + LA +G A   G  F PHV+ V AGEDV  KIM F QQ  R ICILSA+G ISN +L
Subjt:  KAATASSHSSSSKAKKELASSSSLN---AVSASSSFSAPSKKSQLAALGNA---GQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASL

Query:  RQPAASGGNIAYEGRFEIVSLCGSYVRTDLGG---KTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFV
        RQ   SGG + YEGRFEI+SL GS+++ D GG   + GG+SVCL+  +G + GGG+ G   AAGPVQV+VGTF+
Subjt:  RQPAASGGNIAYEGRFEIVSLCGSYVRTDLGG---KTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFV

Arabidopsis top hitse value%identityAlignment
AT2G45850.1 AT hook motif DNA-binding family protein1.9e-3341.8Show/hide
Query:  HQHHHQSPTTSPTNGLLPPTHH----LSSAAAGSDAGPHVVYPHSVPSAAVSSSPLE-PARRKRGRPRKY---GTPEEALAAKKAATASSHSSSSKAKKE
        H  +  SP  S + G   P+ H    L++AA G+ A PH +    V   A    P E P +RKRGRPRKY   G+   AL++   +T + ++S+ + +  
Subjt:  HQHHHQSPTTSPTNGLLPPTHH----LSSAAAGSDAGPHVVYPHSVPSAAVSSSPLE-PARRKRGRPRKY---GTPEEALAAKKAATASSHSSSSKAKKE

Query:  LASSSSLNAVSASSSFSAPSKKSQLAALG-----NAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEI
           S                KK ++A++G     ++G  F PHVI V+ GED+  K++ F QQ  R IC+LSASG++S A+L QP+AS G I YEGRFEI
Subjt:  LASSSSLNAVSASSSFSAPSKKSQLAALG-----NAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEI

Query:  VSLCGSY-VRTD--LGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFV
        ++L  SY V TD     +TG LSV L+S +G +IGG +GGPL AA PVQVIVG+F+
Subjt:  VSLCGSY-VRTD--LGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFV

AT2G45850.2 AT hook motif DNA-binding family protein1.9e-3341.8Show/hide
Query:  HQHHHQSPTTSPTNGLLPPTHH----LSSAAAGSDAGPHVVYPHSVPSAAVSSSPLE-PARRKRGRPRKY---GTPEEALAAKKAATASSHSSSSKAKKE
        H  +  SP  S + G   P+ H    L++AA G+ A PH +    V   A    P E P +RKRGRPRKY   G+   AL++   +T + ++S+ + +  
Subjt:  HQHHHQSPTTSPTNGLLPPTHH----LSSAAAGSDAGPHVVYPHSVPSAAVSSSPLE-PARRKRGRPRKY---GTPEEALAAKKAATASSHSSSSKAKKE

Query:  LASSSSLNAVSASSSFSAPSKKSQLAALG-----NAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEI
           S                KK ++A++G     ++G  F PHVI V+ GED+  K++ F QQ  R IC+LSASG++S A+L QP+AS G I YEGRFEI
Subjt:  LASSSSLNAVSASSSFSAPSKKSQLAALG-----NAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEI

Query:  VSLCGSY-VRTD--LGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFV
        ++L  SY V TD     +TG LSV L+S +G +IGG +GGPL AA PVQVIVG+F+
Subjt:  VSLCGSY-VRTD--LGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFV

AT3G04590.1 AT hook motif DNA-binding family protein4.1e-6857.04Show/hide
Query:  SSYFHHH-QHHHQSPTT------------SPTNGLLPP---THHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAA
        S YFHH  QHHH  PTT            S  NGL PP     H  +  + S A    VYPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK A
Subjt:  SSYFHHH-QHHHQSPTT------------SPTNGLLPP---THHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAA

Query:  TASSHSSSSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGN
        +++S SSS+K ++ELA+ +        S+ S  SKKSQL ++G  GQ F PH++N+A GEDV QKIMMF  Q K E+C+LSASG+ISNASLRQPA SGGN
Subjt:  TASSHSSSSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGN

Query:  IAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEV--GGGKGDAS-AGKL
        + YEG++EI+SL GSY+RT+ GGK+GGLSV LS+++G IIGG +G  L AAGPVQVI+GTF +D KK+    GGKGDAS +GK+
Subjt:  IAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEV--GGGKGDAS-AGKL

AT3G04590.2 AT hook motif DNA-binding family protein1.2e-8351.95Show/hide
Query:  SSYFHHH-QHHHQSPTT------------SPTNGLLPP---THHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAA
        S YFHH  QHHH  PTT            S  NGL PP     H  +  + S A    VYPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK A
Subjt:  SSYFHHH-QHHHQSPTT------------SPTNGLLPP---THHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAA

Query:  TASSHSSSSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGN
        +++S SSS+K ++ELA+ +        S+ S  SKKSQL ++G  GQ F PH++N+A GEDV QKIMMF  Q K E+C+LSASG+ISNASLRQPA SGGN
Subjt:  TASSHSSSSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGN

Query:  IAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEV--GGGKGDA--SAGKLPSPIGGTSMSNLRYG
        + YEG++EI+SL GSY+RT+ GGK+GGLSV LS+++G IIGG +G  L AAGPVQVI+GTF +D KK+    GGKGDA  S  +L SP+    +  + + 
Subjt:  IAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEV--GGGKGDA--SAGKLPSPIGGTSMSNLRYG

Query:  SNIDS-GGNQVRGNDE------HQ-GL-GESHFLLQ-PRGVNLTSPRSTDWRTGLDATNT-----AYDLTGRTGHHSPENGDYDQ
          ++S G N +RGNDE      HQ GL G  HF++Q P+G+++T  R ++WR G ++ +       YDL+GR GH S ENGDY+Q
Subjt:  SNIDS-GGNQVRGNDE------HQ-GL-GESHFLLQ-PRGVNLTSPRSTDWRTGLDATNT-----AYDLTGRTGHHSPENGDYDQ

AT5G28590.1 DNA-binding family protein1.5e-3844.89Show/hide
Query:  ALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHII
        AL   GQ F PH++N+  GEDV +KI++F QQ K ++C+LSASGSISNASL    ASG                    T  GGKTGGLSVCLS+++G I 
Subjt:  ALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHII

Query:  GGGVGGPLKAAGPVQVIVGTFVIDPKKE-VGGGKGDASAGK---LPSPIGGTSMSNLRYGSNIDSGGNQVRGNDEHQGL------GESHFLLQ-PRGVNL
        GGGVGG LKAAGPVQV++GTF ++ KK+   G KGD ++G    LPSP G  S+  L Y  +++S G     NDEH  +      G +HF+++ P+G+++
Subjt:  GGGVGGPLKAAGPVQVIVGTFVIDPKKE-VGGGKGDASAGK---LPSPIGGTSMSNLRYGSNIDSGGNQVRGNDEHQGL------GESHFLLQ-PRGVNL

Query:  TSPRSTDWRTGLDATNTAYDLTGRT
        T  R ++W        T YDL+G++
Subjt:  TSPRSTDWRTGLDATNTAYDLTGRT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACCCAATGAAAACCAGCTCAGCTCCTACTTCCACCACCATCAACACCACCATCAGAGTCCCACCACATCGCCGACCAATGGCCTTTTACCACCCACCCACCACCT
CTCCTCCGCCGCTGCCGGCTCCGACGCCGGCCCTCATGTCGTATACCCTCACTCTGTCCCTTCCGCCGCCGTGTCCTCGTCTCCCCTCGAGCCCGCACGCCGGAAGAGAG
GCCGGCCGAGGAAGTACGGAACGCCGGAGGAAGCTTTAGCGGCTAAGAAAGCTGCTACGGCGTCGTCTCACTCTTCGTCCTCTAAGGCCAAGAAGGAACTCGCTTCTTCT
TCTTCCCTTAATGCCGTTTCCGCTTCTTCTTCCTTCTCCGCGCCTTCCAAAAAATCTCAGTTGGCTGCACTTGGTAATGCAGGCCAAGGTTTTGCGCCACATGTTATTAA
TGTGGCAGCTGGTGAGGATGTGGGACAGAAAATTATGATGTTTATGCAACAATGTAAGCGGGAAATTTGTATCCTTTCTGCATCTGGTTCGATCTCCAATGCATCTCTCC
GTCAGCCAGCTGCATCTGGAGGCAATATTGCATATGAGGGTCGTTTTGAGATTGTTTCATTGTGCGGATCTTATGTACGAACTGACCTCGGAGGAAAGACCGGTGGTCTT
AGTGTATGTCTATCAAGTGCTGAAGGCCATATCATAGGAGGGGGAGTTGGTGGACCATTGAAGGCTGCTGGACCCGTGCAGGTTATTGTTGGAACCTTCGTAATTGACCC
GAAGAAGGAAGTTGGTGGTGGTAAAGGTGATGCATCTGCTGGCAAGTTGCCCTCACCTATTGGTGGGACGTCGATGTCAAATCTACGCTATGGCTCCAACATTGACTCGG
GAGGTAATCAAGTCAGGGGAAATGATGAACACCAGGGTCTTGGGGAGAGTCATTTTTTGCTTCAGCCCCGGGGAGTGAATCTGACATCACCGCGATCAACGGATTGGAGG
ACGGGTCTGGATGCCACAAACACTGCTTATGATTTGACAGGAAGAACAGGCCATCATTCTCCCGAAAACGGAGATTACGATCAGATTCCCGGTTGA
mRNA sequenceShow/hide mRNA sequence
AAAAAAAAGAAAAAAAGGGAAAAAAAGGCAAATCCTTTTCAAAAAAAGAAGAAAAAAATTTAATAATTTTAATTTCCAAAACACACGCTCTCTCTTTCTCTCTCTCATCT
CAAAAATGGAACCCAATGAAAACCAGCTCAGCTCCTACTTCCACCACCATCAACACCACCATCAGAGTCCCACCACATCGCCGACCAATGGCCTTTTACCACCCACCCAC
CACCTCTCCTCCGCCGCTGCCGGCTCCGACGCCGGCCCTCATGTCGTATACCCTCACTCTGTCCCTTCCGCCGCCGTGTCCTCGTCTCCCCTCGAGCCCGCACGCCGGAA
GAGAGGCCGGCCGAGGAAGTACGGAACGCCGGAGGAAGCTTTAGCGGCTAAGAAAGCTGCTACGGCGTCGTCTCACTCTTCGTCCTCTAAGGCCAAGAAGGAACTCGCTT
CTTCTTCTTCCCTTAATGCCGTTTCCGCTTCTTCTTCCTTCTCCGCGCCTTCCAAAAAATCTCAGTTGGCTGCACTTGGTAATGCAGGCCAAGGTTTTGCGCCACATGTT
ATTAATGTGGCAGCTGGTGAGGATGTGGGACAGAAAATTATGATGTTTATGCAACAATGTAAGCGGGAAATTTGTATCCTTTCTGCATCTGGTTCGATCTCCAATGCATC
TCTCCGTCAGCCAGCTGCATCTGGAGGCAATATTGCATATGAGGGTCGTTTTGAGATTGTTTCATTGTGCGGATCTTATGTACGAACTGACCTCGGAGGAAAGACCGGTG
GTCTTAGTGTATGTCTATCAAGTGCTGAAGGCCATATCATAGGAGGGGGAGTTGGTGGACCATTGAAGGCTGCTGGACCCGTGCAGGTTATTGTTGGAACCTTCGTAATT
GACCCGAAGAAGGAAGTTGGTGGTGGTAAAGGTGATGCATCTGCTGGCAAGTTGCCCTCACCTATTGGTGGGACGTCGATGTCAAATCTACGCTATGGCTCCAACATTGA
CTCGGGAGGTAATCAAGTCAGGGGAAATGATGAACACCAGGGTCTTGGGGAGAGTCATTTTTTGCTTCAGCCCCGGGGAGTGAATCTGACATCACCGCGATCAACGGATT
GGAGGACGGGTCTGGATGCCACAAACACTGCTTATGATTTGACAGGAAGAACAGGCCATCATTCTCCCGAAAACGGAGATTACGATCAGATTCCCGGTTGAGAGCTAACA
TACGAGGCAGAACGGGACTGCAAAGTTTGTCGTAGATAAATGTACAATATCAACAGTTGCAATGCCAGTAATCTTCTCTTCACTCCTTGTTAGCTAATTCTGGTTGTAGT
ATGCACCATACTGTAAAGGTGATAACAAACTCTTTAGAAGTTTTTATGCTCTTCTTGCATTTCATTTTCCTCCTTTCTCAAGTTTTTACCCATCCATCCACCTTTCATGG
TGTATTTGCATGTTAAGTCTGTCTAATTCTAATTCTCCTGTTTTTCCTTTAGATGAACAAGTTTGTTGTAGTTGTTACAGTTTGCAATTTCCAATTTGATCATATAAACA
CCACCTCTTTGGTGATTTAGAATTCTTTGCAGTAATCTTCTGGTGATGAACAATGCCTCTTTGATTGATTCTTTGGTTGCAAGAGGTAAAGAATATTACTATCTACACAC
AATTTACTTTTTGAC
Protein sequenceShow/hide protein sequence
MEPNENQLSSYFHHHQHHHQSPTTSPTNGLLPPTHHLSSAAAGSDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKELASS
SSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTGGL
SVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLPSPIGGTSMSNLRYGSNIDSGGNQVRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWR
TGLDATNTAYDLTGRTGHHSPENGDYDQIPG