; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G034220 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G034220
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionAT-hook motif nuclear-localized protein
Genome locationCicolChr02:30009690..30027969
RNA-Seq ExpressionCcUC02G034220
SyntenyCcUC02G034220
Gene Ontology termsGO:0001510 - RNA methylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003680 - AT DNA binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR005175 - PPC domain
IPR013217 - Methyltransferase type 12
IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase
IPR039605 - AT-hook motif nuclear-localized protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0063571.1 AT-hook motif nuclear-localized protein 14 [Cucumis melo var. makuwa]2.0e-16893.35Show/hide
Query:  LSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE
        LSSYFHHHQHHHQ+P TTSPTNGLLPPTHHLS+AAA+SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE
Subjt:  LSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE

Query:  LASSSSLNAVSASSSFSAPSKKSQLAAL---------GNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEG
        LASSSSLNAVSASSSFS PSKKSQLAAL         GNAGQGFAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPAASGGNIAYEG
Subjt:  LASSSSLNAVSASSSFSAPSKKSQLAAL---------GNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEG

Query:  RFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNH
        RFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SAGKL SPIGGT MSNLRYGSNIDSGGN 
Subjt:  RFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNH

Query:  VRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT
        +RGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATN AYDL+
Subjt:  VRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT

XP_004139392.1 AT-hook motif nuclear-localized protein 14 [Cucumis sativus]2.8e-17095.85Show/hide
Query:  LSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE
        LSSYFHHHQHHHQ+P TTSPTNGLLPPTHHLS+AAA+SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE
Subjt:  LSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE

Query:  LASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCG
        LASSSSLNAVSASSSFS PSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCG
Subjt:  LASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCG

Query:  SYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNHVRGNDEHQG
        SYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SA KL SPIGGT MSNLRYGSNIDSGGN +RGNDEHQG
Subjt:  SYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNHVRGNDEHQG

Query:  LGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT
        LGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDL+
Subjt:  LGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT

XP_008456410.1 PREDICTED: AT-hook motif nuclear-localized protein 14 isoform X1 [Cucumis melo]3.1e-16995.28Show/hide
Query:  LSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE
        LSSYFHHHQHHHQ+P TTSPTNGLLPPTHHLS+AAA+SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE
Subjt:  LSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE

Query:  LASSSSLNAVSASSSFSAPSKKSQLAAL--GNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSL
        LASSSSLNAVSASSSFS PSKKSQLAAL  GNAGQGFAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSL
Subjt:  LASSSSLNAVSASSSFSAPSKKSQLAAL--GNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSL

Query:  CGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNHVRGNDEH
        CGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SAGKL SPIGGT MSNLRYGSNIDSGGN +RGNDEH
Subjt:  CGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNHVRGNDEH

Query:  QGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT
        QGLGESHFLLQPRGVNLTSPRSTDWRTGLDATN AYDL+
Subjt:  QGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT

XP_008456418.1 PREDICTED: AT-hook motif nuclear-localized protein 14 isoform X2 [Cucumis melo]9.7e-17195.85Show/hide
Query:  LSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE
        LSSYFHHHQHHHQ+P TTSPTNGLLPPTHHLS+AAA+SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE
Subjt:  LSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE

Query:  LASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCG
        LASSSSLNAVSASSSFS PSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCG
Subjt:  LASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCG

Query:  SYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNHVRGNDEHQG
        SYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SAGKL SPIGGT MSNLRYGSNIDSGGN +RGNDEHQG
Subjt:  SYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNHVRGNDEHQG

Query:  LGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT
        LGESHFLLQPRGVNLTSPRSTDWRTGLDATN AYDL+
Subjt:  LGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT

XP_038890429.1 AT-hook motif nuclear-localized protein 14 [Benincasa hispida]1.4e-17297.92Show/hide
Query:  LSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE
        LSSYFHHHQHHHQSP TTSPTNGLLPPTHHLSS AAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE
Subjt:  LSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE

Query:  LASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCG
        LASSSSLNAVSASSSFSAPSKKSQLA LGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCG
Subjt:  LASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCG

Query:  SYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNHVRGNDEHQG
        SYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKL SPIGGT MSNLRYGSNIDSGGN +RGNDEHQG
Subjt:  SYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNHVRGNDEHQG

Query:  LGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT
        LGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT
Subjt:  LGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT

TrEMBL top hitse value%identityAlignment
A0A0A0LJ73 AT-hook motif nuclear-localized protein5.2e-17095.83Show/hide
Query:  SSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKEL
        SSYFHHHQHHHQ+P TTSPTNGLLPPTHHLS+AAA+SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKEL
Subjt:  SSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKEL

Query:  ASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGS
        ASSSSLNAVSASSSFS PSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGS
Subjt:  ASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGS

Query:  YVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNHVRGNDEHQGL
        YVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SA KL SPIGGT MSNLRYGSNIDSGGN +RGNDEHQGL
Subjt:  YVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNHVRGNDEHQGL

Query:  GESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT
        GESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDL+
Subjt:  GESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT

A0A1S3C2R6 AT-hook motif nuclear-localized protein1.5e-16995.28Show/hide
Query:  LSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE
        LSSYFHHHQHHHQ+P TTSPTNGLLPPTHHLS+AAA+SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE
Subjt:  LSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE

Query:  LASSSSLNAVSASSSFSAPSKKSQLAAL--GNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSL
        LASSSSLNAVSASSSFS PSKKSQLAAL  GNAGQGFAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSL
Subjt:  LASSSSLNAVSASSSFSAPSKKSQLAAL--GNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSL

Query:  CGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNHVRGNDEH
        CGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SAGKL SPIGGT MSNLRYGSNIDSGGN +RGNDEH
Subjt:  CGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNHVRGNDEH

Query:  QGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT
        QGLGESHFLLQPRGVNLTSPRSTDWRTGLDATN AYDL+
Subjt:  QGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT

A0A1S3C3W0 AT-hook motif nuclear-localized protein4.7e-17195.85Show/hide
Query:  LSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE
        LSSYFHHHQHHHQ+P TTSPTNGLLPPTHHLS+AAA+SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE
Subjt:  LSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE

Query:  LASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCG
        LASSSSLNAVSASSSFS PSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCG
Subjt:  LASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCG

Query:  SYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNHVRGNDEHQG
        SYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SAGKL SPIGGT MSNLRYGSNIDSGGN +RGNDEHQG
Subjt:  SYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNHVRGNDEHQG

Query:  LGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT
        LGESHFLLQPRGVNLTSPRSTDWRTGLDATN AYDL+
Subjt:  LGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT

A0A5A7VCX4 AT-hook motif nuclear-localized protein9.8e-16993.35Show/hide
Query:  LSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE
        LSSYFHHHQHHHQ+P TTSPTNGLLPPTHHLS+AAA+SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE
Subjt:  LSSYFHHHQHHHQSP-TTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKE

Query:  LASSSSLNAVSASSSFSAPSKKSQLAAL---------GNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEG
        LASSSSLNAVSASSSFS PSKKSQLAAL         GNAGQGFAPHVINVAAGEDVGQKIM FMQQCKREICILSASGSISNASLRQPAASGGNIAYEG
Subjt:  LASSSSLNAVSASSSFSAPSKKSQLAAL---------GNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEG

Query:  RFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNH
        RFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKE GGGKGD SAGKL SPIGGT MSNLRYGSNIDSGGN 
Subjt:  RFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNH

Query:  VRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT
        +RGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATN AYDL+
Subjt:  VRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT

A0A6J1G4C2 AT-hook motif nuclear-localized protein2.0e-16192.26Show/hide
Query:  LSSYFHHHQHHHQSPTTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKEL
        LSSYF HHQHHHQSPTTSPTNGLLPPTHHLSS    SDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKK+L
Subjt:  LSSYFHHHQHHHQSPTTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKEL

Query:  ASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGS
        ASSSSLNAVSASSSFSA SKKSQLAALGNAGQGF+PHVINVAAGEDVGQKIM+FMQQCKREICILSASGSISNASLRQPA SGGNI YEGRFEIVSLCGS
Subjt:  ASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGS

Query:  YVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNHVRGNDEHQGL
        Y+RTD GGKTGGLSVCLSSA+GHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGG KGDASAGKL SP GGT MSNLRYGSN+D+GGN VRGNDEHQG+
Subjt:  YVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNHVRGNDEHQGL

Query:  GESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT
        GESHFLLQPRGVNLTS RSTDWR  LDATN AYDLT
Subjt:  GESHFLLQPRGVNLTSPRSTDWRTGLDATNTAYDLT

SwissProt top hitse value%identityAlignment
A1L4X7 AT-hook motif nuclear-localized protein 142.5e-7652.25Show/hide
Query:  ILSSYFHHH-QHHHQSPTT------------SPTNGLLPP---THHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKK
        + S YFHH  QHHH  PTT            S  NGL PP     H  +  ++S A    VYPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK
Subjt:  ILSSYFHHH-QHHHQSPTT------------SPTNGLLPP---THHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKK

Query:  AATASSHSSSSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASG
         A+++S SSS+K ++ELA+ +        S+ S  SKKSQL ++G  GQ F PH++N+A GEDV QKIMMF  Q K E+C+LSASG+ISNASLRQPA SG
Subjt:  AATASSHSSSSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASG

Query:  GNIAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEV--GGGKGDA--SAGKLSSPIGGTLMSNLR
        GN+ YEG++EI+SL GSY+RT+ GGK+GGLSV LS+++G IIGG +G  L AAGPVQVI+GTF +D KK+    GGKGDA  S  +L+SP+    +  + 
Subjt:  GNIAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEV--GGGKGDA--SAGKLSSPIGGTLMSNLR

Query:  YGSNIDS-GGNHVRGNDE------HQ-GL-GESHFLLQ-PRGVNLTSPRSTDWRTG
        +   ++S G N +RGNDE      HQ GL G  HF++Q P+G+++T  R ++WR G
Subjt:  YGSNIDS-GGNHVRGNDE------HQ-GL-GESHFLLQ-PRGVNLTSPRSTDWRTG

O22812 AT-hook motif nuclear-localized protein 108.3e-3239.92Show/hide
Query:  PTNGLLPPTHHLSSAAAASDAGPHVVYPHSVP---SAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKELASSSSLNAVSASSSF
        P   + PP  +       + AG + V   ++P   S  ++ +  EP +++RGRPRKYG     ++      A S + S  +               SS  
Subjt:  PTNGLLPPTHHLSSAAAASDAGPHVVYPHSVP---SAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKELASSSSLNAVSASSSF

Query:  SAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGG---KTGG
           SK+ +L ALG+ G GF PHV+ V AGEDV  KIM       R +C+LSA+G+ISN +LRQ A SGG + YEGRFEI+SL GS+   +  G   +TGG
Subjt:  SAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGG---KTGG

Query:  LSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPI
        LSV LSS +G+++GG V G L AA PVQ++VG+F+ D +KE     G      LSSP+
Subjt:  LSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPI

O80834 AT-hook motif nuclear-localized protein 91.9e-3141.41Show/hide
Query:  HQHHHQSPTTSPTNGLLPPTHH----LSSAAAASDAGPHVVYPHSVPSAAVSSSPLE-PARRKRGRPRKY---GTPEEALAAKKAATASSHSSSSKAKKE
        H  +  SP  S + G   P+ H    L++AA  + A PH +    V   A    P E P +RKRGRPRKY   G+   AL++   +T + ++S+ + +  
Subjt:  HQHHHQSPTTSPTNGLLPPTHH----LSSAAAASDAGPHVVYPHSVPSAAVSSSPLE-PARRKRGRPRKY---GTPEEALAAKKAATASSHSSSSKAKKE

Query:  LASSSSLNAVSASSSFSAPSKKSQLAALG-----NAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEI
           S                KK ++A++G     ++G  F PHVI V+ GED+  K++ F QQ  R IC+LSASG++S A+L QP+AS G I YEGRFEI
Subjt:  LASSSSLNAVSASSSFSAPSKKSQLAALG-----NAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEI

Query:  VSLCGSY-VRTD--LGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFV
        ++L  SY V TD     +TG LSV L+S +G +IGG +GGPL AA PVQVIVG+F+
Subjt:  VSLCGSY-VRTD--LGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFV

Q8VYJ2 AT-hook motif nuclear-localized protein 11.1e-2837.68Show/hide
Query:  HHQHHHQSPTTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAK----KAATASSH-SSSSKAKKELA
        H     +S   SPT+ + PP    SS   A           +  +AA+        ++KRGRPRKYG     +A       +A A SH    S    + +
Subjt:  HHQHHHQSPTTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAK----KAATASSH-SSSSKAKKELA

Query:  SSSSLNAVSASSSFSAPSKKSQLAALG-----NAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVS
        +S   + V  ++SF+      Q+  LG     + G  F PH+I V  GEDV  KI+ F QQ  R IC+LSA+G IS+ +LRQP +SGG + YEGRFEI+S
Subjt:  SSSSLNAVSASSSFSAPSKKSQLAALG-----NAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVS

Query:  LCGSYVRTDLGG---KTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSN
        L GS++  D GG   +TGG+SV L+S +G ++GGG+ G L AA PVQV+VG+F+     +    K +     LSSP     +S+
Subjt:  LCGSYVRTDLGG---KTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSN

Q9SB31 AT-hook motif nuclear-localized protein 32.1e-3039.64Show/hide
Query:  IGRDILSSYFHHHQH--------HHQSPTTSPTNG---LLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKY---GTPEEALAA
        I  +I SS+    QH        +   P   P N    L+PPT   ++A  A+    +   P S+     ++S  E  ++KRGRPRKY   GT    L+ 
Subjt:  IGRDILSSYFHHHQH--------HHQSPTTSPTNG---LLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKY---GTPEEALAA

Query:  KKAATASSHSSSSKAKKELASSSSLN---AVSASSSFSAPSKKSQLAALGNA---GQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNAS
           +++   +S    +K        N     S    F      + LA +G A   G  F PHV+ V AGEDV  KIM F QQ  R ICILSA+G ISN +
Subjt:  KKAATASSHSSSSKAKKELASSSSLN---AVSASSSFSAPSKKSQLAALGNA---GQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNAS

Query:  LRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGG---KTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFV
        LRQ   SGG + YEGRFEI+SL GS+++ D GG   + GG+SVCL+  +G + GGG+ G   AAGPVQV+VGTF+
Subjt:  LRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGG---KTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFV

Arabidopsis top hitse value%identityAlignment
AT1G54650.1 Methyltransferase family protein3.4e-8161.15Show/hide
Query:  AAEYFSKDFEWEDLRVRVENDPLLQYHLLPFEPCNSIPSVSPEADSDAWSRFHLRHSSGKFFKERRYLLKEFPELVSCKKYCKGLEVGCGNGSTVIPILR
        A EY SKDFEWE L+  VENDP L +HL       ++    P  DS  W  FH RHSSGKFFKERRYLLKEFPELVSC +  K LE+GCGNGSTV+PILR
Subjt:  AAEYFSKDFEWEDLRVRVENDPLLQYHLLPFEPCNSIPSVSPEADSDAWSRFHLRHSSGKFFKERRYLLKEFPELVSCKKYCKGLEVGCGNGSTVIPILR

Query:  GNENIIIYACDCSTGTLERAKEILNNAGIASLNDRFCPFYCDFSISKFPTWLACNSCRGNTFQQQPGFTTPENDGSQATCLFALEESGCCIGGVDFLTLI
        G++NI +YACDCS+  L R KE ++ A I+S+ D F  F CDFS S+FP W+AC+ CR         F    + GS         ES  CIGGVDF+TLI
Subjt:  GNENIIIYACDCSTGTLERAKEILNNAGIASLNDRFCPFYCDFSISKFPTWLACNSCRGNTFQQQPGFTTPENDGSQATCLFALEESGCCIGGVDFLTLI

Query:  FTLSAVPLQRMPASIRECFMVLKPGGLLLFRDYGLYDMTMLRFGQDQRVGFREYVRLDGT
        FTLSAVP +RMP +I+ECF VLKPGGLLLFRDYGLYDMTMLRF  ++RVGFREYVR DGT
Subjt:  FTLSAVPLQRMPASIRECFMVLKPGGLLLFRDYGLYDMTMLRFGQDQRVGFREYVRLDGT

AT1G54650.2 Methyltransferase family protein6.9e-8261.54Show/hide
Query:  AAEYFSKDFEWEDLRVRVENDPLLQYHLLPFEPCNSIPSVSPEADSDAWSRFHLRHSSGKFFKERRYLLKEFPELVSCKKYCKGLEVGCGNGSTVIPILR
        A EY SKDFEWE L+  VENDP L +HL       ++    P  DS  W  FH RHSSGKFFKERRYLLKEFPELVSC +  K LE+GCGNGSTV+PILR
Subjt:  AAEYFSKDFEWEDLRVRVENDPLLQYHLLPFEPCNSIPSVSPEADSDAWSRFHLRHSSGKFFKERRYLLKEFPELVSCKKYCKGLEVGCGNGSTVIPILR

Query:  GNENIIIYACDCSTGTLERAKEILNNAGIASLNDRFCPFYCDFSISKFPTWLACNSCRGNTFQQQPGFTTPENDGSQATCLFALEESGCCIGGVDFLTLI
        G++NI +YACDCS+  L R KE ++ A I+S+ D F  F CDFS S+FP W+AC+ CR        GF      GS         ES  CIGGVDF+TLI
Subjt:  GNENIIIYACDCSTGTLERAKEILNNAGIASLNDRFCPFYCDFSISKFPTWLACNSCRGNTFQQQPGFTTPENDGSQATCLFALEESGCCIGGVDFLTLI

Query:  FTLSAVPLQRMPASIRECFMVLKPGGLLLFRDYGLYDMTMLRFGQDQRVGFREYVRLDGT
        FTLSAVP +RMP +I+ECF VLKPGGLLLFRDYGLYDMTMLRF  ++RVGFREYVR DGT
Subjt:  FTLSAVPLQRMPASIRECFMVLKPGGLLLFRDYGLYDMTMLRFGQDQRVGFREYVRLDGT

AT2G33620.1 AT hook motif DNA-binding family protein5.9e-3339.92Show/hide
Query:  PTNGLLPPTHHLSSAAAASDAGPHVVYPHSVP---SAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKELASSSSLNAVSASSSF
        P   + PP  +       + AG + V   ++P   S  ++ +  EP +++RGRPRKYG     ++      A S + S  +               SS  
Subjt:  PTNGLLPPTHHLSSAAAASDAGPHVVYPHSVP---SAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKELASSSSLNAVSASSSF

Query:  SAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGG---KTGG
           SK+ +L ALG+ G GF PHV+ V AGEDV  KIM       R +C+LSA+G+ISN +LRQ A SGG + YEGRFEI+SL GS+   +  G   +TGG
Subjt:  SAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGG---KTGG

Query:  LSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPI
        LSV LSS +G+++GG V G L AA PVQ++VG+F+ D +KE     G      LSSP+
Subjt:  LSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPI

AT3G04590.1 AT hook motif DNA-binding family protein1.5e-6856.64Show/hide
Query:  ILSSYFHHH-QHHHQSPTT------------SPTNGLLPP---THHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKK
        + S YFHH  QHHH  PTT            S  NGL PP     H  +  ++S A    VYPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK
Subjt:  ILSSYFHHH-QHHHQSPTT------------SPTNGLLPP---THHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKK

Query:  AATASSHSSSSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASG
         A+++S SSS+K ++ELA+ +        S+ S  SKKSQL ++G  GQ F PH++N+A GEDV QKIMMF  Q K E+C+LSASG+ISNASLRQPA SG
Subjt:  AATASSHSSSSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASG

Query:  GNIAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEV--GGGKGDAS-AGKL
        GN+ YEG++EI+SL GSY+RT+ GGK+GGLSV LS+++G IIGG +G  L AAGPVQVI+GTF +D KK+    GGKGDAS +GK+
Subjt:  GNIAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEV--GGGKGDAS-AGKL

AT3G04590.2 AT hook motif DNA-binding family protein1.8e-7752.25Show/hide
Query:  ILSSYFHHH-QHHHQSPTT------------SPTNGLLPP---THHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKK
        + S YFHH  QHHH  PTT            S  NGL PP     H  +  ++S A    VYPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK
Subjt:  ILSSYFHHH-QHHHQSPTT------------SPTNGLLPP---THHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKK

Query:  AATASSHSSSSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASG
         A+++S SSS+K ++ELA+ +        S+ S  SKKSQL ++G  GQ F PH++N+A GEDV QKIMMF  Q K E+C+LSASG+ISNASLRQPA SG
Subjt:  AATASSHSSSSKAKKELASSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASG

Query:  GNIAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEV--GGGKGDA--SAGKLSSPIGGTLMSNLR
        GN+ YEG++EI+SL GSY+RT+ GGK+GGLSV LS+++G IIGG +G  L AAGPVQVI+GTF +D KK+    GGKGDA  S  +L+SP+    +  + 
Subjt:  GNIAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEV--GGGKGDA--SAGKLSSPIGGTLMSNLR

Query:  YGSNIDS-GGNHVRGNDE------HQ-GL-GESHFLLQ-PRGVNLTSPRSTDWRTG
        +   ++S G N +RGNDE      HQ GL G  HF++Q P+G+++T  R ++WR G
Subjt:  YGSNIDS-GGNHVRGNDE------HQ-GL-GESHFLLQ-PRGVNLTSPRSTDWRTG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTAAAGGAATTGGCAGAGACATACTCAGCTCCTACTTCCACCACCATCAACACCACCATCAGAGTCCCACCACATCGCCGACCAATGGCCTTTTACCACCCACCCA
CCACCTCTCCTCCGCCGCCGCCGCCTCCGACGCCGGCCCTCATGTCGTATACCCTCACTCTGTCCCTTCCGCCGCCGTGTCCTCGTCTCCTCTCGAGCCCGCACGCCGGA
AGAGAGGCCGGCCGAGGAAGTACGGCACACCGGAAGAAGCTTTAGCCGCTAAAAAAGCTGCTACGGCGTCGTCTCACTCTTCGTCCTCTAAGGCTAAGAAGGAACTCGCT
TCTTCTTCTTCCCTTAATGCCGTTTCCGCTTCTTCTTCCTTCTCCGCGCCTTCCAAAAAATCTCAGTTGGCTGCACTTGGTAATGCAGGCCAAGGTTTTGCGCCACATGT
TATTAATGTGGCAGCTGGTGAGGATGTGGGCCAGAAAATTATGATGTTTATGCAACAATGTAAGCGGGAAATTTGTATCCTTTCTGCATCTGGTTCAATCTCCAATGCAT
CTCTCCGTCAGCCAGCCGCATCTGGAGGCAATATTGCATATGAGGGTCGTTTTGAGATTGTTTCGTTATGCGGATCTTATGTACGAACTGACCTCGGAGGAAAGACGGGT
GGTCTTAGTGTATGTCTATCAAGTGCTGAAGGCCACATCATAGGAGGGGGAGTGGGTGGACCGTTGAAGGCTGCTGGACCCGTGCAGGTCATTGTTGGAACCTTTGTAAT
CGACCCCAAGAAGGAAGTTGGTGGTGGTAAAGGCGACGCATCTGCTGGCAAGTTGTCCTCGCCTATTGGTGGGACATTGATGTCAAATCTACGCTATGGCTCCAACATTG
ACTCGGGAGGTAATCATGTAAGGGGAAATGATGAACACCAAGGTCTTGGAGAGAGTCATTTTTTGCTTCAGCCCCGGGGAGTGAATCTGACATCACCGCGATCAACAGAT
TGGAGAACGGGTCTGGATGCCACAAACACTGCTTATGATTTGACAGCCGCTGAATATTTCTCCAAAGACTTCGAATGGGAGGATCTTCGCGTTCGGGTCGAGAACGACCC
GCTTTTGCAGTACCATTTGCTTCCATTTGAGCCCTGCAATTCCATTCCTTCAGTTTCTCCAGAAGCCGATTCGGACGCATGGAGCCGCTTTCACCTTCGCCACTCCTCCG
GCAAATTCTTCAAGGAAAGGAGGTATTTGTTGAAGGAATTCCCTGAACTTGTTTCTTGTAAGAAGTATTGTAAAGGTTTGGAGGTGGGTTGTGGTAATGGAAGCACTGTT
ATTCCAATCTTGCGTGGGAACGAGAATATCATCATCTATGCTTGCGATTGTAGCACAGGAACTCTTGAGAGAGCTAAAGAGATTTTAAATAATGCCGGCATTGCATCTTT
AAATGATCGATTTTGTCCATTCTATTGTGATTTTTCTATTAGTAAGTTTCCAACATGGTTGGCTTGCAATTCATGCCGTGGAAACACTTTCCAACAGCAGCCGGGTTTTA
CTACTCCAGAGAATGATGGATCTCAGGCCACTTGTTTATTTGCATTAGAAGAAAGTGGATGCTGCATTGGTGGGGTTGATTTTCTTACTTTGATATTTACACTATCAGCA
GTGCCTCTCCAGAGGATGCCAGCATCTATCAGAGAATGCTTTATGGTTTTGAAGCCTGGAGGCCTGCTTTTATTTCGGGATTATGGTCTATATGATATGACTATGCTTCG
ATTTGGGCAAGATCAAAGAGTAGGATTCAGGGAATATGTCAGATTAGATGGAACACCTCGAGCTGGAATATTGTTGTGTGAAGTCCTTGAATCGCCGAAATGGAAAGAGT
ATGGAGAGGGTGTGGGTTCATGGAAAGTTCCAGAAGCCTGTATCGTGAATCAAGCTAGAGGCACAGAACAACCTTTGAAGTTACTTGTTTTCGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTAAAGGAATTGGCAGAGACATACTCAGCTCCTACTTCCACCACCATCAACACCACCATCAGAGTCCCACCACATCGCCGACCAATGGCCTTTTACCACCCACCCA
CCACCTCTCCTCCGCCGCCGCCGCCTCCGACGCCGGCCCTCATGTCGTATACCCTCACTCTGTCCCTTCCGCCGCCGTGTCCTCGTCTCCTCTCGAGCCCGCACGCCGGA
AGAGAGGCCGGCCGAGGAAGTACGGCACACCGGAAGAAGCTTTAGCCGCTAAAAAAGCTGCTACGGCGTCGTCTCACTCTTCGTCCTCTAAGGCTAAGAAGGAACTCGCT
TCTTCTTCTTCCCTTAATGCCGTTTCCGCTTCTTCTTCCTTCTCCGCGCCTTCCAAAAAATCTCAGTTGGCTGCACTTGGTAATGCAGGCCAAGGTTTTGCGCCACATGT
TATTAATGTGGCAGCTGGTGAGGATGTGGGCCAGAAAATTATGATGTTTATGCAACAATGTAAGCGGGAAATTTGTATCCTTTCTGCATCTGGTTCAATCTCCAATGCAT
CTCTCCGTCAGCCAGCCGCATCTGGAGGCAATATTGCATATGAGGGTCGTTTTGAGATTGTTTCGTTATGCGGATCTTATGTACGAACTGACCTCGGAGGAAAGACGGGT
GGTCTTAGTGTATGTCTATCAAGTGCTGAAGGCCACATCATAGGAGGGGGAGTGGGTGGACCGTTGAAGGCTGCTGGACCCGTGCAGGTCATTGTTGGAACCTTTGTAAT
CGACCCCAAGAAGGAAGTTGGTGGTGGTAAAGGCGACGCATCTGCTGGCAAGTTGTCCTCGCCTATTGGTGGGACATTGATGTCAAATCTACGCTATGGCTCCAACATTG
ACTCGGGAGGTAATCATGTAAGGGGAAATGATGAACACCAAGGTCTTGGAGAGAGTCATTTTTTGCTTCAGCCCCGGGGAGTGAATCTGACATCACCGCGATCAACAGAT
TGGAGAACGGGTCTGGATGCCACAAACACTGCTTATGATTTGACAGCCGCTGAATATTTCTCCAAAGACTTCGAATGGGAGGATCTTCGCGTTCGGGTCGAGAACGACCC
GCTTTTGCAGTACCATTTGCTTCCATTTGAGCCCTGCAATTCCATTCCTTCAGTTTCTCCAGAAGCCGATTCGGACGCATGGAGCCGCTTTCACCTTCGCCACTCCTCCG
GCAAATTCTTCAAGGAAAGGAGGTATTTGTTGAAGGAATTCCCTGAACTTGTTTCTTGTAAGAAGTATTGTAAAGGTTTGGAGGTGGGTTGTGGTAATGGAAGCACTGTT
ATTCCAATCTTGCGTGGGAACGAGAATATCATCATCTATGCTTGCGATTGTAGCACAGGAACTCTTGAGAGAGCTAAAGAGATTTTAAATAATGCCGGCATTGCATCTTT
AAATGATCGATTTTGTCCATTCTATTGTGATTTTTCTATTAGTAAGTTTCCAACATGGTTGGCTTGCAATTCATGCCGTGGAAACACTTTCCAACAGCAGCCGGGTTTTA
CTACTCCAGAGAATGATGGATCTCAGGCCACTTGTTTATTTGCATTAGAAGAAAGTGGATGCTGCATTGGTGGGGTTGATTTTCTTACTTTGATATTTACACTATCAGCA
GTGCCTCTCCAGAGGATGCCAGCATCTATCAGAGAATGCTTTATGGTTTTGAAGCCTGGAGGCCTGCTTTTATTTCGGGATTATGGTCTATATGATATGACTATGCTTCG
ATTTGGGCAAGATCAAAGAGTAGGATTCAGGGAATATGTCAGATTAGATGGAACACCTCGAGCTGGAATATTGTTGTGTGAAGTCCTTGAATCGCCGAAATGGAAAGAGT
ATGGAGAGGGTGTGGGTTCATGGAAAGTTCCAGAAGCCTGTATCGTGAATCAAGCTAGAGGCACAGAACAACCTTTGAAGTTACTTGTTTTCGATTGA
Protein sequenceShow/hide protein sequence
MFKGIGRDILSSYFHHHQHHHQSPTTSPTNGLLPPTHHLSSAAAASDAGPHVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKELA
SSSSLNAVSASSSFSAPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMMFMQQCKREICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTG
GLSVCLSSAEGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEVGGGKGDASAGKLSSPIGGTLMSNLRYGSNIDSGGNHVRGNDEHQGLGESHFLLQPRGVNLTSPRSTD
WRTGLDATNTAYDLTAAEYFSKDFEWEDLRVRVENDPLLQYHLLPFEPCNSIPSVSPEADSDAWSRFHLRHSSGKFFKERRYLLKEFPELVSCKKYCKGLEVGCGNGSTV
IPILRGNENIIIYACDCSTGTLERAKEILNNAGIASLNDRFCPFYCDFSISKFPTWLACNSCRGNTFQQQPGFTTPENDGSQATCLFALEESGCCIGGVDFLTLIFTLSA
VPLQRMPASIRECFMVLKPGGLLLFRDYGLYDMTMLRFGQDQRVGFREYVRLDGTPRAGILLCEVLESPKWKEYGEGVGSWKVPEACIVNQARGTEQPLKLLVFD