; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh12G011670 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh12G011670
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionAT-hook motif nuclear-localized protein
Genome locationCmo_Chr12:10558067..10567660
RNA-Seq ExpressionCmoCh12G011670
SyntenyCmoCh12G011670
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003680 - AT DNA binding (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
GO:0051213 - dioxygenase activity (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR005175 - PPC domain
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR039605 - AT-hook motif nuclear-localized protein
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6586330.1 putative prolyl 4-hydroxylase 3, partial [Cucurbita argyrosperma subsp. sororia]1.9e-17579.5Show/hide
Query:  PVQVIVGTFITDPTKEAGGGIKGDTSAGKLSSPTGGTSMSGLRYGSNIDLGGNQVRGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYDFTE
        P QVIVGTF+ DPTKEAG  IK DTS GKLSSPTGGTSMSGLRY                                           GLDGTT AYDFT+
Subjt:  PVQVIVGTFITDPTKEAGGGIKGDTSAGKLSSPTGGTSMSGLRYGSNIDLGGNQVRGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYDFTE

Query:  VAAYRNRGLILLKLLRSLMAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFIS
        VAAYRNRGLILLKL+RSLMAV KGKYIKFQ RKWSTFKLSKI+MA LLALG+SM IAFRFFSP ESSHSNLLHR+ASVQHRAVHSDGLGKR DQWVEFIS
Subjt:  VAAYRNRGLILLKLLRSLMAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFIS

Query:  WEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDF
        WEPRAFVYHNFLSKEECLYLISLA PYMKKSTVVD KTGK  DSR RTSSGMFL RGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAH+D+
Subjt:  WEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDF

Query:  ISEEF-IRKGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWM
         ++EF I++GGQR+ATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFW+++P+NT+DPTSLHGACPVIRGNKWSCTKWM
Subjt:  ISEEF-IRKGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWM

KAG6586331.1 putative prolyl 4-hydroxylase 3, partial [Cucurbita argyrosperma subsp. sororia]1.8e-19799.42Show/hide
Query:  MAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECL
        MAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECL
Subjt:  MAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECL

Query:  YLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEFIRKGGQRIATLLM
        YLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEFIRKGGQRIATLLM
Subjt:  YLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEFIRKGGQRIATLLM

Query:  YLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMHGRFEIVSLCGSYVRADTG
        YLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMHGRFEIVSLCGSYVRADTG
Subjt:  YLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMHGRFEIVSLCGSYVRADTG

Query:  GKTGGLSVCLSSADGYIIGGGVGGPLKAAGPVQVNIQDETRQDVT
        GKTGGLSVCLSSADGYIIGGGVGGPLKAAGPVQVNIQDETR+D T
Subjt:  GKTGGLSVCLSSADGYIIGGGVGGPLKAAGPVQVNIQDETRQDVT

XP_022937925.1 probable prolyl 4-hydroxylase 3 [Cucurbita moschata]1.3e-17199.67Show/hide
Query:  TEVAAYRNRGLILLKLLRSLMAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEF
        +EVAAYRNRGLILLKLLRSLMAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEF
Subjt:  TEVAAYRNRGLILLKLLRSLMAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEF

Query:  ISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHY
        ISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHY
Subjt:  ISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHY

Query:  DFISEEFIRKGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKW
        DFISEEFIRKGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKW
Subjt:  DFISEEFIRKGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKW

Query:  MH
        MH
Subjt:  MH

XP_022937928.1 AT-hook motif nuclear-localized protein 14-like [Cucurbita moschata]9.0e-17391.71Show/hide
Query:  MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKK
        MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKK
Subjt:  MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKK

Query:  DLVSSSSLNAVSASSKKSQLAALGNAGQGFAPQVIDVAAGEESGAVVGWRLKFMGNLNDREIEGCMAIWSKDVGQKIMLFMQQCKREICILSASGLISNA
        DLVSSSSLNAVSASSKKSQLAALGNAGQGFAPQVIDVAAGE                              DVGQKIMLFMQQCKREICILSASGLISNA
Subjt:  DLVSSSSLNAVSASSKKSQLAALGNAGQGFAPQVIDVAAGEESGAVVGWRLKFMGNLNDREIEGCMAIWSKDVGQKIMLFMQQCKREICILSASGLISNA

Query:  SLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKEAGGGIKGDTSAGKLSSPTGGT
        SLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKEAGGGIKGDTSAGKLSSPTGGT
Subjt:  SLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKEAGGGIKGDTSAGKLSSPTGGT

Query:  SMSGLRYGSNIDLGGNQVRGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYDFT
        SMSGLRYGSNIDLGGNQVRGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYDFT
Subjt:  SMSGLRYGSNIDLGGNQVRGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYDFT

XP_023537747.1 AT-hook motif nuclear-localized protein 14-like [Cucurbita pepo subsp. pepo]2.9e-17190.88Show/hide
Query:  MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKK
        MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAA+SSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKK
Subjt:  MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKK

Query:  DLVSSSSLNAVSASSKKSQLAALGNAGQGFAPQVIDVAAGEESGAVVGWRLKFMGNLNDREIEGCMAIWSKDVGQKIMLFMQQCKREICILSASGLISNA
        DLVSSSSLNAVSASSKKSQLAALGNAGQGFAPQVIDVAAGE                              DVGQKIMLFMQQCKREICILSASG ISNA
Subjt:  DLVSSSSLNAVSASSKKSQLAALGNAGQGFAPQVIDVAAGEESGAVVGWRLKFMGNLNDREIEGCMAIWSKDVGQKIMLFMQQCKREICILSASGLISNA

Query:  SLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKEAGGGIKGDTSAGKLSSPTGGT
        SLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFI DPTKEAGGGIKGDTSAGKLSSPTGGT
Subjt:  SLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKEAGGGIKGDTSAGKLSSPTGGT

Query:  SMSGLRYGSNIDLGGNQVRGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYDFT
        SMSGLRYGSNIDLGGNQVRGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYDFT
Subjt:  SMSGLRYGSNIDLGGNQVRGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYDFT

TrEMBL top hitse value%identityAlignment
A0A6J1FHE6 AT-hook motif nuclear-localized protein4.4e-17391.71Show/hide
Query:  MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKK
        MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKK
Subjt:  MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKK

Query:  DLVSSSSLNAVSASSKKSQLAALGNAGQGFAPQVIDVAAGEESGAVVGWRLKFMGNLNDREIEGCMAIWSKDVGQKIMLFMQQCKREICILSASGLISNA
        DLVSSSSLNAVSASSKKSQLAALGNAGQGFAPQVIDVAAGE                              DVGQKIMLFMQQCKREICILSASGLISNA
Subjt:  DLVSSSSLNAVSASSKKSQLAALGNAGQGFAPQVIDVAAGEESGAVVGWRLKFMGNLNDREIEGCMAIWSKDVGQKIMLFMQQCKREICILSASGLISNA

Query:  SLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKEAGGGIKGDTSAGKLSSPTGGT
        SLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKEAGGGIKGDTSAGKLSSPTGGT
Subjt:  SLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKEAGGGIKGDTSAGKLSSPTGGT

Query:  SMSGLRYGSNIDLGGNQVRGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYDFT
        SMSGLRYGSNIDLGGNQVRGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYDFT
Subjt:  SMSGLRYGSNIDLGGNQVRGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYDFT

A0A6J1FI70 probable prolyl 4-hydroxylase 36.3e-17299.67Show/hide
Query:  TEVAAYRNRGLILLKLLRSLMAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEF
        +EVAAYRNRGLILLKLLRSLMAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEF
Subjt:  TEVAAYRNRGLILLKLLRSLMAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEF

Query:  ISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHY
        ISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHY
Subjt:  ISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHY

Query:  DFISEEFIRKGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKW
        DFISEEFIRKGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKW
Subjt:  DFISEEFIRKGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKW

Query:  MH
        MH
Subjt:  MH

A0A6J1FI75 probable prolyl 4-hydroxylase 31.3e-14589.4Show/hide
Query:  MAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECL
        MAV KGKYIKFQGRKWSTFKLSKI+M  +LALG+SMFIAFRFFSP ESSHS LLHRLASVQH AVHSDGLGKREDQWVE ISWEPRAFVYHNFLSKEECL
Subjt:  MAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECL

Query:  YLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIATLL
        YLISLA PYM+KSTVVDIKTGK  DSR RTSSGMFL RGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDF ++EF I+ GGQRIATLL
Subjt:  YLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIATLL

Query:  MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH
        MYLSDVEEGGETVFPAAEGNFSSLPGWNE SECGKGGLS+ PKMGDALLFW++RP+NTLDPTS+HG+CPVIRGNKWSCTKWMH
Subjt:  MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH

A0A6J1HMV0 probable prolyl 4-hydroxylase 39.2e-14790.11Show/hide
Query:  MAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECL
        MAVLKGKYIKFQGRKWSTFKLSKI+M  LLALG+SMFIAFRFFSP ESSHS LLHRLASVQH AVHSDGLGKR DQWVEFISWEPRAFVYHNFLSKEECL
Subjt:  MAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECL

Query:  YLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIATLL
        YLISLA PYM+KSTVVD KTGK  DSR RTSSGMFL+RGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDF ++EF I+ GGQRIATLL
Subjt:  YLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIATLL

Query:  MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH
        MYLSDVEEGGETVFPAAEGNFSSLPGWNE SECGKGGLS+KPKMGDALLFW++RP+NTLDPTS+HG+CPVIRGNKWSCTKWMH
Subjt:  MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH

A0A6J1HRZ8 AT-hook motif nuclear-localized protein4.1e-17190.61Show/hide
Query:  MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKK
        MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKK
Subjt:  MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKK

Query:  DLVSSSSLNAVSASSKKSQLAALGNAGQGFAPQVIDVAAGEESGAVVGWRLKFMGNLNDREIEGCMAIWSKDVGQKIMLFMQQCKREICILSASGLISNA
        DLVSSSSLNAVSASSKKSQLAALGNAGQGFAPQVIDVAAGE                              DVGQKIMLFMQQCKREICILSASGL+SNA
Subjt:  DLVSSSSLNAVSASSKKSQLAALGNAGQGFAPQVIDVAAGEESGAVVGWRLKFMGNLNDREIEGCMAIWSKDVGQKIMLFMQQCKREICILSASGLISNA

Query:  SLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKEAGGGIKGDTSAGKLSSPTGGT
        SLRQP+SSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFI DPTKEAGGGIKGDTSAGKLSSPTGGT
Subjt:  SLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKEAGGGIKGDTSAGKLSSPTGGT

Query:  SMSGLRYGSNIDLGGNQVRGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYDFT
        SMSGLRYGSNIDLGGNQV GNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYDFT
Subjt:  SMSGLRYGSNIDLGGNQVRGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYDFT

SwissProt top hitse value%identityAlignment
A1L4X7 AT-hook motif nuclear-localized protein 143.1e-6746.93Show/hide
Query:  SSYFHHH-QHHHQSPTT------------SPTNGLL---PSTHHLSSADATTHVLYPHSVPSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASS
        S YFHH  QHHH  PTT            S  NGL    P   H  +  +++  +YPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK A+++S
Subjt:  SSYFHHH-QHHHQSPTT------------SPTNGLL---PSTHHLSSADATTHVLYPHSVPSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASS

Query:  HSSSKAKKDL--VSSSSLNAVSASSKKSQLAALGNAGQGFAPQVIDVAAGEESGAVVGWRLKFMGNLNDREIEGCMAIWSKDVGQKIMLFMQQCKREICI
         SS+K +++L  V+  +++  S SSKKSQL ++G  GQ F P ++++A GE                              DV QKIM+F  Q K E+C+
Subjt:  HSSSKAKKDL--VSSSSLNAVSASSKKSQLAALGNAGQGFAPQVIDVAAGEESGAVVGWRLKFMGNLNDREIEGCMAIWSKDVGQKIMLFMQQCKREICI

Query:  LSASGLISNASLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKE-AGGGIKGD--
        LSASG ISNASLRQPA SGGN+ YEG++EI+SL GSY+RT+ GGK+GGLSV LS++DG IIGG +G  L AAGPVQVI+GTF  D  K+ AG G KGD  
Subjt:  LSASGLISNASLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKE-AGGGIKGD--

Query:  TSAGKLSSPTGGTSMSGLRYGSNID-LGGNQVRGNNE------HQ-GI-GESHFLLQ-PRGVNLTSPR-SNWRTG
         S  +L+SP     + G+ +   ++  G N +RGN+E      HQ G+ G  HF++Q P+G+++T  R S WR G
Subjt:  TSAGKLSSPTGGTSMSGLRYGSNID-LGGNQVRGNNE------HQ-GI-GESHFLLQ-PRGVNLTSPR-SNWRTG

F4JNU8 Probable prolyl 4-hydroxylase 86.4e-8958.3Show/hide
Query:  KGKYIKFQGRK-WSTFKLSKIVMAL---LLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECL
        K K ++ + RK +ST   + +V+ L   L+ +G+ +F +    + T S   +L   + ++Q R    D      D+W+E ISWEPRAFVYHNFL+ EEC 
Subjt:  KGKYIKFQGRK-WSTFKLSKIVMAL---LLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECL

Query:  YLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIATLL
        +LISLA P M KS VVD+KTGK  DSR RTSSG FLNRG ++IV  IE RI+DFTFIP E+GE LQ+LHYEVGQ+Y+ H+D+  +EF +RKGGQRIAT+L
Subjt:  YLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIATLL

Query:  MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH
        MYLSDV+EGGETVFPAA+GN S +P W+ELS+CGK GLSV PK  DALLFW+++P+ +LDP+SLHG CPVI+GNKWS TKW H
Subjt:  MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH

F4JZ24 Probable prolyl 4-hydroxylase 105.1e-9463.5Show/hide
Query:  LSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGL-GKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIK
        +S  V+ +LLA GI          P+ ++ S+  + L S+  + +   G    + ++WVE ISWEPRA VYHNFL+KEEC YLI LA P+M+KSTVVD K
Subjt:  LSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGL-GKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIK

Query:  TGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIATLLMYLSDVEEGGETVFPAAEG
        TGK  DSR RTSSG FL RG++K +  IEKRI+DFTFIPVEHGE LQ+LHYE+GQKY+ HYD+  +E+  R GGQRIAT+LMYLSDVEEGGETVFPAA+G
Subjt:  TGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIATLLMYLSDVEEGGETVFPAAEG

Query:  NFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWM
        N+S++P WNELSECGKGGLSVKPKMGDALLFW++ P+ TLDP+SLHG C VI+GNKWS TKW+
Subjt:  NFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWM

Q24JN5 Prolyl 4-hydroxylase 52.1e-8756.49Show/hide
Query:  MAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFS-PTESSHSNLLHRLASVQHRAVHSDGLGK-REDQWVEFISWEPRAFVYHNFLSKEE
        MA    +++++Q RK  +       + +LL + I + +     S P  + +S+  + L ++  ++  S G  +   ++WVE ISWEPRA VYHNFL+ EE
Subjt:  MAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFS-PTESSHSNLLHRLASVQHRAVHSDGLGK-REDQWVEFISWEPRAFVYHNFLSKEE

Query:  CLYLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIAT
        C +LISLA P M KSTVVD KTG  KDSR RTSSG FL RG +++V  IEKRI+DFTFIPVE+GE LQ+LHY+VGQKY+ HYD+  +EF  + GGQRIAT
Subjt:  CLYLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIAT

Query:  LLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH
        +LMYLSDV++GGETVFPAA GN S++P WNELS+CGK GLSV PK  DALLFW +RP+ +LDP+SLHG CPV++GNKWS TKW H
Subjt:  LLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH

Q9LN20 Probable prolyl 4-hydroxylase 34.6e-10365.03Show/hide
Query:  KGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFS-PTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLI
        K ++ +FQ RKWST  L  + M  +L + + M +AF  FS P  +  S+ +      +     S+GLGKR DQW E +SWEPRAFVYHNFLSKEEC YLI
Subjt:  KGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFS-PTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLI

Query:  SLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIATLLMYL
        SLA P+M KSTVVD +TGK KDSR RTSSG FL RG++KI+  IEKRIAD+TFIP +HGE LQ+LHYE GQKY+ HYD+  +EF  + GGQR+AT+LMYL
Subjt:  SLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIATLLMYL

Query:  SDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH-GRFEI
        SDVEEGGETVFPAA  NFSS+P +NELSECGK GLSVKP+MGDALLFW++RP+ TLDPTSLHG CPVIRGNKWS TKWMH G ++I
Subjt:  SDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH-GRFEI

Arabidopsis top hitse value%identityAlignment
AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.3e-10465.03Show/hide
Query:  KGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFS-PTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLI
        K ++ +FQ RKWST  L  + M  +L + + M +AF  FS P  +  S+ +      +     S+GLGKR DQW E +SWEPRAFVYHNFLSKEEC YLI
Subjt:  KGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFS-PTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLI

Query:  SLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIATLLMYL
        SLA P+M KSTVVD +TGK KDSR RTSSG FL RG++KI+  IEKRIAD+TFIP +HGE LQ+LHYE GQKY+ HYD+  +EF  + GGQR+AT+LMYL
Subjt:  SLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIATLLMYL

Query:  SDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH-GRFEI
        SDVEEGGETVFPAA  NFSS+P +NELSECGK GLSVKP+MGDALLFW++RP+ TLDPTSLHG CPVIRGNKWS TKWMH G ++I
Subjt:  SDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH-GRFEI

AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.5e-8856.49Show/hide
Query:  MAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFS-PTESSHSNLLHRLASVQHRAVHSDGLGK-REDQWVEFISWEPRAFVYHNFLSKEE
        MA    +++++Q RK  +       + +LL + I + +     S P  + +S+  + L ++  ++  S G  +   ++WVE ISWEPRA VYHNFL+ EE
Subjt:  MAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFS-PTESSHSNLLHRLASVQHRAVHSDGLGK-REDQWVEFISWEPRAFVYHNFLSKEE

Query:  CLYLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIAT
        C +LISLA P M KSTVVD KTG  KDSR RTSSG FL RG +++V  IEKRI+DFTFIPVE+GE LQ+LHY+VGQKY+ HYD+  +EF  + GGQRIAT
Subjt:  CLYLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIAT

Query:  LLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH
        +LMYLSDV++GGETVFPAA GN S++P WNELS+CGK GLSV PK  DALLFW +RP+ +LDP+SLHG CPV++GNKWS TKW H
Subjt:  LLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH

AT3G04590.2 AT hook motif DNA-binding family protein2.2e-6846.93Show/hide
Query:  SSYFHHH-QHHHQSPTT------------SPTNGLL---PSTHHLSSADATTHVLYPHSVPSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASS
        S YFHH  QHHH  PTT            S  NGL    P   H  +  +++  +YPHSVPS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK A+++S
Subjt:  SSYFHHH-QHHHQSPTT------------SPTNGLL---PSTHHLSSADATTHVLYPHSVPSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASS

Query:  HSSSKAKKDL--VSSSSLNAVSASSKKSQLAALGNAGQGFAPQVIDVAAGEESGAVVGWRLKFMGNLNDREIEGCMAIWSKDVGQKIMLFMQQCKREICI
         SS+K +++L  V+  +++  S SSKKSQL ++G  GQ F P ++++A GE                              DV QKIM+F  Q K E+C+
Subjt:  HSSSKAKKDL--VSSSSLNAVSASSKKSQLAALGNAGQGFAPQVIDVAAGEESGAVVGWRLKFMGNLNDREIEGCMAIWSKDVGQKIMLFMQQCKREICI

Query:  LSASGLISNASLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKE-AGGGIKGD--
        LSASG ISNASLRQPA SGGN+ YEG++EI+SL GSY+RT+ GGK+GGLSV LS++DG IIGG +G  L AAGPVQVI+GTF  D  K+ AG G KGD  
Subjt:  LSASGLISNASLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKE-AGGGIKGD--

Query:  TSAGKLSSPTGGTSMSGLRYGSNID-LGGNQVRGNNE------HQ-GI-GESHFLLQ-PRGVNLTSPR-SNWRTG
         S  +L+SP     + G+ +   ++  G N +RGN+E      HQ G+ G  HF++Q P+G+++T  R S WR G
Subjt:  TSAGKLSSPTGGTSMSGLRYGSNID-LGGNQVRGNNE------HQ-GI-GESHFLLQ-PRGVNLTSPR-SNWRTG

AT4G35810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.6e-9058.3Show/hide
Query:  KGKYIKFQGRK-WSTFKLSKIVMAL---LLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECL
        K K ++ + RK +ST   + +V+ L   L+ +G+ +F +    + T S   +L   + ++Q R    D      D+W+E ISWEPRAFVYHNFL+ EEC 
Subjt:  KGKYIKFQGRK-WSTFKLSKIVMAL---LLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECL

Query:  YLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIATLL
        +LISLA P M KS VVD+KTGK  DSR RTSSG FLNRG ++IV  IE RI+DFTFIP E+GE LQ+LHYEVGQ+Y+ H+D+  +EF +RKGGQRIAT+L
Subjt:  YLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIATLL

Query:  MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH
        MYLSDV+EGGETVFPAA+GN S +P W+ELS+CGK GLSV PK  DALLFW+++P+ +LDP+SLHG CPVI+GNKWS TKW H
Subjt:  MYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH

AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.6e-9563.5Show/hide
Query:  LSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGL-GKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIK
        +S  V+ +LLA GI          P+ ++ S+  + L S+  + +   G    + ++WVE ISWEPRA VYHNFL+KEEC YLI LA P+M+KSTVVD K
Subjt:  LSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGL-GKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIK

Query:  TGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIATLLMYLSDVEEGGETVFPAAEG
        TGK  DSR RTSSG FL RG++K +  IEKRI+DFTFIPVEHGE LQ+LHYE+GQKY+ HYD+  +E+  R GGQRIAT+LMYLSDVEEGGETVFPAA+G
Subjt:  TGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIATLLMYLSDVEEGGETVFPAAEG

Query:  NFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWM
        N+S++P WNELSECGKGGLSVKPKMGDALLFW++ P+ TLDP+SLHG C VI+GNKWS TKW+
Subjt:  NFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACCCAATGACAACCAGCTCAGCTCCTACTTCCACCACCATCAACACCACCATCAAAGTCCCACCACATCGCCGACCAATGGCCTTCTACCCTCCACCCACCACCT
CTCCTCCGCCGACGCCACCACCCATGTCCTTTACCCTCACTCGGTTCCCTCCGCCGCCGTCTCCTCCTCTCCTCTCGAGCCCGGTCGCCGGAAGAGAGGTCGCCCGCGGA
AGTACGGCACGCCGGAGGAGGCTTTAGCGGCTAAGAAAGCTGCTACGGCGTCGTCTCACTCTTCCTCCAAGGCCAAGAAGGACCTCGTTTCTTCCTCTTCTCTTAATGCC
GTTTCCGCTTCTTCGAAGAAATCTCAGTTGGCTGCACTTGGTAATGCAGGCCAAGGTTTTGCGCCACAGGTTATTGATGTGGCAGCTGGTGAGGAAAGTGGAGCTGTGGT
GGGCTGGCGGCTTAAGTTCATGGGAAACCTTAATGATAGAGAGATTGAAGGGTGTATGGCTATCTGGAGCAAGGACGTGGGCCAGAAAATTATGCTGTTTATGCAACAAT
GTAAGCGGGAAATCTGTATCCTTTCCGCATCTGGTTTGATCTCCAATGCATCTCTCCGTCAGCCGGCCTCATCTGGTGGCAATGTTACGTATGAGGGCCGTTTCGAGATT
GTTTCGTTATGTGGATCTTATGTACGAACTGACATTGGAGGAAAGACTGGTGGTCTTAGCGTATGTTTGTCGAGTGCTGATGGCCATATCATAGGAGGGGGAGTTGGTGG
ACCGTTGATGGCTGCTGGACCCGTGCAGGTTATTGTTGGTACTTTCATAACCGACCCGACAAAGGAAGCTGGTGGTGGGATTAAAGGCGATACATCTGCTGGCAAGTTGT
CCTCACCCACTGGTGGGACATCGATGTCAGGTCTACGCTATGGTTCGAACATCGACTTGGGAGGTAATCAAGTCAGGGGAAACAATGAGCACCAAGGTATCGGGGAGAGT
CATTTCTTGCTTCAACCCCGGGGAGTGAACCTGACATCTCCTCGATCCAACTGGAGAACAGGTCTGGATGGCACCACCACTGCTTATGATTTTACAGAAGTAGCAGCTTA
TCGGAATCGGGGTTTGATTCTTCTGAAGCTTCTTCGGTCTTTAATGGCGGTGTTGAAAGGGAAATACATCAAGTTTCAGGGCCGGAAATGGTCCACATTCAAGCTTTCCA
AGATAGTCATGGCCTTGCTTTTGGCACTTGGGATTTCCATGTTCATCGCTTTCCGATTCTTCTCTCCTACTGAAAGTTCTCATAGCAATCTACTCCACCGGCTCGCTTCC
GTCCAGCATAGAGCCGTTCATAGTGATGGATTGGGGAAGAGAGAGGATCAATGGGTTGAGTTCATTTCATGGGAGCCTAGGGCTTTTGTTTATCACAATTTCTTGTCCAA
GGAAGAATGCTTGTATTTGATTAGTCTTGCAACACCTTACATGAAAAAATCAACTGTGGTTGATATAAAAACTGGCAAGATTAAAGATAGCAGGACGCGCACCAGTTCCG
GGATGTTTCTGAATAGAGGGCAGAACAAAATTGTCAGCAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCCGTAGAGCATGGAGAAGCACTTCAAATTCTGCAC
TATGAAGTCGGCCAGAAGTATGATGCTCACTATGATTTCATTTCTGAGGAGTTCATCCGAAAAGGAGGCCAAAGAATAGCCACTCTTCTCATGTATCTGTCAGACGTCGA
AGAAGGGGGTGAGACAGTGTTCCCAGCAGCCGAAGGGAACTTCAGCTCTTTGCCCGGGTGGAATGAACTGTCTGAATGTGGTAAAGGTGGACTATCTGTAAAACCAAAGA
TGGGTGATGCATTATTGTTCTGGACCCTGAGGCCTAATAATACCTTAGATCCTACAAGTTTGCATGGTGCTTGCCCTGTCATAAGAGGGAACAAGTGGTCATGTACAAAG
TGGATGCATGGCCGTTTCGAGATTGTTTCGTTATGTGGATCTTATGTACGAGCTGACACTGGAGGAAAGACGGGTGGTCTTAGCGTATGTTTGTCGAGTGCTGATGGCTA
TATCATAGGAGGGGGAGTTGGTGGACCGTTGAAGGCCGCTGGACCCGTGCAGGTTAACATACAGGACGAGACGAGACAAGACGTGACTGCACAACCGCTGCAACGCCAGC
AATCTTCGCGTCACTGTGTTAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATTATGGATAAATGATAATATGTACACAAAAAAGGAAAAAAAAGTTATATTTATTTTATTTATTTATTTTAATTTTCCAAAACATACGCTCTCTCTCTCATCTCAAAAAT
GGAACCCAATGACAACCAGCTCAGCTCCTACTTCCACCACCATCAACACCACCATCAAAGTCCCACCACATCGCCGACCAATGGCCTTCTACCCTCCACCCACCACCTCT
CCTCCGCCGACGCCACCACCCATGTCCTTTACCCTCACTCGGTTCCCTCCGCCGCCGTCTCCTCCTCTCCTCTCGAGCCCGGTCGCCGGAAGAGAGGTCGCCCGCGGAAG
TACGGCACGCCGGAGGAGGCTTTAGCGGCTAAGAAAGCTGCTACGGCGTCGTCTCACTCTTCCTCCAAGGCCAAGAAGGACCTCGTTTCTTCCTCTTCTCTTAATGCCGT
TTCCGCTTCTTCGAAGAAATCTCAGTTGGCTGCACTTGGTAATGCAGGCCAAGGTTTTGCGCCACAGGTTATTGATGTGGCAGCTGGTGAGGAAAGTGGAGCTGTGGTGG
GCTGGCGGCTTAAGTTCATGGGAAACCTTAATGATAGAGAGATTGAAGGGTGTATGGCTATCTGGAGCAAGGACGTGGGCCAGAAAATTATGCTGTTTATGCAACAATGT
AAGCGGGAAATCTGTATCCTTTCCGCATCTGGTTTGATCTCCAATGCATCTCTCCGTCAGCCGGCCTCATCTGGTGGCAATGTTACGTATGAGGGCCGTTTCGAGATTGT
TTCGTTATGTGGATCTTATGTACGAACTGACATTGGAGGAAAGACTGGTGGTCTTAGCGTATGTTTGTCGAGTGCTGATGGCCATATCATAGGAGGGGGAGTTGGTGGAC
CGTTGATGGCTGCTGGACCCGTGCAGGTTATTGTTGGTACTTTCATAACCGACCCGACAAAGGAAGCTGGTGGTGGGATTAAAGGCGATACATCTGCTGGCAAGTTGTCC
TCACCCACTGGTGGGACATCGATGTCAGGTCTACGCTATGGTTCGAACATCGACTTGGGAGGTAATCAAGTCAGGGGAAACAATGAGCACCAAGGTATCGGGGAGAGTCA
TTTCTTGCTTCAACCCCGGGGAGTGAACCTGACATCTCCTCGATCCAACTGGAGAACAGGTCTGGATGGCACCACCACTGCTTATGATTTTACAGAAGTAGCAGCTTATC
GGAATCGGGGTTTGATTCTTCTGAAGCTTCTTCGGTCTTTAATGGCGGTGTTGAAAGGGAAATACATCAAGTTTCAGGGCCGGAAATGGTCCACATTCAAGCTTTCCAAG
ATAGTCATGGCCTTGCTTTTGGCACTTGGGATTTCCATGTTCATCGCTTTCCGATTCTTCTCTCCTACTGAAAGTTCTCATAGCAATCTACTCCACCGGCTCGCTTCCGT
CCAGCATAGAGCCGTTCATAGTGATGGATTGGGGAAGAGAGAGGATCAATGGGTTGAGTTCATTTCATGGGAGCCTAGGGCTTTTGTTTATCACAATTTCTTGTCCAAGG
AAGAATGCTTGTATTTGATTAGTCTTGCAACACCTTACATGAAAAAATCAACTGTGGTTGATATAAAAACTGGCAAGATTAAAGATAGCAGGACGCGCACCAGTTCCGGG
ATGTTTCTGAATAGAGGGCAGAACAAAATTGTCAGCAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCCGTAGAGCATGGAGAAGCACTTCAAATTCTGCACTA
TGAAGTCGGCCAGAAGTATGATGCTCACTATGATTTCATTTCTGAGGAGTTCATCCGAAAAGGAGGCCAAAGAATAGCCACTCTTCTCATGTATCTGTCAGACGTCGAAG
AAGGGGGTGAGACAGTGTTCCCAGCAGCCGAAGGGAACTTCAGCTCTTTGCCCGGGTGGAATGAACTGTCTGAATGTGGTAAAGGTGGACTATCTGTAAAACCAAAGATG
GGTGATGCATTATTGTTCTGGACCCTGAGGCCTAATAATACCTTAGATCCTACAAGTTTGCATGGTGCTTGCCCTGTCATAAGAGGGAACAAGTGGTCATGTACAAAGTG
GATGCATGGCCGTTTCGAGATTGTTTCGTTATGTGGATCTTATGTACGAGCTGACACTGGAGGAAAGACGGGTGGTCTTAGCGTATGTTTGTCGAGTGCTGATGGCTATA
TCATAGGAGGGGGAGTTGGTGGACCGTTGAAGGCCGCTGGACCCGTGCAGGTTAACATACAGGACGAGACGAGACAAGACGTGACTGCACAACCGCTGCAACGCCAGCAA
TCTTCGCGTCACTGTGTTAGCTAA
Protein sequenceShow/hide protein sequence
MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKKDLVSSSSLNA
VSASSKKSQLAALGNAGQGFAPQVIDVAAGEESGAVVGWRLKFMGNLNDREIEGCMAIWSKDVGQKIMLFMQQCKREICILSASGLISNASLRQPASSGGNVTYEGRFEI
VSLCGSYVRTDIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKEAGGGIKGDTSAGKLSSPTGGTSMSGLRYGSNIDLGGNQVRGNNEHQGIGES
HFLLQPRGVNLTSPRSNWRTGLDGTTTAYDFTEVAAYRNRGLILLKLLRSLMAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLAS
VQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILH
YEVGQKYDAHYDFISEEFIRKGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTK
WMHGRFEIVSLCGSYVRADTGGKTGGLSVCLSSADGYIIGGGVGGPLKAAGPVQVNIQDETRQDVTAQPLQRQQSSRHCVS