CuGenDBv2

CmaCh14G016540 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh14G016540
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: Procollagen-proline 4-dioxygenase
Genome location: Cma_Chr14:12395828..12403613
RNA-Seq Expression: CmaCh14G016540
Synteny: CmaCh14G016540
Gene Ontology terms:
GO:0000160 - phosphorelay signal transduction system (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0031418 - L-ascorbic acid binding (molecular function)
GO:0016301 - kinase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0000166 - nucleotide binding (molecular function)
InterPro domains:
IPR045054 - Prolyl 4-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR036641 - HPT domain superfamily
IPR008207 - Signal transduction histidine kinase, phosphotransfer (Hpt) domain
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR003582 - ShKT domain
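
The genome location listed in the record above can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("Cma_Chr14:12395828..12403613")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene region spans 7786 bp on Cma_Chr14, which is longer than the mRNA given further down, as expected for a gene with introns.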


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAF4393790.1 hypothetical protein G4B88_007776 [Cannabis sativa] | e value: 3.5e-167 | % identity: 65.8
Query:  PLFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVS
        PLFF           + SSYAGSASSI+NPAKVKQISW PRAF+YEGFLTDLECDHLIS+AK+ELKRSAVAD+ SGES++SE+RTSSG FI K+KDPIV+
Subjt:  PLFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVS

Query:  GIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPR
        GIEDKIS WTFLPKENGEDIQVLRYE GQKY+ H+DYFAD+VNI RGGHR+ATVLMYL+DV +GGETVFP A E+ R + S T ED S+CAKKG+AVK R
Subjt:  GIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPR

Query:  KGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSC---------
        +GDALLFFSL P A+PDT SLH GCPVIEGEKWSATKWI V+ FD+ V     C+D N SCERWA LGEC  N EYMVGSPE PGYCR+SC         
Subjt:  KGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSC---------

Query:  ----KSTEASVSKNGGGIDAETAG--------SGTLDSQYLQLLQLQDESNPTFVSEVATLFFEDTEELLNKLRVALLQPSVDFKKIDDHVHQLKGSSSS
            K  E + +   G +  +              LDSQ+LQLLQLQDESNP FV EV +LFF+DTE+LLN L  AL Q  VDFK++D HVHQLKGSSSS
Subjt:  ----KSTEASVSKNGGGIDAETAG--------SGTLDSQYLQLLQLQDESNPTFVSEVATLFFEDTEELLNKLRVALLQPSVDFKKIDDHVHQLKGSSSS

Query:  IGALRVKNACIDFRSACEQQSPEWCSRCLQQVEQAFYGVKDKLSYLYALEQRILNAGGSIPV
        IGA RVKN C+ FR+ CE+Q+ + C RCLQQV+Q +Y VK+KL  L+ LEQ+I+ AGGSIP+
Subjt:  IGALRVKNACIDFRSACEQQSPEWCSRCLQQVEQAFYGVKDKLSYLYALEQRILNAGGSIPV

KAF9678089.1 hypothetical protein SADUNF_Sadunf08G0175600 [Salix dunnii] | e value: 7.0e-168 | % identity: 68.41
Query:  FFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVSGI
        F F LSISL+L + S SY  ++SSI+NPAKVKQ+S  PRAFVY+GFLTDLECDHLIS+AK+ELKRSAVADN SG+SK+SE+RTSSG FI K+KDPIVSGI
Subjt:  FFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVSGI

Query:  EDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKG
        EDKI+ WTFLPKENGEDIQVLRYE+GQKYD H+DYF+D+VNIARGGHR+ATVLMYL+DV++GGETVFPSAEE+ RR+AS ++EDLS+CA+KGIAVKP +G
Subjt:  EDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKG

Query:  DALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKSTEASVSKNG
        DALLFFSL+P AVPDTSSLH GCPVIEGEKWSATKWI V+ FD+ +    NC+D+N  C RWA LGEC  NPEYMVGSP  PGYCR+SCK          
Subjt:  DALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKSTEASVSKNG

Query:  GGIDAETAGSGTLDSQYLQLLQLQDESNPTFVSEVATLFFEDTEELLNKLRVALLQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPE
                  G LD+Q+ QL  LQD+SNP FV+EV +LFFED+E LL  L   L Q ++DFKK+D HVHQ KGSSSSIGA RVKN CI FR+ CE+Q+ E
Subjt:  GGIDAETAGSGTLDSQYLQLLQLQDESNPTFVSEVATLFFEDTEELLNKLRVALLQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPE

Query:  WCSRCLQQVEQAFYGVKDKLSYLYALEQRILNAGGSIPVD
         C RCLQQV+Q +Y VK KL  L  LEQ+I+ AGGSIP++
Subjt:  WCSRCLQQVEQAFYGVKDKLSYLYALEQRILNAGGSIPVD

XP_022955844.1 probable prolyl 4-hydroxylase 4 [Cucurbita moschata] | e value: 2.9e-166 | % identity: 97.67
Query:  MEKFCSFNPLFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQ
        MEKFCSFN LFFFSLSISLLL RASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLE DHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFI+
Subjt:  MEKFCSFNPLFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQ

Query:  KSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAK
        KSKDPIVSGIEDKI+AWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAK
Subjt:  KSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGEC  NPEYMVGSPEFPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCK

XP_022980799.1 probable prolyl 4-hydroxylase 4 [Cucurbita maxima] | e value: 4.7e-172 | % identity: 100
Query:  MEKFCSFNPLFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQ
        MEKFCSFNPLFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQ
Subjt:  MEKFCSFNPLFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQ

Query:  KSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAK
        KSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAK
Subjt:  KSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCK

XP_023526540.1 probable prolyl 4-hydroxylase 4 [Cucurbita pepo subsp. pepo] | e value: 1.3e-166 | % identity: 97.33
Query:  MEKFCSFNPLFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQ
        MEKFCSFN LFFFSLSISLLL  ASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFI+
Subjt:  MEKFCSFNPLFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQ

Query:  KSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAK
        KSKDPIVSGIEDKI+AWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVK+GGETVFPSAEESQRRQASETNEDLSDCAK
Subjt:  KSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDY+NCSDENASCERWAELGEC DNPEYMVGSPEFPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCK

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A1Q3B5T1 Procollagen-proline 4-dioxygenase | e value: 7.3e-163 | % identity: 63.08
Query:  FFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVSGI
        F + L IS+++ +  SS+  S SS+++P+KVKQIS  PRA+VYEGFLT LECDHLIS+AK+ELKRSAVADNLSG+SK+SE+RTSSG FI K KDPIV GI
Subjt:  FFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVSGI

Query:  EDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKG
        EDKIS WTFLPKENGEDIQVLRYE GQKY+ HFDYF D+VNIARGGHR+ATVL+YL+DV +GGETVFPSAE S RR+ S TN DLS+C +KG+AVKPR+G
Subjt:  EDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKG

Query:  DALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKSTE-------
        DALLFFSLHPNA+PD SSLH GCPVIEGEKWSATKWI V+ FD+ +    NC+D N SCE+WA LGEC  N EYMVGSPE PGYCR+SCK  E       
Subjt:  DALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKSTE-------

Query:  ------ASVSK------NGGGIDAETAG-------------------SGTLDSQYLQLLQLQDESNPTFVSEVATLFFEDTEELLNKLRVALLQPSVDFK
               SVSK        G I  +  G                    G LD Q+LQL QLQDESNP FV EV +LFF+D+E LLN L  AL QPSVDF 
Subjt:  ------ASVSK------NGGGIDAETAG-------------------SGTLDSQYLQLLQLQDESNPTFVSEVATLFFEDTEELLNKLRVALLQPSVDFK

Query:  KIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPEWCSRCLQQVEQAFYGVKDKLSYLYALEQRILNAGGS
        ++D HVHQLKGSSSSI A R+KNA + FR+ CE+Q+ E C RCLQQ++Q +Y  ++ L  L+ LEQ+I+ AGGS
Subjt:  KIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPEWCSRCLQQVEQAFYGVKDKLSYLYALEQRILNAGGS

A0A6J1GXF3 Procollagen-proline 4-dioxygenase | e value: 1.4e-166 | % identity: 97.67
Query:  MEKFCSFNPLFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQ
        MEKFCSFN LFFFSLSISLLL RASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLE DHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFI+
Subjt:  MEKFCSFNPLFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQ

Query:  KSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAK
        KSKDPIVSGIEDKI+AWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAK
Subjt:  KSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGEC  NPEYMVGSPEFPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCK

A0A6J1J084 Procollagen-proline 4-dioxygenase | e value: 2.3e-172 | % identity: 100
Query:  MEKFCSFNPLFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQ
        MEKFCSFNPLFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQ
Subjt:  MEKFCSFNPLFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQ

Query:  KSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAK
        KSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAK
Subjt:  KSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCK

A0A7J6HHC7 Uncharacterized protein | e value: 1.7e-167 | % identity: 65.8
Query:  PLFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVS
        PLFF           + SSYAGSASSI+NPAKVKQISW PRAF+YEGFLTDLECDHLIS+AK+ELKRSAVAD+ SGES++SE+RTSSG FI K+KDPIV+
Subjt:  PLFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVS

Query:  GIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPR
        GIEDKIS WTFLPKENGEDIQVLRYE GQKY+ H+DYFAD+VNI RGGHR+ATVLMYL+DV +GGETVFP A E+ R + S T ED S+CAKKG+AVK R
Subjt:  GIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPR

Query:  KGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSC---------
        +GDALLFFSL P A+PDT SLH GCPVIEGEKWSATKWI V+ FD+ V     C+D N SCERWA LGEC  N EYMVGSPE PGYCR+SC         
Subjt:  KGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSC---------

Query:  ----KSTEASVSKNGGGIDAETAG--------SGTLDSQYLQLLQLQDESNPTFVSEVATLFFEDTEELLNKLRVALLQPSVDFKKIDDHVHQLKGSSSS
            K  E + +   G +  +              LDSQ+LQLLQLQDESNP FV EV +LFF+DTE+LLN L  AL Q  VDFK++D HVHQLKGSSSS
Subjt:  ----KSTEASVSKNGGGIDAETAG--------SGTLDSQYLQLLQLQDESNPTFVSEVATLFFEDTEELLNKLRVALLQPSVDFKKIDDHVHQLKGSSSS

Query:  IGALRVKNACIDFRSACEQQSPEWCSRCLQQVEQAFYGVKDKLSYLYALEQRILNAGGSIPV
        IGA RVKN C+ FR+ CE+Q+ + C RCLQQV+Q +Y VK+KL  L+ LEQ+I+ AGGSIP+
Subjt:  IGALRVKNACIDFRSACEQQSPEWCSRCLQQVEQAFYGVKDKLSYLYALEQRILNAGGSIPV

A0A7J7GMG3 Uncharacterized protein | e value: 1.9e-163 | % identity: 62.08
Query:  LFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVSG
        L  F  S+ LL+  +S  Y  S+SSI+NP+K KQ+SW PRAFVYEGFLTD EC+HLISIAK ELKRS+VADN+SG+SK+S++RTSSG FI K+KDPIVSG
Subjt:  LFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVSG

Query:  IEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK
        IE+KI+ WTFLPKENGE IQVLRYE+GQKYD H+DYF D+VN+ARGGHR+ATVLMYLSDV +GGETVFPSAEE+    +S +++DLS+CAKKGIAVKPRK
Subjt:  IEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKSTEASVSKN
        GDALLFFSLHP A+PD  SLHGGCPVIEGEKWSATKWI V+ FD++V    NC+D N +CERWA LGEC  NPEYMVG+PE PGYCR+SC+ +  S +  
Subjt:  GDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKSTEASVSKN

Query:  GGGIDAETAGS----------------------------------------GTLDSQYLQLLQLQDESNPTFVSEVATLFFEDTEELLNKLRVALLQPSV
          G D     +                                        G LDSQ+ QL QLQDESNP FV EV +LFFED+E LLN L  AL Q  V
Subjt:  GGGIDAETAGS----------------------------------------GTLDSQYLQLLQLQDESNPTFVSEVATLFFEDTEELLNKLRVALLQPSV

Query:  DFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPEWCSRCLQQVEQAFYGVKDKLSYLYALEQRILNAGGSIPV
        DFKK+D +VHQLKGSSSSIGA RVKNAC+ FR+ CE+ + E C  CLQQV+Q +  VK+KL  L+ LEQ+IL AGGS+P+
Subjt:  DFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPEWCSRCLQQVEQAFYGVKDKLSYLYALEQRILNAGGSIPV

SwissProt top hits (columns: hit, e value, % identity, alignment)
F4J0A8 Probable prolyl 4-hydroxylase 6 | e value: 3.3e-88 | % identity: 56.16
Query:  FFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRS-AVADNLSGESKVSEIRTSSGAFIQKSKDPIVSG
        +F + S+SLLL     S   S S  V+P ++ Q+SW PRAF+Y+GFL+D ECDHLI +AK +L++S  VAD  SGES+ SE+RTSSG F+ K +D IV+ 
Subjt:  FFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRS-AVADNLSGESKVSEIRTSSGAFIQKSKDPIVSG

Query:  IEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK
        +E K++AWTFLP+ENGE +Q+L YE GQKYD HFDYF D+  +  GGHR+ATVLMYLS+V +GGETVFP+    + +     ++  S CAK+G AVKPRK
Subjt:  IEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKS
        GDALLFF+LH N   D +SLHG CPVIEGEKWSAT+WI V  F +     + C D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCK+
Subjt:  GDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKS

F4JAU3 Prolyl 4-hydroxylase 2 | e value: 7.9e-122 | % identity: 72.47
Query:  LSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVSGIEDKI
        ++I L+L ++S+    S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLIS+AK  L+RSAVADN +GES+VS++RTSSG FI K KDPIVSGIEDK+
Subjt:  LSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVSGIEDKI

Query:  SAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALL
        S WTFLPKENGED+QVLRYE+GQKYDAHFDYF D+VNIARGGHR+ATVL+YLS+V +GGETVFP A+E  RR  SE  +DLSDCAKKGIAVKP+KG+ALL
Subjt:  SAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALL

Query:  FFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKS
        FF+L  +A+PD  SLHGGCPVIEGEKWSATKWI V+ FD+I+    NC+D N SCERWA LGEC  NPEYMVG+PE PG CR+SCK+
Subjt:  FFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKS

Q8L970 Probable prolyl 4-hydroxylase 7 | e value: 1.1e-96 | % identity: 56.09
Query:  FCSFNPLFFFSLSI-----SLLLGRASSSYAG-------SASSI-VNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSE
        F +F+  F F+L +     +  L R+S++  G       SASS   +P +V Q+SW PR F+YEGFL+D ECDH I +AK +L++S VADN SGES  SE
Subjt:  FCSFNPLFFFSLSI-----SLLLGRASSSYAG-------SASSI-VNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSE

Query:  IRTSSGAFIQKSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASE
        +RTSSG F+ K +D IVS +E K++AWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLS+V++GGETVFP      + +A++
Subjt:  IRTSSGAFIQKSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASE

Query:  TNED-LSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSP
          +D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+WI V  F++       C DEN SCE+WA+ GEC  NP YMVGS 
Subjt:  TNED-LSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSP

Query:  EFPGYCRKSCKS
        +  GYCRKSCK+
Subjt:  EFPGYCRKSCKS

Q8LAN3 Probable prolyl 4-hydroxylase 4 | e value: 7.6e-125 | % identity: 74.66
Query:  LFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVSG
        L  F    S+LL ++S+S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++S+AKA LKRSAVADN SGESK SE+RTSSG FI K KDPIVSG
Subjt:  LFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVSG

Query:  IEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK
        IEDKIS WTFLPKENGEDIQVLRYE+GQKYDAHFDYF D+VNI RGGHRMAT+LMYLS+V +GGETVFP AE   RR  SE  EDLSDCAK+GIAVKPRK
Subjt:  IEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKS
        GDALLFF+LHP+A+PD  SLHGGCPVIEGEKWSATKWI V+ FD+IV    NC+D N SCERWA LGEC  NPEYMVG+ E PGYCR+SCK+
Subjt:  GDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKS

Q9LN20 Probable prolyl 4-hydroxylase 3 | e value: 1.2e-64 | % identity: 54.55
Query:  ISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHF
        +SW PRAFVY  FL+  EC++LIS+AK  + +S V D+ +G+SK S +RTSSG F+++ +D I+  IE +I+ +TF+P ++GE +QVL YE GQKY+ H+
Subjt:  ISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHF

Query:  DYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSA
        DYF DE N   GG RMAT+LMYLSDV+ GGETVFP+A  +    +     +LS+C KKG++VKPR GDALLF+S+ P+A  D +SLHGGCPVI G KWS+
Subjt:  DYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSA

Query:  TKWIRVNPF
        TKW+ V  +
Subjt:  TKWIRVNPF

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT3G06300.1 P4H isoform 2 | e value: 5.6e-123 | % identity: 72.47
Query:  LSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVSGIEDKI
        ++I L+L ++S+    S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLIS+AK  L+RSAVADN +GES+VS++RTSSG FI K KDPIVSGIEDK+
Subjt:  LSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVSGIEDKI

Query:  SAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALL
        S WTFLPKENGED+QVLRYE+GQKYDAHFDYF D+VNIARGGHR+ATVL+YLS+V +GGETVFP A+E  RR  SE  +DLSDCAKKGIAVKP+KG+ALL
Subjt:  SAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALL

Query:  FFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKS
        FF+L  +A+PD  SLHGGCPVIEGEKWSATKWI V+ FD+I+    NC+D N SCERWA LGEC  NPEYMVG+PE PG CR+SCK+
Subjt:  FFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKS

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase | e value: 8.1e-98 | % identity: 56.09
Query:  FCSFNPLFFFSLSI-----SLLLGRASSSYAG-------SASSI-VNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSE
        F +F+  F F+L +     +  L R+S++  G       SASS   +P +V Q+SW PR F+YEGFL+D ECDH I +AK +L++S VADN SGES  SE
Subjt:  FCSFNPLFFFSLSI-----SLLLGRASSSYAG-------SASSI-VNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSE

Query:  IRTSSGAFIQKSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASE
        +RTSSG F+ K +D IVS +E K++AWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLS+V++GGETVFP      + +A++
Subjt:  IRTSSGAFIQKSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASE

Query:  TNED-LSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSP
          +D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+WI V  F++       C DEN SCE+WA+ GEC  NP YMVGS 
Subjt:  TNED-LSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSP

Query:  EFPGYCRKSCKS
        +  GYCRKSCK+
Subjt:  EFPGYCRKSCKS

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase | e value: 3.3e-91 | % identity: 53.44
Query:  FCSFNPLFFFSLSI-----SLLLGRASSSYAG-------SASSI-VNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGES----
        F +F+  F F+L +     +  L R+S++  G       SASS   +P +V Q+SW PR F+YEGFL+D ECDH I +AK +L++S VADN SGES    
Subjt:  FCSFNPLFFFSLSI-----SLLLGRASSSYAG-------SASSI-VNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGES----

Query:  -KVSEIRTSSGAFIQKSK---DPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEE
          VS +R SS           D IVS +E K++AWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLS+V++GGETVFP    
Subjt:  -KVSEIRTSSGAFIQKSK---DPIVSGIEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEE

Query:  SQRRQASETNED-LSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDN
          + +A++  +D  ++CAK+G AVKPRKGDALLFF+LHPNA  D++SLHG CPV+EGEKWSAT+WI V  F++       C DEN SCE+WA+ GEC  N
Subjt:  SQRRQASETNED-LSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDN

Query:  PEYMVGSPEFPGYCRKSCKS
        P YMVGS +  GYCRKSCK+
Subjt:  PEYMVGSPEFPGYCRKSCKS

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase | e value: 2.4e-89 | % identity: 56.16
Query:  FFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRS-AVADNLSGESKVSEIRTSSGAFIQKSKDPIVSG
        +F + S+SLLL     S   S S  V+P ++ Q+SW PRAF+Y+GFL+D ECDHLI +AK +L++S  VAD  SGES+ SE+RTSSG F+ K +D IV+ 
Subjt:  FFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRS-AVADNLSGESKVSEIRTSSGAFIQKSKDPIVSG

Query:  IEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK
        +E K++AWTFLP+ENGE +Q+L YE GQKYD HFDYF D+  +  GGHR+ATVLMYLS+V +GGETVFP+    + +     ++  S CAK+G AVKPRK
Subjt:  IEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKS
        GDALLFF+LH N   D +SLHG CPVIEGEKWSAT+WI V  F +     + C D++ SC+ WA+ GEC  NP YMVGS    G+CRKSCK+
Subjt:  GDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKS

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | e value: 5.4e-126 | % identity: 74.66
Query:  LFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVSG
        L  F    S+LL ++S+S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++S+AKA LKRSAVADN SGESK SE+RTSSG FI K KDPIVSG
Subjt:  LFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVSG

Query:  IEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK
        IEDKIS WTFLPKENGEDIQVLRYE+GQKYDAHFDYF D+VNI RGGHRMAT+LMYLS+V +GGETVFP AE   RR  SE  EDLSDCAK+GIAVKPRK
Subjt:  IEDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKS
        GDALLFF+LHP+A+PD  SLHGGCPVIEGEKWSATKWI V+ FD+IV    NC+D N SCERWA LGEC  NPEYMVG+ E PGYCR+SCK+
Subjt:  GDALLFFSLHPNAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKS


Sequences
CDS sequence
ATGGAGAAATTTTGCAGTTTCAATCCGCTGTTTTTCTTCTCATTATCGATTTCGTTGCTTCTCGGGCGAGCTTCAAGCTCCTATGCAGGTTCCGCTAGCTCAATTGTCAA
TCCTGCTAAAGTAAAGCAGATTTCATGGAATCCTCGGGCGTTTGTGTATGAAGGTTTTCTCACGGACTTAGAATGCGATCATCTCATCTCGATTGCTAAAGCTGAATTGA
AGAGATCTGCTGTTGCCGATAATTTGTCAGGAGAGAGCAAGGTCAGCGAGATCCGAACTAGTTCTGGGGCGTTTATTCAAAAATCCAAGGATCCTATTGTTTCTGGTATA
GAAGACAAAATTTCAGCATGGACATTTCTGCCAAAAGAAAATGGAGAAGACATTCAAGTGTTGAGATATGAATATGGGCAGAAGTATGATGCACACTTCGATTACTTTGC
CGACGAGGTTAATATTGCCCGTGGTGGACATCGAATGGCAACCGTTCTCATGTATCTTTCCGACGTAAAAAGAGGCGGTGAAACTGTGTTTCCTTCTGCAGAGGAATCTC
AAAGACGTCAGGCTTCTGAAACAAACGAAGATCTCTCAGACTGTGCAAAGAAAGGGATAGCAGTGAAACCGCGGAAAGGCGACGCTCTTCTGTTCTTCAGTCTTCATCCA
AATGCTGTTCCAGACACAAGTAGTCTGCATGGAGGGTGCCCTGTGATTGAAGGCGAGAAATGGTCAGCAACTAAGTGGATTCGTGTCAATCCTTTCGACCAGATTGTGGG
AGACTACATGAATTGCAGTGATGAGAATGCAAGTTGTGAGAGATGGGCTGAGCTCGGCGAGTGCAATGATAACCCAGAGTATATGGTGGGATCTCCTGAGTTTCCTGGCT
ACTGCAGAAAAAGTTGCAAGTCAACAGAAGCTTCTGTGAGTAAAAATGGAGGTGGGATTGATGCAGAGACAGCGGGTTCAGGCACTCTGGATAGTCAGTATTTGCAGCTT
TTGCAACTGCAAGATGAGAGTAATCCAACTTTCGTTTCTGAAGTGGCGACTCTTTTCTTTGAAGATACCGAGGAGCTTCTCAATAAACTGAGAGTCGCTCTATTACAGCC
ATCTGTGGACTTCAAAAAGATTGATGATCATGTACACCAGCTGAAGGGCAGCAGCTCCAGCATAGGTGCACTTAGAGTGAAGAATGCCTGCATTGACTTCCGGAGCGCCT
GCGAGCAACAAAGTCCAGAATGGTGTTCAAGATGTCTCCAACAAGTAGAGCAAGCATTCTATGGTGTAAAAGATAAGCTCAGTTATCTATATGCTCTGGAGCAACGGATT
TTGAATGCTGGTGGATCCATCCCAGTGGACTTGGGTTCCTAA
mRNA sequence
TGCTTCGTTCTCTTTCTCTACTGATCCGATTCAGTTCATGGAGAAATTTTGCAGTTTCAATCCGCTGTTTTTCTTCTCATTATCGATTTCGTTGCTTCTCGGGCGAGCTT
CAAGCTCCTATGCAGGTTCCGCTAGCTCAATTGTCAATCCTGCTAAAGTAAAGCAGATTTCATGGAATCCTCGGGCGTTTGTGTATGAAGGTTTTCTCACGGACTTAGAA
TGCGATCATCTCATCTCGATTGCTAAAGCTGAATTGAAGAGATCTGCTGTTGCCGATAATTTGTCAGGAGAGAGCAAGGTCAGCGAGATCCGAACTAGTTCTGGGGCGTT
TATTCAAAAATCCAAGGATCCTATTGTTTCTGGTATAGAAGACAAAATTTCAGCATGGACATTTCTGCCAAAAGAAAATGGAGAAGACATTCAAGTGTTGAGATATGAAT
ATGGGCAGAAGTATGATGCACACTTCGATTACTTTGCCGACGAGGTTAATATTGCCCGTGGTGGACATCGAATGGCAACCGTTCTCATGTATCTTTCCGACGTAAAAAGA
GGCGGTGAAACTGTGTTTCCTTCTGCAGAGGAATCTCAAAGACGTCAGGCTTCTGAAACAAACGAAGATCTCTCAGACTGTGCAAAGAAAGGGATAGCAGTGAAACCGCG
GAAAGGCGACGCTCTTCTGTTCTTCAGTCTTCATCCAAATGCTGTTCCAGACACAAGTAGTCTGCATGGAGGGTGCCCTGTGATTGAAGGCGAGAAATGGTCAGCAACTA
AGTGGATTCGTGTCAATCCTTTCGACCAGATTGTGGGAGACTACATGAATTGCAGTGATGAGAATGCAAGTTGTGAGAGATGGGCTGAGCTCGGCGAGTGCAATGATAAC
CCAGAGTATATGGTGGGATCTCCTGAGTTTCCTGGCTACTGCAGAAAAAGTTGCAAGTCAACAGAAGCTTCTGTGAGTAAAAATGGAGGTGGGATTGATGCAGAGACAGC
GGGTTCAGGCACTCTGGATAGTCAGTATTTGCAGCTTTTGCAACTGCAAGATGAGAGTAATCCAACTTTCGTTTCTGAAGTGGCGACTCTTTTCTTTGAAGATACCGAGG
AGCTTCTCAATAAACTGAGAGTCGCTCTATTACAGCCATCTGTGGACTTCAAAAAGATTGATGATCATGTACACCAGCTGAAGGGCAGCAGCTCCAGCATAGGTGCACTT
AGAGTGAAGAATGCCTGCATTGACTTCCGGAGCGCCTGCGAGCAACAAAGTCCAGAATGGTGTTCAAGATGTCTCCAACAAGTAGAGCAAGCATTCTATGGTGTAAAAGA
TAAGCTCAGTTATCTATATGCTCTGGAGCAACGGATTTTGAATGCTGGTGGATCCATCCCAGTGGACTTGGGTTCCTAAACTGACAAAATCCATGAAAACCAAGAGTTTT
CCATTCGTTCCTGTCGAGTCTCCTTTCTCTGTTCTCTAGGCTCTTAAAATGGCTGTTACTTGTGCTTGTACAAATTCAGAACCTTTGCCCTTTTCACTCAGAACATCAAT
GTTTTCTACAGCTGATTTCGACTTTTGATTTTTCAAGTCTGTTTGAGCTGAACTGTTTGTATTGCATGGAGTCCTGAAAGATTTTCTGGGTGTTTTCATAGAGGCAAATA
CTCTCTTTCATCTTTGTTCCCCTAGTATTCTGCTAATCAAAACATACCTGGTCATTTGTGGGTTTCATTTGAATTTTCTTGGATTTCCACTCTTATTTGAAACAAACAAA
ATCAACAAATGGACAAAGTAGGCCGCCAATGGACTAAGAACAGCATCAAGCTTAGATGATGACATAAATTGATAGGGGGATAATCTAGATCTTATTGTTTGTTGCTTTCT
ATGAACCACGAAATGGAAACCTTTTTTTATGTTTATACAAATGCCCCAAAAGTTTTCATCTCGGTGAATATCAGCAAACACATGATCCATGGCAATAATTACAGGTGTGG
TTTTGCAGTTTATTCCACCTTCTTTTCTGGTCATTAAAGCTTTTGACCACTGCAGATTGTATCTTATCACTTAGCCGGGTTGCAAATAGGTGAAAGTAGGCTGTTTGGTT
CAAATTCTGCAGTACTCGGATTAATTTCCCGCCCTTCTCATAGATGACAGTTTGGCCATCTTGAGATTCAACACCAGCTGCGAAGATTTTTACAGGCCTGAATTGAAAAA
CTGGCTCCACACGGCCTTCTTCAGCCAGGACTACTGCACCAAGAAGTTCCCCCAAGAAAGTATCCTACAGCAGAACAAGCTTTTATTAC
Protein sequence
MEKFCSFNPLFFFSLSISLLLGRASSSYAGSASSIVNPAKVKQISWNPRAFVYEGFLTDLECDHLISIAKAELKRSAVADNLSGESKVSEIRTSSGAFIQKSKDPIVSGI
EDKISAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFADEVNIARGGHRMATVLMYLSDVKRGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHP
NAVPDTSSLHGGCPVIEGEKWSATKWIRVNPFDQIVGDYMNCSDENASCERWAELGECNDNPEYMVGSPEFPGYCRKSCKSTEASVSKNGGGIDAETAGSGTLDSQYLQL
LQLQDESNPTFVSEVATLFFEDTEELLNKLRVALLQPSVDFKKIDDHVHQLKGSSSSIGALRVKNACIDFRSACEQQSPEWCSRCLQQVEQAFYGVKDKLSYLYALEQRI
LNAGGSIPVDLGS
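
The CDS and protein sequences above are related by translation. As a rough consistency check, the sketch below translates short CDS fragments with the standard genetic code. The `translate` helper and the `CODON_TABLE` name are illustrative and not part of the database. The two fragments are the first 27 and the last 42 nucleotides of the CDS copied from this record, and the sketch assumes the CDS is given in frame from its first base.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGAGAAATTTTGCAGTTTCAATCCG"                # first 27 nt of the CDS
cds_end = "TTGAATGCTGGTGGATCCATCCCAGTGGACTTGGGTTCCTAA"   # last 42 nt of the CDS

print(translate(cds_start))  # MEKFCSFNP
print(translate(cds_end))    # LNAGGSIPVDLGS
```

Both results match the corresponding ends of the protein sequence listed above (`MEKFCSFNP...` and `...LNAGGSIPVDLGS`). For routine work an established library is usually preferable to a hand-written codon table.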