; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G000210 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G000210
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionProcollagen-proline 3-dioxygenase
Genome locationCmo_Chr04:111422..115485
RNA-Seq ExpressionCmoCh04G000210
SyntenyCmoCh04G000210
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0032963 - collagen metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0019797 - procollagen-proline 3-dioxygenase activity (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR039575 - Prolyl 3-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6599868.1 Prolyl 3-hydroxylase 1, partial [Cucurbita argyrosperma subsp. sororia]4.1e-19492.52Show/hide
Query:  MPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLTAMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFH
        MPFVSIRERLKEKAEEFFGCEYELFVEFTGLI                         SWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFH
Subjt:  MPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLTAMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFH

Query:  FQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDIC
        FQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKL+SLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDIC
Subjt:  FQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDIC

Query:  WARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALA
        WARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVV FLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALA
Subjt:  WARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALA

Query:  ESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS
        ESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS
Subjt:  ESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS

KAG7030552.1 hypothetical protein SDJN02_04589 [Cucurbita argyrosperma subsp. argyrosperma]1.2e-20684.21Show/hide
Query:  MKMGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLT
        MKMGDEAEINQRRRL LENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLI      
Subjt:  MKMGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLT

Query:  AMLL--------------EYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSL
         M L              +  +   ++  PS FSWTRGARIGWHSDDNRPYLKQREFT                                DCVMYTADSL
Subjt:  AMLL--------------EYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSL

Query:  NVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPD
        NVHSVDEVTSGERLTLTLWFTRDSSHDEDAKL+SLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPD
Subjt:  NVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPD

Query:  LFSQDVQLVRGNKIFSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLA
        LFSQDVQLVRGNKIFSQKFDSILHALQVV FLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLA
Subjt:  LFSQDVQLVRGNKIFSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLA

Query:  AVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS
        AVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS
Subjt:  AVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS

XP_022942569.1 prolyl 3-hydroxylase 1 isoform X1 [Cucurbita moschata]5.8e-23394.09Show/hide
Query:  MKMGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLT
        MKMGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLI      
Subjt:  MKMGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLT

Query:  AMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERL
                           SWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERL
Subjt:  AMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERL

Query:  TLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKI
        TLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKI
Subjt:  TLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKI

Query:  FSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLR
        FSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLR
Subjt:  FSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLR

Query:  RELLRSFAHWRTSQSIYSVPYGS
        RELLRSFAHWRTSQSIYSVPYGS
Subjt:  RELLRSFAHWRTSQSIYSVPYGS

XP_022989994.1 prolyl 3-hydroxylase 1 isoform X1 [Cucurbita maxima]5.6e-22892.43Show/hide
Query:  MKMGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLT
        MKMGDEAEINQR RL LENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSA LIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLI      
Subjt:  MKMGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLT

Query:  AMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERL
                           SWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERL
Subjt:  AMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERL

Query:  TLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKI
        TLTLWFTRDSSHDEDAKL+SLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKI
Subjt:  TLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKI

Query:  FSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLR
        FSQKFDSILHALQVVQFLYWKGKELDST+SKEDSSYAEGLSPKRNVGVD+FKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVA AWEDYASNLR
Subjt:  FSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLR

Query:  RELLRSFAHWRTSQSIYSVPYGS
        RELLRSFAHWRTSQSIYSVPYGS
Subjt:  RELLRSFAHWRTSQSIYSVPYGS

XP_023552239.1 prolyl 3-hydroxylase 1 [Cucurbita pepo subsp. pepo]2.1e-23093.14Show/hide
Query:  MKMGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLT
        MKMGDEAEINQRRRL LENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLI      
Subjt:  MKMGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLT

Query:  AMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERL
                           SWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERL
Subjt:  AMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERL

Query:  TLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKI
        TLTLWFTRDSSHDEDAKL+SLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKI
Subjt:  TLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKI

Query:  FSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLR
        FSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKD+ALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLR
Subjt:  FSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLR

Query:  RELLRSFAHWRTSQSIYSVPYGS
        RELLRSF HWRTSQSIYSVPYGS
Subjt:  RELLRSFAHWRTSQSIYSVPYGS

TrEMBL top hitse value%identityAlignment
A0A0A0KMN7 Procollagen-proline 3-dioxygenase1.4e-18778.38Show/hide
Query:  MGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLTAM
        M D AE  QRRRL LENFL+ EECRELEFIHKSC TVGYRP VFSTTLLHLV +NSAHLI+PFV IRE+LKEKAEEFFGC YELFVEFTGLIS  S    
Subjt:  MGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLTAM

Query:  LLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTL
              + P     S   WTRGA IGWHSDDNRPYLKQREF+AVCYLNSYGV+F GGLFHFQDGEP+TISP  GDCVMYTAD+ NVHSVDE+T+GERLTL
Subjt:  LLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTL

Query:  TLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFS
        TLWFTRDSSHDEDAKL+SLLSQS LHDR PDSCLPQPPSCNMYWFSP+DDPNFKFGFDICWARL ALGY +YFP DH  SEYPDLF QDVQLV G+KIF 
Subjt:  TLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFS

Query:  QKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRE
        QKF++ILH LQVVQFL WKGKELDSTN  EDSSYAE LSPKRNVGV YFKSEFSK+D LAESVF  A+SD KE Q  LGW KL A AAAWE YAS LRRE
Subjt:  QKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRE

Query:  LLRSFAHWRTSQSIYSVPYGS
        LL SF+HWR  QSIYSV   S
Subjt:  LLRSFAHWRTSQSIYSVPYGS

A0A5D3CRE9 Procollagen-proline 3-dioxygenase7.5e-18677.73Show/hide
Query:  MGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLTAM
        M D AE  QRRRL LENFL+ EECRELEFIHKSCCTVGYRP V STTLLHLV +NSAHLI+PFV IRE+LKEKAEEFFGC YELFVEFTGLI        
Subjt:  MGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLTAM

Query:  LLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTL
                         SWTRGA IGWHSDDNRPYLKQREF+AVCYLNSYGV+F GGLFHFQDGEP+TISP  GDCVMY ADS NVHSVDE+T+GERLTL
Subjt:  LLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTL

Query:  TLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFS
        TLWFTRDSSHDEDAKL+SLLSQS LHDR P+SCLPQPPSCNMYWFSP+DDPNFKFGFDICWARLHALGY IYFP DH  SEYPDLFSQDVQLV G+KIF 
Subjt:  TLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFS

Query:  QKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKL-AAVAAAWEDYASNLRR
        QKF++ILH LQVVQFL WKGKELD+TN  EDS YAE LSPKRNVGV YFKSEFSK+D LAESVF  A+S  KE QH LGW KL  A AAAWEDYAS LRR
Subjt:  QKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKL-AAVAAAWEDYASNLRR

Query:  ELLRSFAHWRTSQSIYSVPYGS
        ELL SF+HWR  QSIYSV   S
Subjt:  ELLRSFAHWRTSQSIYSVPYGS

A0A6J1DQY8 Procollagen-proline 3-dioxygenase2.6e-18677.07Show/hide
Query:  MGDEAEINQ--RRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLT
        MGDE E  Q  RRRL LENFLT EECRELEFIHKSCCTVGYRP VFSTTLLHLV +NSAHLIMPFV IRERLKEKAEEFFGCEYELFVEFTGLI      
Subjt:  MGDEAEINQ--RRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLT

Query:  AMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERL
                           SWTRGA IGWHSDDNRPYLKQREFTAVCYLNSYGVDF GGLFHFQDGEPK+ISP CGDCVMYTADS NVHSVDE+T+GERL
Subjt:  AMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERL

Query:  TLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKI
        TLTLWFTRD+SHDEDAKL+SLLSQSHLHDR P+SC+P PPSCNMYWFSP++DPNFKFG D+CWARLHALGY IYFP+D+ LS+YP LFS  VQLVR  KI
Subjt:  TLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKI

Query:  FSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLR
        F Q+F +ILHALQVVQF+ WKGKELDSTN K +SSYA  LSPK N GV YFKSEFSK+  LA+SVF  ASSD KEKQ  LGWAKLA   AAWEDYASNLR
Subjt:  FSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLR

Query:  RELLRSFAHWRTSQSIYSVPYGS
         ELLRS  HWRT+QS+Y V  GS
Subjt:  RELLRSFAHWRTSQSIYSVPYGS

A0A6J1FRP8 Procollagen-proline 3-dioxygenase2.8e-23394.09Show/hide
Query:  MKMGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLT
        MKMGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLI      
Subjt:  MKMGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLT

Query:  AMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERL
                           SWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERL
Subjt:  AMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERL

Query:  TLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKI
        TLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKI
Subjt:  TLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKI

Query:  FSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLR
        FSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLR
Subjt:  FSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLR

Query:  RELLRSFAHWRTSQSIYSVPYGS
        RELLRSFAHWRTSQSIYSVPYGS
Subjt:  RELLRSFAHWRTSQSIYSVPYGS

A0A6J1JQV6 Procollagen-proline 3-dioxygenase2.7e-22892.43Show/hide
Query:  MKMGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLT
        MKMGDEAEINQR RL LENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSA LIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLI      
Subjt:  MKMGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLT

Query:  AMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERL
                           SWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERL
Subjt:  AMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERL

Query:  TLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKI
        TLTLWFTRDSSHDEDAKL+SLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKI
Subjt:  TLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKI

Query:  FSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLR
        FSQKFDSILHALQVVQFLYWKGKELDST+SKEDSSYAEGLSPKRNVGVD+FKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVA AWEDYASNLR
Subjt:  FSQKFDSILHALQVVQFLYWKGKELDSTNSKEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLR

Query:  RELLRSFAHWRTSQSIYSVPYGS
        RELLRSFAHWRTSQSIYSVPYGS
Subjt:  RELLRSFAHWRTSQSIYSVPYGS

SwissProt top hitse value%identityAlignment
A5WFM3 PKHD-type hydroxylase PsycPRwf_15232.8e-0431.4Show/hide
Query:  PSQFS-WTRGARIGWHSDD------NRPYLKQREFTAVCYLNSYGVDFEGGLFHFQD--GEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWF
        P  FS +  G   G H D+      +   L + + +   +LN+   D+EGG     D  GE  +I    GD V+Y + SL  H V+ VTSG+RL +  W 
Subjt:  PSQFS-WTRGARIGWHSDD------NRPYLKQREFTAVCYLNSYGVDFEGGLFHFQD--GEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWF

Query:  TRDSSHDEDAKLVSLLSQSHL
              DE  +++  L  SH+
Subjt:  TRDSSHDEDAKLVSLLSQSHL

Q4KLM6 Prolyl 3-hydroxylase 22.6e-1027.39Show/hide
Query:  EINQRRRLPLENFLTLEECRELEFIHKSCCTV--GYR----------PYVFSTTLLHL-------VVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFV
        ++N  +R+ L+N L+ E+CREL  +      V  GYR           +  +T L  L       V   SA L   F  I E+ ++  E +F     L+ 
Subjt:  EINQRRRLPLENFLTLEECRELEFIHKSCCTV--GYR----------PYVFSTTLLHL-------VVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFV

Query:  EFTGLISSGSLTAMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKT----ISPLCGDCVMYTAD
         +T ++   +L+        +S   +  +       A   W      P    R+++A+ Y+N    DFEGG F F + + KT    I P CG  + +++ 
Subjt:  EFTGLISSGSLTAMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKT----ISPLCGDCVMYTAD

Query:  SLNVHSVDEVTSGERLTLTLWFTRDSSHDE
          N H V  VT G+R  + LWFT D  + E
Subjt:  SLNVHSVDEVTSGERLTLTLWFTRDSSHDE

Q5XGE0 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 37.1e-0834.12Show/hide
Query:  WHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQD-GEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSH
        WH   ++      ++T++ YL+ Y  DF GG F F D G  +T+ P  G    +T+ S N+H V++V+ G R  +T+ FT +  H
Subjt:  WHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQD-GEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSH

Q6JHU7 Prolyl 3-hydroxylase 28.9e-1126.61Show/hide
Query:  EINQRRRLPLENFLTLEECRELEFIHKSCCTV--GYR----------PYVFSTTLLHL-------VVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFV
        ++N  +R+ L+N ++ E+CREL  +         GYR           +  +T L  L       V   SA L   F  I E+ +   E +F     L+ 
Subjt:  EINQRRRLPLENFLTLEECRELEFIHKSCCTV--GYR----------PYVFSTTLLHL-------VVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFV

Query:  EFTGLISSGSLTAMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKT----ISPLCGDCVMYTAD
         +T L+   +L+        +S   +  +       A   W      P    R+++A+ Y+N+   DFEGG F F + + KT    I P CG  V +++ 
Subjt:  EFTGLISSGSLTAMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKT----ISPLCGDCVMYTAD

Query:  SLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKL-----VSLLSQSHL
          N H V  VT G+R  + LWFT D  + E  ++     +++L Q H+
Subjt:  SLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKL-----VSLLSQSHL

Q8IVL5 Prolyl 3-hydroxylase 23.4e-1027.39Show/hide
Query:  EINQRRRLPLENFLTLEECRELEFIHKSCCTV--GYR----------PYVFSTTLLHL-------VVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFV
        ++N  +R+ L+N L+ E+CREL  +      V  GYR           +  +T L  L       V   SA L   F  I E+ +   E +F     L+ 
Subjt:  EINQRRRLPLENFLTLEECRELEFIHKSCCTV--GYR----------PYVFSTTLLHL-------VVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFV

Query:  EFTGLISSGSLTAMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKT----ISPLCGDCVMYTAD
         +T ++   +L+        +S   +  +       A   W      P    R+++A+ Y+N    DFEGG F F + + KT    I P CG  + +++ 
Subjt:  EFTGLISSGSLTAMLLEYFIISPVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKT----ISPLCGDCVMYTAD

Query:  SLNVHSVDEVTSGERLTLTLWFTRDSSHDE
          N H V  VT G+R  + LWFT D  + E
Subjt:  SLNVHSVDEVTSGERLTLTLWFTRDSSHDE

Arabidopsis top hitse value%identityAlignment
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.9e-11352.2Show/hide
Query:  RLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLTAMLLEYFIISPVW
        RL L NFL+  EC+ELE IHKS  T+GYRP VFSTTL HL+ +NS HLI+PFVSIRERLKEK EE FGCEYELF+EFTGLI                   
Subjt:  RLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLTAMLLEYFIISPVW

Query:  YGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHD
              SW +GA IGWHSDDNR YLKQR+F AVCYLNSY  DF GGLF FQ GEP T++P  GD +MYTAD  N+HSVDEVT GERLTL LWF+RDSSHD
Subjt:  YGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHD

Query:  EDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSP-KDDPNFKFGFDICWARLHALGYGIYFPQ--DHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILH
        ED+KL+S LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARLH LG+ ++  Q  DHS      L    +QL +G K+ ++KF +ILH
Subjt:  EDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSP-KDDPNFKFGFDICWARLHALGYGIYFPQ--DHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILH

Query:  ALQVVQFLYWKGKELDSTNSKEDS-SYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAH
        ALQVVQF +WK  EL ++N + D+    + +S  +   ++  KS F  D+ L  + F Y+ S  ++++  L    +A    +WE+Y+  L +ELL S   
Subjt:  ALQVVQFLYWKGKELDSTNSKEDS-SYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAH

Query:  WRTSQSIYSV
        W+T Q+I+ V
Subjt:  WRTSQSIYSV

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein9.3e-8844.74Show/hide
Query:  RLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLTAMLLEYFIISPVW
        RL L NFL+  EC+ELE IHKS  T+GYRP VFSTTL HL+ +NS HLI+PFVSIRERLKEK EE FGCEYELF+EFTGLI                   
Subjt:  RLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLTAMLLEYFIISPVW

Query:  YGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHD
              SW +GA IGWHSDDNR YLKQR+F +                    GEP T++P  GD +MYTAD  N+HSVDEVT GERLTL LWF+RDSSHD
Subjt:  YGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHD

Query:  EDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQ--DHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHA
        ED+KL+S LSQ                                  FD+C ARLH LG+ ++  Q  DHS      L    +QL +G K+ ++KF +ILHA
Subjt:  EDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQ--DHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHA

Query:  LQVVQFLYWKGKELDSTNSKEDS-SYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHW
        LQVVQF +WK  EL ++N + D+    + +S  +   ++  KS F  D+ L  + F Y+ S  ++++  L    +A    +WE+Y+  L +ELL S   W
Subjt:  LQVVQFLYWKGKELDSTNSKEDS-SYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHW

Query:  RTSQSIYSV
        +T Q+I+ V
Subjt:  RTSQSIYSV

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.1e-9948.29Show/hide
Query:  RLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLTAMLLEYFIISPVW
        RL L NFL+  EC+ELE IHKS  T+GYRP VFSTTL HL+ +NS HLI+PFVSIRERLKEK EE FGCEYELF+EFTGLI                   
Subjt:  RLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLTAMLLEYFIISPVW

Query:  YGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHD
              SW +GA IGWHSDDNR YLKQR+F +                    GEP T++P  GD +MYTAD  N+HSVDEVT GERLTL LWF+RDSSHD
Subjt:  YGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHD

Query:  EDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSP-KDDPNFKFGFDICWARLHALGYGIYFPQ--DHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILH
        ED+KL+S LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARLH LG+ ++  Q  DHS      L    +QL +G K+ ++KF +ILH
Subjt:  EDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSP-KDDPNFKFGFDICWARLHALGYGIYFPQ--DHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILH

Query:  ALQVVQFLYWKGKELDSTNSKEDS-SYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAH
        ALQVVQF +WK  EL ++N + D+    + +S  +   ++  KS F  D+ L  + F Y+ S  ++++  L    +A    +WE+Y+  L +ELL S   
Subjt:  ALQVVQFLYWKGKELDSTNSKEDS-SYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAH

Query:  WRTSQSIYSV
        W+T Q+I+ V
Subjt:  WRTSQSIYSV

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.4e-5145.49Show/hide
Query:  MYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSP-KDDPNFKFGFDICWARLHALGYGIYFPQ-
        MYTAD  N+HSVDEVT GERLTL LWF+RDSSHDED+KL+S LSQ   H    + CLP P S NMYWF P +D  N   GFD+C ARLH LG+ ++  Q 
Subjt:  MYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLVSLLSQSHLHDRLPDSCLPQPPSCNMYWFSP-KDDPNFKFGFDICWARLHALGYGIYFPQ-

Query:  -DHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVQFLYWKGKELDSTNSKEDS-SYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKE
         DHS      L    +QL +G K+ ++KF +ILHALQVVQF +WK  EL ++N + D+    + +S  +   ++  KS F  D+ L  + F Y+ S  ++
Subjt:  -DHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVQFLYWKGKELDSTNSKEDS-SYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKE

Query:  KQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQSIYSV
        ++  L    +A    +WE+Y+  L +ELL S   W+T Q+I+ V
Subjt:  KQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQSIYSV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATGGGAGACGAAGCGGAGATCAATCAGCGGCGGCGTCTCCCTCTGGAAAATTTCTTAACCCTCGAAGAATGCAGGGAACTGGAGTTCATCCATAAGAGCTGCTG
TACGGTGGGTTATAGACCATACGTCTTCTCCACCACTCTTTTGCATCTTGTTGTCTCTAATTCTGCTCATTTGATCATGCCTTTTGTTTCGATTAGAGAGCGGTTGAAGG
AGAAAGCGGAGGAGTTCTTTGGCTGTGAATATGAGCTCTTCGTCGAGTTCACTGGCTTGATAAGTTCAGGGAGTCTGACGGCCATGTTACTGGAGTACTTTATAATCAGC
CCAGTCTGGTATGGGCCTTCGCAGTTCAGCTGGACCAGGGGAGCAAGGATTGGATGGCATAGTGACGACAACCGGCCCTATCTAAAACAACGTGAATTTACAGCAGTGTG
TTACTTGAATAGTTATGGAGTAGATTTCGAAGGTGGGCTGTTTCACTTTCAGGACGGGGAACCAAAAACTATCTCGCCTCTTTGTGGAGATTGTGTGATGTACACGGCTG
ACAGCCTCAATGTTCATTCTGTTGATGAGGTAACCAGTGGAGAAAGACTTACGCTGACATTATGGTTCACCCGTGATAGTTCCCATGATGAAGATGCAAAACTTGTTTCC
CTTCTTTCACAAAGCCATTTACACGATCGTCTTCCTGACTCATGCCTACCTCAGCCTCCATCCTGTAATATGTATTGGTTTTCACCAAAAGACGATCCAAATTTCAAGTT
CGGTTTTGATATATGTTGGGCAAGACTGCATGCGCTAGGATACGGCATTTATTTTCCTCAGGACCATAGTTTGTCAGAGTATCCAGATTTATTCTCACAGGATGTACAAT
TAGTACGTGGTAATAAGATCTTCTCTCAGAAGTTTGATAGCATTTTGCATGCACTTCAGGTAGTGCAATTTCTATATTGGAAAGGCAAAGAATTGGATTCTACTAACTCC
AAGGAAGATTCAAGCTATGCAGAAGGTTTATCTCCAAAGAGAAATGTGGGAGTCGATTACTTTAAATCCGAGTTCTCAAAGGACGATGCACTGGCGGAGTCAGTCTTCTT
GTATGCTAGTTCTGATGTCAAGGAGAAGCAACACCGGTTGGGGTGGGCTAAGCTTGCTGCAGTAGCAGCAGCTTGGGAAGATTATGCTTCCAATTTAAGGAGAGAACTCC
TTCGGAGCTTCGCCCATTGGAGAACCAGTCAATCCATATACAGTGTTCCATATGGTAGTTGA
mRNA sequenceShow/hide mRNA sequence
TATATAAACCGTGTTTAATATCTATTTTGTTCTCTATTTAAATTTCGTACCGGCTTCATCGATGGCGCGTGTTTATGAGAGAGGGATACATGGAGAAGCAGCAGTCGCCT
CGCCTTAAAGGATTTCATTCGATTATTGTTGCAAATTGGGATATCGCTTTCGGTATATTTTTATGGTCAAGGGCATTCCATCCGATTACTGCCATTGAAACGAGAGACAG
AGAATTGGATGAAAATGGGAGACGAAGCGGAGATCAATCAGCGGCGGCGTCTCCCTCTGGAAAATTTCTTAACCCTCGAAGAATGCAGGGAACTGGAGTTCATCCATAAG
AGCTGCTGTACGGTGGGTTATAGACCATACGTCTTCTCCACCACTCTTTTGCATCTTGTTGTCTCTAATTCTGCTCATTTGATCATGCCTTTTGTTTCGATTAGAGAGCG
GTTGAAGGAGAAAGCGGAGGAGTTCTTTGGCTGTGAATATGAGCTCTTCGTCGAGTTCACTGGCTTGATAAGTTCAGGGAGTCTGACGGCCATGTTACTGGAGTACTTTA
TAATCAGCCCAGTCTGGTATGGGCCTTCGCAGTTCAGCTGGACCAGGGGAGCAAGGATTGGATGGCATAGTGACGACAACCGGCCCTATCTAAAACAACGTGAATTTACA
GCAGTGTGTTACTTGAATAGTTATGGAGTAGATTTCGAAGGTGGGCTGTTTCACTTTCAGGACGGGGAACCAAAAACTATCTCGCCTCTTTGTGGAGATTGTGTGATGTA
CACGGCTGACAGCCTCAATGTTCATTCTGTTGATGAGGTAACCAGTGGAGAAAGACTTACGCTGACATTATGGTTCACCCGTGATAGTTCCCATGATGAAGATGCAAAAC
TTGTTTCCCTTCTTTCACAAAGCCATTTACACGATCGTCTTCCTGACTCATGCCTACCTCAGCCTCCATCCTGTAATATGTATTGGTTTTCACCAAAAGACGATCCAAAT
TTCAAGTTCGGTTTTGATATATGTTGGGCAAGACTGCATGCGCTAGGATACGGCATTTATTTTCCTCAGGACCATAGTTTGTCAGAGTATCCAGATTTATTCTCACAGGA
TGTACAATTAGTACGTGGTAATAAGATCTTCTCTCAGAAGTTTGATAGCATTTTGCATGCACTTCAGGTAGTGCAATTTCTATATTGGAAAGGCAAAGAATTGGATTCTA
CTAACTCCAAGGAAGATTCAAGCTATGCAGAAGGTTTATCTCCAAAGAGAAATGTGGGAGTCGATTACTTTAAATCCGAGTTCTCAAAGGACGATGCACTGGCGGAGTCA
GTCTTCTTGTATGCTAGTTCTGATGTCAAGGAGAAGCAACACCGGTTGGGGTGGGCTAAGCTTGCTGCAGTAGCAGCAGCTTGGGAAGATTATGCTTCCAATTTAAGGAG
AGAACTCCTTCGGAGCTTCGCCCATTGGAGAACCAGTCAATCCATATACAGTGTTCCATATGGTAGTTGACTCTTCCACTTCTGGGATAGTGGCAGTCTCAAATGCTAAA
AGTAGCTGAGCTTCAAGGTTAGTTATGGGCCTTTGTACAGCTTAACTAGATTTTACTTGTCCCAAGTGTGCAATCCTTTGTGTTATATTCTTACCCTTTTCAACGATTTA
GCCATTTTTCACAGTTTTTAAAGGG
Protein sequenceShow/hide protein sequence
MKMGDEAEINQRRRLPLENFLTLEECRELEFIHKSCCTVGYRPYVFSTTLLHLVVSNSAHLIMPFVSIRERLKEKAEEFFGCEYELFVEFTGLISSGSLTAMLLEYFIIS
PVWYGPSQFSWTRGARIGWHSDDNRPYLKQREFTAVCYLNSYGVDFEGGLFHFQDGEPKTISPLCGDCVMYTADSLNVHSVDEVTSGERLTLTLWFTRDSSHDEDAKLVS
LLSQSHLHDRLPDSCLPQPPSCNMYWFSPKDDPNFKFGFDICWARLHALGYGIYFPQDHSLSEYPDLFSQDVQLVRGNKIFSQKFDSILHALQVVQFLYWKGKELDSTNS
KEDSSYAEGLSPKRNVGVDYFKSEFSKDDALAESVFLYASSDVKEKQHRLGWAKLAAVAAAWEDYASNLRRELLRSFAHWRTSQSIYSVPYGS