CuGenDBv2

Spg007395 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg007395
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Procollagen-proline 3-dioxygenase
Genome location: scaffold7:19712593..19716113
RNA-Seq Expression: Spg007395
Synteny: Spg007395
Gene Ontology terms:
GO:0032963 - collagen metabolic process (biological process)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0051213 - dioxygenase activity (molecular function)
InterPro domains:
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR039575 - Prolyl 3-hydroxylase


Homology
GenBank top hits: e value, % identity, alignment
KAG6599868.1 Prolyl 3-hydroxylase 1, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.0e-145; % identity: 73.3)
Query:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL
        LI WTRGA IGWHSDDNRPYLKQREF+                                      AVCYLNSYGVDF GGLFHFQDGEPKTISP CG   
Subjt:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL

Query:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK
                            DCVMYTADS NVHSVDE+T+GERLTLTLWFTRDSSHDEDAKLLSLLSQS LHDR PDSCLPQPPSCNMYWFSP+DDPNFK
Subjt:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK

Query:  FGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFS
        FGFD+CWARLHALGY IYFP DHSLSEYPDLFSQ+VQLVR  KIF Q+FDSILHALQVV FL WKGKELDSTN K DSSYAE LSPKRNVGV YFKSEFS
Subjt:  FGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFS

Query:  KDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS
        KDDALAESVF  ASSD KEKQH  GWAKLAA AAAWEDYAS LRRELLRS +HWRTSQSIYSV   S
Subjt:  KDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS

XP_004140463.1 prolyl 3-hydroxylase 1 [Cucumis sativus] (e value: 9.9e-145; % identity: 71.93)
Query:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL
        LI WTRGASIGWHSDDNRPYLKQREFS                                      AVCYLNSYGV+FGGGLFHFQDGEP+TISPF     
Subjt:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL

Query:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK
                          Y DCVMYTAD+ NVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS LHDRFPDSCLPQPPSCNMYWFSPEDDPNFK
Subjt:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK

Query:  FGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFS
        FGFD+CWARL ALGY++YFP DH  SEYPDLF Q+VQLV   KIFFQ+F++ILH LQVVQFLCWKGKELDSTN   DSSYAEYLSPKRNVGVSYFKSEFS
Subjt:  FGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFS

Query:  KDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS
        K+D LAESVFS A+SDGKE Q W GW KL AAAAAWE YAS LRRELL S SHWR  QSIYSVSL S
Subjt:  KDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS

XP_022942569.1 prolyl 3-hydroxylase 1 isoform X1 [Cucurbita moschata] (e value: 6.9e-146; % identity: 73.3)
Query:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL
        LI WTRGA IGWHSDDNRPYLKQREF+                                      AVCYLNSYGVDF GGLFHFQDGEPKTISP CG   
Subjt:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL

Query:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK
                            DCVMYTADS NVHSVDE+T+GERLTLTLWFTRDSSHDEDAKL+SLLSQS LHDR PDSCLPQPPSCNMYWFSP+DDPNFK
Subjt:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK

Query:  FGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFS
        FGFD+CWARLHALGY IYFP DHSLSEYPDLFSQ+VQLVR  KIF Q+FDSILHALQVVQFL WKGKELDSTN K DSSYAE LSPKRNVGV YFKSEFS
Subjt:  FGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFS

Query:  KDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS
        KDDALAESVF  ASSD KEKQH  GWAKLAA AAAWEDYAS LRRELLRS +HWRTSQSIYSV   S
Subjt:  KDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS

XP_023552239.1 prolyl 3-hydroxylase 1 [Cucurbita pepo subsp. pepo] (e value: 2.6e-145; % identity: 73.3)
Query:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL
        LI WTRGA IGWHSDDNRPYLKQREF+                                      AVCYLNSYGVDF GGLFHFQDGEPKTISP CG   
Subjt:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL

Query:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK
                            DCVMYTADS NVHSVDE+T+GERLTLTLWFTRDSSHDEDAKLLSLLSQS LHDR PDSCLPQPPSCNMYWFSP+DDPNFK
Subjt:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK

Query:  FGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFS
        FGFD+CWARLHALGY IYFP DHSLSEYPDLFSQ+VQLVR  KIF Q+FDSILHALQVVQFL WKGKELDSTN K DSSYAE LSPKRNVGV YFKSEFS
Subjt:  FGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFS

Query:  KDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS
        KD+ALAESVF  ASSD KEKQH  GWAKLAA AAAWEDYAS LRRELLRS  HWRTSQSIYSV   S
Subjt:  KDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS

XP_038893062.1 uncharacterized protein LOC120081945 [Benincasa hispida] (e value: 1.8e-146; % identity: 72.75)
Query:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL
        LI WTRGASIGWHSDDNRPYLKQR+FS                                      AVCYLNSYGV+FGGGLFHFQDGEP+TISPFCG   
Subjt:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL

Query:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK
                            DCVMYTADS NVHSVDEITNGERLTLTLW TRDSSHDED+KLLSLLSQS LHDR PDS LPQPPSCNMYWFS EDDPNFK
Subjt:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK

Query:  FGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFS
         GFD+CWARLHALGY+IYF  DHS SEYPDLFS++VQLV+  K+FFQEF++ILH LQVVQFLCWKGKELDSTN K DSSYAEYLSPKRNVGVSYFKSEFS
Subjt:  FGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFS

Query:  KDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS
        KDD LAESVFS A+SDGKE QHW GW KLAAAAAAWEDYAS LRRELL SLS+WR SQSIYSVSL+S
Subjt:  KDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS

TrEMBL top hits: e value, % identity, alignment
A0A0A0KMN7 Procollagen-proline 3-dioxygenase (e value: 9.7e-146; % identity: 66.92)
Query:  KRNFQDCYNNRSEIWDSFCLVFSYDSP---RPSGNLICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFL
        K   ++ +    E++  F  + S  S    +PS + + WTRGASIGWHSDDNRPYLKQREFS                                      
Subjt:  KRNFQDCYNNRSEIWDSFCLVFSYDSP---RPSGNLICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFL

Query:  AVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVLVLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSL
        AVCYLNSYGV+FGGGLFHFQDGEP+TISPF                       Y DCVMYTAD+ NVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSL
Subjt:  AVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVLVLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSL

Query:  LSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWK
        LSQS LHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFD+CWARL ALGY++YFP DH  SEYPDLF Q+VQLV   KIFFQ+F++ILH LQVVQFLCWK
Subjt:  LSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWK

Query:  GKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSL
        GKELDSTN   DSSYAEYLSPKRNVGVSYFKSEFSK+D LAESVFS A+SDGKE Q W GW KL AAAAAWE YAS LRRELL S SHWR  QSIYSVSL
Subjt:  GKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFSKDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSL

Query:  AS
         S
Subjt:  AS

A0A5D3CRE9 Procollagen-proline 3-dioxygenase (e value: 1.4e-144; % identity: 72.01)
Query:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL
        LI WTRGASIGWHSDDNRPYLKQREFS                                      AVCYLNSYGV+FGGGLFHFQDGEP+TISPF     
Subjt:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL

Query:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK
                          Y DCVMY ADS NVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQS LHDRFP+SCLPQPPSCNMYWFSPEDDPNFK
Subjt:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK

Query:  FGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFS
        FGFD+CWARLHALGY+IYFP DH  SEYPDLFSQ+VQLV   KIFFQ+F++ILH LQVVQFLCWKGKELD+TN   DS YAEYLSPKRNVGVSYFKSEFS
Subjt:  FGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFS

Query:  KDDALAESVFSCASSDGKEKQHWFGWAKL-AAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS
        K+D LAESVFS A+S GKE QHW GW KL  AAAAAWEDYAS LRRELL S SHWR  QSIYSVSL S
Subjt:  KDDALAESVFSCASSDGKEKQHWFGWAKL-AAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS

A0A6J1DQY8 Procollagen-proline 3-dioxygenase (e value: 6.3e-145; % identity: 71.39)
Query:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL
        LI WTRGASIGWHSDDNRPYLKQREF+                                      AVCYLNSYGVDF GGLFHFQDGEPK+ISPFCG   
Subjt:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL

Query:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK
                            DCVMYTADSHNVHSVDEITNGERLTLTLWFTRD+SHDEDAKLLSLLSQS LHDRFP+SC+P PPSCNMYWFSPE+DPNFK
Subjt:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK

Query:  FGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFS
        FG DVCWARLHALGY+IYFP D+ LS+YP LFS  VQLVR+KKIFFQEF +ILHALQVVQF+CWKGKELDSTNFKG+SSYA YLSPK N GVSYFKSEFS
Subjt:  FGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFS

Query:  KDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS
        K+  LA+SVFS ASSD KEKQ W GWAKLA A AAWEDYAS LR ELLRSL HWRT+QS+Y VSL S
Subjt:  KDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS

A0A6J1FRP8 Procollagen-proline 3-dioxygenase (e value: 3.3e-146; % identity: 73.3)
Query:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL
        LI WTRGA IGWHSDDNRPYLKQREF+                                      AVCYLNSYGVDF GGLFHFQDGEPKTISP CG   
Subjt:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL

Query:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK
                            DCVMYTADS NVHSVDE+T+GERLTLTLWFTRDSSHDEDAKL+SLLSQS LHDR PDSCLPQPPSCNMYWFSP+DDPNFK
Subjt:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK

Query:  FGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFS
        FGFD+CWARLHALGY IYFP DHSLSEYPDLFSQ+VQLVR  KIF Q+FDSILHALQVVQFL WKGKELDSTN K DSSYAE LSPKRNVGV YFKSEFS
Subjt:  FGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFS

Query:  KDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS
        KDDALAESVF  ASSD KEKQH  GWAKLAA AAAWEDYAS LRRELLRS +HWRTSQSIYSV   S
Subjt:  KDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS

A0A6J1JQV6 Procollagen-proline 3-dioxygenase (e value: 6.3e-145; % identity: 72.75)
Query:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL
        LI WTRGA IGWHSDDNRPYLKQREF+                                      AVCYLNSYGVDF GGLFHFQDGEPKTISP CG   
Subjt:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL

Query:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK
                            DCVMYTADS NVHSVDE+T+GERLTLTLWFTRDSSHDEDAKLLSLLSQS LHDR PDSCLPQPPSCNMYWFSP+DDPNFK
Subjt:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK

Query:  FGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFS
        FGFD+CWARLHALGY IYFP DHSLSEYPDLFSQ+VQLVR  KIF Q+FDSILHALQVVQFL WKGKELDST+ K DSSYAE LSPKRNVGV +FKSEFS
Subjt:  FGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEYLSPKRNVGVSYFKSEFS

Query:  KDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS
        KDDALAESVF  ASSD KEKQH  GWAKLAA A AWEDYAS LRRELLRS +HWRTSQSIYSV   S
Subjt:  KDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS

SwissProt top hits: e value, % identity, alignment
No hits found
Arabidopsis top hits: e value, % identity, alignment
AT1G68080.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 1.3e-70; % identity: 43.09)
Query:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL
        LI W +GASIGWHSDDNR YLKQR+F+                                      AVCYLNSY  DF GGLF FQ GEP T++P  GDV 
Subjt:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL

Query:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSP-EDDPNF
                              +MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P S NMYWF P +D  N 
Subjt:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSP-EDDPNF

Query:  KFGFDVCWARLHALGYNIYF--PCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDS-SYAEYLSPKRNVGVSYFK
          GFDVC ARLH LG++++     DHS      L    +QL +  K+  ++F +ILHALQVVQF  WK  EL ++N + D+    + +S  +   ++  K
Subjt:  KFGFDVCWARLHALGYNIYF--PCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDS-SYAEYLSPKRNVGVSYFK

Query:  SEFSKDDALAESVF--SCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSV
        S F  D+ L  + F  SC+  D K+     G   +A A  +WE+Y+ KL +ELL SL  W+T Q+I+ V
Subjt:  SEFSKDDALAESVF--SCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSV

AT1G68080.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 7.8e-47; % identity: 34.78)
Query:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL
        LI W +GASIGWHSDDNR YLKQR+F+                                                           GEP T++P  GDV 
Subjt:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL

Query:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK
                              +MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ                                
Subjt:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSPEDDPNFK

Query:  FGFDVCWARLHALGYNIYF--PCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDS-SYAEYLSPKRNVGVSYFKS
          FDVC ARLH LG++++     DHS      L    +QL +  K+  ++F +ILHALQVVQF  WK  EL ++N + D+    + +S  +   ++  KS
Subjt:  FGFDVCWARLHALGYNIYF--PCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDS-SYAEYLSPKRNVGVSYFKS

Query:  EFSKDDALAESVF--SCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSV
         F  D+ L  + F  SC+  D K+     G   +A A  +WE+Y+ KL +ELL SL  W+T Q+I+ V
Subjt:  EFSKDDALAESVF--SCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSV

AT1G68080.3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 6.8e-59; % identity: 38.75)
Query:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL
        LI W +GASIGWHSDDNR YLKQR+F+                                                           GEP T++P  GDV 
Subjt:  LICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINLFLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVL

Query:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSP-EDDPNF
                              +MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P S NMYWF P +D  N 
Subjt:  VLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSP-EDDPNF

Query:  KFGFDVCWARLHALGYNIYF--PCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDS-SYAEYLSPKRNVGVSYFK
          GFDVC ARLH LG++++     DHS      L    +QL +  K+  ++F +ILHALQVVQF  WK  EL ++N + D+    + +S  +   ++  K
Subjt:  KFGFDVCWARLHALGYNIYF--PCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDS-SYAEYLSPKRNVGVSYFK

Query:  SEFSKDDALAESVF--SCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSV
        S F  D+ L  + F  SC+  D K+     G   +A A  +WE+Y+ KL +ELL SL  W+T Q+I+ V
Subjt:  SEFSKDDALAESVF--SCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSV

AT1G68080.4 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 7.5e-50; % identity: 46.34)
Query:  MYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDVCWARLHALGYNIYF--P
        MYTAD  N+HSVDE+T+GERLTL LWF+RDSSHDED+KLLS LSQ   H    + CLP P S NMYWF P +D  N   GFDVC ARLH LG++++    
Subjt:  MYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHDRFPDSCLPQPPSCNMYWFSP-EDDPNFKFGFDVCWARLHALGYNIYF--P

Query:  CDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDS-SYAEYLSPKRNVGVSYFKSEFSKDDALAESVF--SCASSDG
         DHS      L    +QL +  K+  ++F +ILHALQVVQF  WK  EL ++N + D+    + +S  +   ++  KS F  D+ L  + F  SC+  D 
Subjt:  CDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDS-SYAEYLSPKRNVGVSYFKSEFSKDDALAESVF--SCASSDG

Query:  KEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSV
        K+     G   +A A  +WE+Y+ KL +ELL SL  W+T Q+I+ V
Subjt:  KEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSV


Sequences
CDS sequence
ATGCCATTACAACTGTTTCTTTGGAACATTTGGATAGAATGTAATAAGAGGAATTTTCAAGATTGTTATAATAATCGTAGTGAGATTTGGGATTCATTCTGTCTTGTTTT
CTCTTATGATTCCCCCCGCCCTTCTGGCAATCTCATTTGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGACGATAACCGGCCCTATCTGAAACAACGTGAATTTT
CAGTATGTGGACCAATCTTTCCTGGGAAAGTCGATGTCATTCTTGCTATTCTTATGATAGCCTATTCTTTCATTTCATTTATATTTTCATATCCTTTTTATATTAATTTA
TTTCTTGCAGTGTGTTACTTGAATAGTTATGGAGTAGATTTTGGAGGTGGGCTGTTTCACTTTCAGGATGGGGAACCAAAAACCATCTCGCCTTTTTGTGGAGATGTGCT
TGTGCTCAAAGAGCAGAGCAGTCTGTTGTATTTTTTGTTGATACTTGTTTCACTTTATCAGGATTGTGTGATGTACACGGCCGACAGCCACAATGTTCATTCTGTTGATG
AGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGTTCCCATGATGAAGATGCCAAACTGCTTTCGCTTCTTTCACAAAGCGATTTACATGAT
CGTTTTCCTGACTCGTGCCTACCTCAGCCTCCATCCTGTAATATGTATTGGTTTTCACCAGAAGACGATCCAAATTTCAAGTTTGGTTTTGATGTCTGTTGGGCGAGACT
GCACGCACTTGGATACAACATTTATTTTCCTTGTGATCATAGTTTGTCAGAGTATCCAGATTTATTCTCACAGGAAGTACAATTAGTACGGGAAAAGAAGATATTCTTTC
AGGAATTTGATAGCATTTTGCATGCACTTCAGGTAGTGCAATTCCTGTGTTGGAAAGGCAAAGAACTGGATTCTACTAACTTCAAAGGGGATTCAAGCTATGCAGAATAT
TTATCTCCAAAGAGGAATGTGGGTGTCAGTTACTTTAAATCCGAGTTTTCGAAGGACGATGCACTGGCCGAGTCAGTCTTCTCGTGTGCTAGTTCTGATGGCAAGGAGAA
GCAACACTGGTTCGGGTGGGCTAAGCTTGCTGCTGCTGCAGCAGCTTGGGAAGATTATGCTTCCAAACTAAGGAGAGAACTCCTTCGGAGCTTGAGCCATTGGAGAACCA
GTCAATCCATATACAGTGTTTCACTTGCTAGTTGA
mRNA sequence
ATGCCATTACAACTGTTTCTTTGGAACATTTGGATAGAATGTAATAAGAGGAATTTTCAAGATTGTTATAATAATCGTAGTGAGATTTGGGATTCATTCTGTCTTGTTTT
CTCTTATGATTCCCCCCGCCCTTCTGGCAATCTCATTTGCTGGACCAGGGGAGCAAGCATTGGATGGCATAGTGACGATAACCGGCCCTATCTGAAACAACGTGAATTTT
CAGTATGTGGACCAATCTTTCCTGGGAAAGTCGATGTCATTCTTGCTATTCTTATGATAGCCTATTCTTTCATTTCATTTATATTTTCATATCCTTTTTATATTAATTTA
TTTCTTGCAGTGTGTTACTTGAATAGTTATGGAGTAGATTTTGGAGGTGGGCTGTTTCACTTTCAGGATGGGGAACCAAAAACCATCTCGCCTTTTTGTGGAGATGTGCT
TGTGCTCAAAGAGCAGAGCAGTCTGTTGTATTTTTTGTTGATACTTGTTTCACTTTATCAGGATTGTGTGATGTACACGGCCGACAGCCACAATGTTCATTCTGTTGATG
AGATAACCAATGGAGAAAGGCTTACACTGACATTATGGTTCACCCGTGATAGTTCCCATGATGAAGATGCCAAACTGCTTTCGCTTCTTTCACAAAGCGATTTACATGAT
CGTTTTCCTGACTCGTGCCTACCTCAGCCTCCATCCTGTAATATGTATTGGTTTTCACCAGAAGACGATCCAAATTTCAAGTTTGGTTTTGATGTCTGTTGGGCGAGACT
GCACGCACTTGGATACAACATTTATTTTCCTTGTGATCATAGTTTGTCAGAGTATCCAGATTTATTCTCACAGGAAGTACAATTAGTACGGGAAAAGAAGATATTCTTTC
AGGAATTTGATAGCATTTTGCATGCACTTCAGGTAGTGCAATTCCTGTGTTGGAAAGGCAAAGAACTGGATTCTACTAACTTCAAAGGGGATTCAAGCTATGCAGAATAT
TTATCTCCAAAGAGGAATGTGGGTGTCAGTTACTTTAAATCCGAGTTTTCGAAGGACGATGCACTGGCCGAGTCAGTCTTCTCGTGTGCTAGTTCTGATGGCAAGGAGAA
GCAACACTGGTTCGGGTGGGCTAAGCTTGCTGCTGCTGCAGCAGCTTGGGAAGATTATGCTTCCAAACTAAGGAGAGAACTCCTTCGGAGCTTGAGCCATTGGAGAACCA
GTCAATCCATATACAGTGTTTCACTTGCTAGTTGA
Protein sequence
MPLQLFLWNIWIECNKRNFQDCYNNRSEIWDSFCLVFSYDSPRPSGNLICWTRGASIGWHSDDNRPYLKQREFSVCGPIFPGKVDVILAILMIAYSFISFIFSYPFYINL
FLAVCYLNSYGVDFGGGLFHFQDGEPKTISPFCGDVLVLKEQSSLLYFLLILVSLYQDCVMYTADSHNVHSVDEITNGERLTLTLWFTRDSSHDEDAKLLSLLSQSDLHD
RFPDSCLPQPPSCNMYWFSPEDDPNFKFGFDVCWARLHALGYNIYFPCDHSLSEYPDLFSQEVQLVREKKIFFQEFDSILHALQVVQFLCWKGKELDSTNFKGDSSYAEY
LSPKRNVGVSYFKSEFSKDDALAESVFSCASSDGKEKQHWFGWAKLAAAAAAWEDYASKLRRELLRSLSHWRTSQSIYSVSLAS