; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G013810 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G013810
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationchr09:21833818..21845953
RNA-Seq ExpressionLsi09G013810
SyntenyLsi09G013810
Gene Ontology termsGO:0000160 - phosphorelay signal transduction system (biological process)
GO:0006468 - protein phosphorylation (biological process)
GO:0009736 - cytokinin-activated signaling pathway (biological process)
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0005829 - cytosol (cellular component)
GO:0031418 - L-ascorbic acid binding (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0004674 - protein serine/threonine kinase activity (molecular function)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
InterPro domainsIPR045054 - Prolyl 4-hydroxylase
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR036641 - HPT domain superfamily
IPR008207 - Signal transduction histidine kinase, phosphotransfer (Hpt) domain
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR005123 - Oxoglutarate/iron-dependent dioxygenase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF2311022.1 hypothetical protein GH714_019167 [Hevea brasiliensis]1.7e-16972.58Show/hide
Query:  RAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVD
        +AFVYEGFLTDLECDHL+S+A+SELKRS VADN SGKSKLS VRTSSGMFISK KDPIV+GIEDKI+ WTFLPKENGEDIQVLRYEHGQKY+ HYDYFVD
Subjt:  RAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVD

Query:  KVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIH
        KVNIA GGHR ATVLMYL+DV KGGETVFP AE+ P R+A  +DE+LSECA+KGIAVKP++GDALLFFSL PNAIPD +SLH GCPV+EGEKWSATKWIH
Subjt:  KVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIH

Query:  VDSFSKNLGNIGNCTDLNESCGAAGSVDLNNTLEISIPLDVISGMESTTTLIVKLSEARMDEAVAVGASMEVGLMQRQWVEYTRSLFLEDFLDSQFIELQ
        VDSFSKNL   GNCTDLNE C    ++                GM     +  K             ASMEVG MQR WVEYT+SLF+E FLD QF +LQ
Subjt:  VDSFSKNLGNIGNCTDLNESCGAAGSVDLNNTLEISIPLDVISGMESTTTLIVKLSEARMDEAVAVGASMEVGLMQRQWVEYTRSLFLEDFLDSQFIELQ

Query:  KLQDEGNPDFIVEVVSLFFEDSERLLNDLTIAFDQPDVDFQKIDGHVHQLKGSSSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLE
         LQDE NPDF+VEVVSLFFEDSERLLNDLT A DQ  VDF+++D HVHQLKGSSSSIGAQRVKN CIA RSFCEEQN + CL+CLQQ+KQE  LVKNKLE
Subjt:  KLQDEGNPDFIVEVVSLFFEDSERLLNDLTIAFDQPDVDFQKIDGHVHQLKGSSSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLE

Query:  NLFRMEQQIVAAGGSIPATELIF
         L R+EQQIVAAGGS+P  EL F
Subjt:  NLFRMEQQIVAAGGSIPATELIF

KAF4393790.1 hypothetical protein G4B88_007776 [Cannabis sativa]6.3e-17267.45Show/hide
Query:  LFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVSG
        LF    +      ES  SYAGSASS ++P+KVKQISWKPRAF+YEGFLTDLECDHL+S+A+SELKRS VAD++SG+S+LS VRTSSGMFISK+KDPIV+G
Subjt:  LFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVSG

Query:  IEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPKK
        IEDKIS WTFLPKENGEDIQVLRYE GQKYE HYDYF DKVNI  GGHR+ATVLMYL+DV KGGETVFP A ++P  + + T ED SECA+KG+AVK ++
Subjt:  IEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPKK

Query:  GDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIP----LDVISGMESTTTLIVK
        GDALLFFSL P AIPDT SLH GCPV+EGEKWSATKWIHVDSF K++   G CTD+NESC    A G    N    +  P        S    T+  I  
Subjt:  GDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIP----LDVISGMESTTTLIVK

Query:  LSEARMDEAVAVGASMEVGLMQRQWVEYTRSLFLED-FLDSQFIELQKLQDEGNPDFIVEVVSLFFEDSERLLNDLTIAFDQPDVDFQKIDGHVHQLKGS
        +   R +EA     +MEVG MQRQWV+YT+SLF+E  +LDSQF++L +LQDE NPDF+VEVVSLFF+D+E+LLNDLT A +Q  VDF+++D HVHQLKGS
Subjt:  LSEARMDEAVAVGASMEVGLMQRQWVEYTRSLFLED-FLDSQFIELQKLQDEGNPDFIVEVVSLFFEDSERLLNDLTIAFDQPDVDFQKIDGHVHQLKGS

Query:  SSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLENLFRMEQQIVAAGGSIPATELIF
        SSSIGA+RVKN C+A R+ CEEQN D CLRCLQQ+KQE  LVKNKLENLFR+EQQIVAAGGSIP  EL F
Subjt:  SSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLENLFRMEQQIVAAGGSIPATELIF

KAF5941687.1 hypothetical protein HYC85_019329 [Camellia sinensis]5.5e-16865.03Show/hide
Query:  MFKFRNLLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKS
        MF+F     I LI ISS++ ES      S+SS ++PSK KQ+SWKPRAFVYEGFLTD EC+HL+SIA++ELKRS VADN SGKSKLS VRTSSGMFISK+
Subjt:  MFKFRNLLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKG
        KDPIVSGIE+KI+ WTFLPKENGE IQVLRYEHGQKY+ HYDYF+DKVN+A GGHR+ATVLMYLSDV KGGETVFP AE++PH  ++ +D+DLSECA+KG
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIP-------------
        IAVKP+KGDALLFFSL P AIPD  SLHGGCPV+EGEKWSATKWIHVDSF K + + GNCTD NE+C    A G    N    +  P             
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIP-------------

Query:  ----LDVISGM---ESTTTLIVKLSEARMDEAVAVGASMEVGLMQRQWVEYTRSLFLEDFLDSQFIELQKLQDEGNPDFIVEVVSLFFEDSERLLNDLTI
             D   G    +      V +     +E      SMEVG +QRQ+VEYT SLF E FLDSQF +LQ+LQDE NPDF+VEVVSLFFEDSERLLNDL  
Subjt:  ----LDVISGM---ESTTTLIVKLSEARMDEAVAVGASMEVGLMQRQWVEYTRSLFLEDFLDSQFIELQKLQDEGNPDFIVEVVSLFFEDSERLLNDLTI

Query:  AFDQPDVDFQKIDGHVHQLKGSSSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLENLFRMEQQIVAAGGSIPATE
        A DQ  VDF+K+DG+VHQLKGSSSSIGA RVKN C+A R++CEE N + C  CLQQ+KQE  LVKNKLE LF +EQQI+AAGGS+P  E
Subjt:  AFDQPDVDFQKIDGHVHQLKGSSSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLENLFRMEQQIVAAGGSIPATE

KAG6593935.1 putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia]1.9e-21682.2Show/hide
Query:  MFKFRNLLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKS
        M +FR+LLFIFLI I+SVVRES CS A SAS+TVDPSKVKQISWKPRAFVYEGFLTDLECDHL+SIARSELKRSEVADN+SGKSKLSTVRTSSGMFI KS
Subjt:  MFKFRNLLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKG
        KD IVSGIEDKI+AWTFLPKENGEDIQVLRYEHGQ+YESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFP+AEKSPHRRA+ETDEDLS+CARKG
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIPLDVISGMESTTTL
        IAVKPKKGDALLFFSLEPNAIPDT SLHGGCPVLEGEKWSATKWIHVDSFSKNL ++GNCTDLNESC    A G    N    +  P ++   ++S    
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIPLDVISGMESTTTL

Query:  IVKLSEARMDEAVAVGASMEVGLMQRQWVEYTRSLFLEDFLDSQFIELQKLQDEGNPDFIVEVVSLFFEDSERLLNDLTIAFDQPDVDFQKIDGHVHQLK
                         SMEVGLMQRQWVEYT+SLF+E FLDSQFIELQKLQDEGNPDFIVEVVSLFFEDSERLLNDLT A ++PDVDFQK+DGHVHQLK
Subjt:  IVKLSEARMDEAVAVGASMEVGLMQRQWVEYTRSLFLEDFLDSQFIELQKLQDEGNPDFIVEVVSLFFEDSERLLNDLTIAFDQPDVDFQKIDGHVHQLK

Query:  GSSSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLENLFRMEQQIVAAGGSIPATELIF
        GSSSSIGAQRVKNVCI +RSFCEEQN++GCLR LQQLKQETCLVKNK ENLFRMEQQI+AAGGSIP TEL+F
Subjt:  GSSSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLENLFRMEQQIVAAGGSIPATELIF

RXH80310.1 hypothetical protein DVH24_041457 [Malus domestica]2.2e-17266.53Show/hide
Query:  NLLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIV
        +LL + L  + S+   S+ S + S + TV+PSKVKQISW PRAFVYEG LTD ECDHL+SIA+SELKRS VADN SG+SKLS VRTSSGMFI K+KDPIV
Subjt:  NLLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIV

Query:  SGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKP
        +GIEDK+S WTFLPKENGEDIQVLRYE GQKYE HYDYFVDKVNIA GGHR+ATVLMYL+DV KGGETVFP AE    R+AAE D  LSECA+KGIAVKP
Subjt:  SGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKP

Query:  KKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIPLDVISGMESTTTLIVK--
        ++GDALLFFSL P+A+PD NSLH GCPV+EGEKWSATKWIHVDSF KNL   G+C DLNESC    A G    N    +  P        S  +LI    
Subjt:  KKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIPLDVISGMESTTTLIVK--

Query:  -----------LSEARMDEAVAVGAS----MEVGLMQRQWVEYTRSLFLEDFLDSQFIELQKLQDEGNPDFIVEVVSLFFEDSERLLNDLTIAFDQPDVD
                   L       +V V  +    MEVG MQRQWV+YT+SLFLE FLD QF++LQ+LQDE NPDF+VEVVSLFFEDSE+LLNDLT A +QP VD
Subjt:  -----------LSEARMDEAVAVGAS----MEVGLMQRQWVEYTRSLFLEDFLDSQFIELQKLQDEGNPDFIVEVVSLFFEDSERLLNDLTIAFDQPDVD

Query:  FQKIDGHVHQLKGSSSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLENLFRMEQQIVAAGGSIPATELIF
        F+++D HVHQ KGSSSSIGAQRVKN CIA R+FCEE+N +GC+RC+QQ+K E  LVK+KLE LF MEQQI+AAGGSIP  EL F
Subjt:  FQKIDGHVHQLKGSSSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLENLFRMEQQIVAAGGSIPATELIF

TrEMBL top hitse value%identityAlignment
A0A1Q3B5T1 Procollagen-proline 4-dioxygenase7.3e-16664.57Show/hide
Query:  LLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVS
        L F++ + IS ++++   S+  S SS +DPSKVKQIS KPRA+VYEGFLT LECDHL+S+A+SELKRS VADN SGKSKLS VRTSSGMFI K KDPIV 
Subjt:  LLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVS

Query:  GIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPK
        GIEDKIS WTFLPKENGEDIQVLRYE GQKYE H+DYFVDKVNIA GGHR+ATVL+YL+DV KGGETVFP AE S  R+ + T+ DLSEC RKG+AVKP+
Subjt:  GIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPK

Query:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCG--------------AAGSVDLNNTLEISIPLDVISGME
        +GDALLFFSL PNA+PD +SLH GCPV+EGEKWSATKWIHVDSF KNL   GNCTD NESC                 GS +L      S  +      +
Subjt:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCG--------------AAGSVDLNNTLEISIPLDVISGME

Query:  STT--TLIVKLSEARMDEAVAVG------ASMEVGLMQRQWVEYTRSLFLEDFLDSQFIELQKLQDEGNPDFIVEVVSLFFEDSERLLNDLTIAFDQPDV
             TL V +S+ R+++    G        MEVG MQR+ ++YT++LF+E FLD QF++LQ+LQDE NP F+VEVVSLFF+DSERLLNDLT A DQP V
Subjt:  STT--TLIVKLSEARMDEAVAVG------ASMEVGLMQRQWVEYTRSLFLEDFLDSQFIELQKLQDEGNPDFIVEVVSLFFEDSERLLNDLTIAFDQPDV

Query:  DFQKIDGHVHQLKGSSSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLENLFRMEQQIVAAGGS
        DF ++D HVHQLKGSSSSI AQR+KN  +A R+FCEEQN++ C RCLQQ+KQE  L +N LE LFR+EQQIVAAGGS
Subjt:  DFQKIDGHVHQLKGSSSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLENLFRMEQQIVAAGGS

A0A498IAV7 Uncharacterized protein1.1e-17266.53Show/hide
Query:  NLLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIV
        +LL + L  + S+   S+ S + S + TV+PSKVKQISW PRAFVYEG LTD ECDHL+SIA+SELKRS VADN SG+SKLS VRTSSGMFI K+KDPIV
Subjt:  NLLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIV

Query:  SGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKP
        +GIEDK+S WTFLPKENGEDIQVLRYE GQKYE HYDYFVDKVNIA GGHR+ATVLMYL+DV KGGETVFP AE    R+AAE D  LSECA+KGIAVKP
Subjt:  SGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKP

Query:  KKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIPLDVISGMESTTTLIVK--
        ++GDALLFFSL P+A+PD NSLH GCPV+EGEKWSATKWIHVDSF KNL   G+C DLNESC    A G    N    +  P        S  +LI    
Subjt:  KKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIPLDVISGMESTTTLIVK--

Query:  -----------LSEARMDEAVAVGAS----MEVGLMQRQWVEYTRSLFLEDFLDSQFIELQKLQDEGNPDFIVEVVSLFFEDSERLLNDLTIAFDQPDVD
                   L       +V V  +    MEVG MQRQWV+YT+SLFLE FLD QF++LQ+LQDE NPDF+VEVVSLFFEDSE+LLNDLT A +QP VD
Subjt:  -----------LSEARMDEAVAVGAS----MEVGLMQRQWVEYTRSLFLEDFLDSQFIELQKLQDEGNPDFIVEVVSLFFEDSERLLNDLTIAFDQPDVD

Query:  FQKIDGHVHQLKGSSSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLENLFRMEQQIVAAGGSIPATELIF
        F+++D HVHQ KGSSSSIGAQRVKN CIA R+FCEE+N +GC+RC+QQ+K E  LVK+KLE LF MEQQI+AAGGSIP  EL F
Subjt:  FQKIDGHVHQLKGSSSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLENLFRMEQQIVAAGGSIPATELIF

A0A6A6MBE0 Uncharacterized protein8.3e-17072.58Show/hide
Query:  RAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVD
        +AFVYEGFLTDLECDHL+S+A+SELKRS VADN SGKSKLS VRTSSGMFISK KDPIV+GIEDKI+ WTFLPKENGEDIQVLRYEHGQKY+ HYDYFVD
Subjt:  RAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVD

Query:  KVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIH
        KVNIA GGHR ATVLMYL+DV KGGETVFP AE+ P R+A  +DE+LSECA+KGIAVKP++GDALLFFSL PNAIPD +SLH GCPV+EGEKWSATKWIH
Subjt:  KVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIH

Query:  VDSFSKNLGNIGNCTDLNESCGAAGSVDLNNTLEISIPLDVISGMESTTTLIVKLSEARMDEAVAVGASMEVGLMQRQWVEYTRSLFLEDFLDSQFIELQ
        VDSFSKNL   GNCTDLNE C    ++                GM     +  K             ASMEVG MQR WVEYT+SLF+E FLD QF +LQ
Subjt:  VDSFSKNLGNIGNCTDLNESCGAAGSVDLNNTLEISIPLDVISGMESTTTLIVKLSEARMDEAVAVGASMEVGLMQRQWVEYTRSLFLEDFLDSQFIELQ

Query:  KLQDEGNPDFIVEVVSLFFEDSERLLNDLTIAFDQPDVDFQKIDGHVHQLKGSSSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLE
         LQDE NPDF+VEVVSLFFEDSERLLNDLT A DQ  VDF+++D HVHQLKGSSSSIGAQRVKN CIA RSFCEEQN + CL+CLQQ+KQE  LVKNKLE
Subjt:  KLQDEGNPDFIVEVVSLFFEDSERLLNDLTIAFDQPDVDFQKIDGHVHQLKGSSSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLE

Query:  NLFRMEQQIVAAGGSIPATELIF
         L R+EQQIVAAGGS+P  EL F
Subjt:  NLFRMEQQIVAAGGSIPATELIF

A0A7J6HHC7 Uncharacterized protein3.1e-17267.45Show/hide
Query:  LFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVSG
        LF    +      ES  SYAGSASS ++P+KVKQISWKPRAF+YEGFLTDLECDHL+S+A+SELKRS VAD++SG+S+LS VRTSSGMFISK+KDPIV+G
Subjt:  LFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVSG

Query:  IEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPKK
        IEDKIS WTFLPKENGEDIQVLRYE GQKYE HYDYF DKVNI  GGHR+ATVLMYL+DV KGGETVFP A ++P  + + T ED SECA+KG+AVK ++
Subjt:  IEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPKK

Query:  GDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIP----LDVISGMESTTTLIVK
        GDALLFFSL P AIPDT SLH GCPV+EGEKWSATKWIHVDSF K++   G CTD+NESC    A G    N    +  P        S    T+  I  
Subjt:  GDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIP----LDVISGMESTTTLIVK

Query:  LSEARMDEAVAVGASMEVGLMQRQWVEYTRSLFLED-FLDSQFIELQKLQDEGNPDFIVEVVSLFFEDSERLLNDLTIAFDQPDVDFQKIDGHVHQLKGS
        +   R +EA     +MEVG MQRQWV+YT+SLF+E  +LDSQF++L +LQDE NPDF+VEVVSLFF+D+E+LLNDLT A +Q  VDF+++D HVHQLKGS
Subjt:  LSEARMDEAVAVGASMEVGLMQRQWVEYTRSLFLED-FLDSQFIELQKLQDEGNPDFIVEVVSLFFEDSERLLNDLTIAFDQPDVDFQKIDGHVHQLKGS

Query:  SSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLENLFRMEQQIVAAGGSIPATELIF
        SSSIGA+RVKN C+A R+ CEEQN D CLRCLQQ+KQE  LVKNKLENLFR+EQQIVAAGGSIP  EL F
Subjt:  SSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLENLFRMEQQIVAAGGSIPATELIF

A0A7J7GMG3 Uncharacterized protein2.7e-16865.03Show/hide
Query:  MFKFRNLLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKS
        MF+F     I LI ISS++ ES      S+SS ++PSK KQ+SWKPRAFVYEGFLTD EC+HL+SIA++ELKRS VADN SGKSKLS VRTSSGMFISK+
Subjt:  MFKFRNLLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKS

Query:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKG
        KDPIVSGIE+KI+ WTFLPKENGE IQVLRYEHGQKY+ HYDYF+DKVN+A GGHR+ATVLMYLSDV KGGETVFP AE++PH  ++ +D+DLSECA+KG
Subjt:  KDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKG

Query:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIP-------------
        IAVKP+KGDALLFFSL P AIPD  SLHGGCPV+EGEKWSATKWIHVDSF K + + GNCTD NE+C    A G    N    +  P             
Subjt:  IAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIP-------------

Query:  ----LDVISGM---ESTTTLIVKLSEARMDEAVAVGASMEVGLMQRQWVEYTRSLFLEDFLDSQFIELQKLQDEGNPDFIVEVVSLFFEDSERLLNDLTI
             D   G    +      V +     +E      SMEVG +QRQ+VEYT SLF E FLDSQF +LQ+LQDE NPDF+VEVVSLFFEDSERLLNDL  
Subjt:  ----LDVISGM---ESTTTLIVKLSEARMDEAVAVGASMEVGLMQRQWVEYTRSLFLEDFLDSQFIELQKLQDEGNPDFIVEVVSLFFEDSERLLNDLTI

Query:  AFDQPDVDFQKIDGHVHQLKGSSSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLENLFRMEQQIVAAGGSIPATE
        A DQ  VDF+K+DG+VHQLKGSSSSIGA RVKN C+A R++CEE N + C  CLQQ+KQE  LVKNKLE LF +EQQI+AAGGS+P  E
Subjt:  AFDQPDVDFQKIDGHVHQLKGSSSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCLVKNKLENLFRMEQQIVAAGGSIPATE

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 61.2e-7758.78Show/hide
Query:  SYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGKSKLSTVRTSSGMFISKSKDPIVSGIEDKISAWTFLPKENG
        S   S S +VDP+++ Q+SW PRAF+Y+GFL+D ECDHL+ +A+ +L++S  VAD DSG+S+ S VRTSSGMF++K +D IV+ +E K++AWTFLP+ENG
Subjt:  SYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGKSKLSTVRTSSGMFISKSKDPIVSGIEDKISAWTFLPKENG

Query:  EDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFP-LAEKSPHRRAAETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIP
        E +Q+L YE+GQKY+ H+DYF DK  +  GGHR+ATVLMYLS+VTKGGETVFP    K+P  +    D+  S+CA++G AVKP+KGDALLFF+L  N   
Subjt:  EDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFP-LAEKSPHRRAAETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIP

Query:  DTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC
        D NSLHG CPV+EGEKWSAT+WIHV SF K       C D +ESC
Subjt:  DTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC

F4JAU3 Prolyl 4-hydroxylase 22.8e-10671.65Show/hide
Query:  LLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVS
        LLF+ ++L+  +++ STC    S SS ++PSKVKQ+S KPRAFVYEGFLTDLECDHL+S+A+  L+RS VADND+G+S++S VRTSSG FISK KDPIVS
Subjt:  LLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVS

Query:  GIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPK
        GIEDK+S WTFLPKENGED+QVLRYEHGQKY++H+DYF DKVNIA GGHR+ATVL+YLS+VTKGGETVFP A++   R  +E  +DLS+CA+KGIAVKPK
Subjt:  GIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPK

Query:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC
        KG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSATKWIHVDSF K L + GNCTD+NESC
Subjt:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC

Q8L970 Probable prolyl 4-hydroxylase 72.2e-8255.36Show/hide
Query:  FLILISSVVRESTCSYAGSASS-TVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVSGIE
        FL   S+    S      SASS   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSG+S  S VRTSSGMF+SK +D IVS +E
Subjt:  FLILISSVVRESTCSYAGSASS-TVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVSGIE

Query:  DKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPKKGD
         K++AWTFLP+ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLS+V KGGETVFP+ +    +     D+  +ECA++G AVKP+KGD
Subjt:  DKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPKKGD

Query:  ALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIPLD
        ALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D N SC     AG    N T  +    D
Subjt:  ALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIPLD

Q8LAN3 Probable prolyl 4-hydroxylase 44.3e-10772.62Show/hide
Query:  RNLLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPI
        R  L I    I SV+ +S+ S   S+S  V+PSKVKQ+S KPRAFVYEGFLT+LECDH+VS+A++ LKRS VADNDSG+SK S VRTSSG FISK KDPI
Subjt:  RNLLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPI

Query:  VSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVK
        VSGIEDKIS WTFLPKENGEDIQVLRYEHGQKY++H+DYF DKVNI  GGHR+AT+LMYLS+VTKGGETVFP AE    R  +E  EDLS+CA++GIAVK
Subjt:  VSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVK

Query:  PKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC
        P+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIHVDSF + +   GNCTD+NESC
Subjt:  PKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC

Q9LN20 Probable prolyl 4-hydroxylase 32.0e-6454.55Show/hide
Query:  ISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHY
        +SW+PRAFVY  FL+  EC++L+S+A+  + +S V D+++GKSK S VRTSSG F+ + +D I+  IE +I+ +TF+P ++GE +QVL YE GQKYE HY
Subjt:  ISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHY

Query:  DYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSA
        DYFVD+ N   GG R+AT+LMYLSDV +GGETVFP A  + +  +     +LSEC +KG++VKP+ GDALLF+S+ P+A  D  SLHGGCPV+ G KWS+
Subjt:  DYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSA

Query:  TKWIHVDSF
        TKW+HV  +
Subjt:  TKWIHVDSF

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 22.0e-10771.65Show/hide
Query:  LLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVS
        LLF+ ++L+  +++ STC    S SS ++PSKVKQ+S KPRAFVYEGFLTDLECDHL+S+A+  L+RS VADND+G+S++S VRTSSG FISK KDPIVS
Subjt:  LLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVS

Query:  GIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPK
        GIEDK+S WTFLPKENGED+QVLRYEHGQKY++H+DYF DKVNIA GGHR+ATVL+YLS+VTKGGETVFP A++   R  +E  +DLS+CA+KGIAVKPK
Subjt:  GIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPK

Query:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC
        KG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSATKWIHVDSF K L + GNCTD+NESC
Subjt:  KGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase1.5e-8355.36Show/hide
Query:  FLILISSVVRESTCSYAGSASS-TVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVSGIE
        FL   S+    S      SASS   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSG+S  S VRTSSGMF+SK +D IVS +E
Subjt:  FLILISSVVRESTCSYAGSASS-TVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVSGIE

Query:  DKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPKKGD
         K++AWTFLP+ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLS+V KGGETVFP+ +    +     D+  +ECA++G AVKP+KGD
Subjt:  DKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPKKGD

Query:  ALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIPLD
        ALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D N SC     AG    N T  +    D
Subjt:  ALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIPLD

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase8.2e-7751.74Show/hide
Query:  FLILISSVVRESTCSYAGSASS-TVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKS-----KLSTVRTSSGMFISKSK---
        FL   S+    S      SASS   DP++V Q+SW PR F+YEGFL+D ECDH + +A+ +L++S VADNDSG+S      +S VR SS    +      
Subjt:  FLILISSVVRESTCSYAGSASS-TVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKS-----KLSTVRTSSGMFISKSK---

Query:  DPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGI
        D IVS +E K++AWTFLP+ENGE +Q+L YE+GQKYE H+DYF D+ N+  GGHR+ATVLMYLS+V KGGETVFP+ +    +     D+  +ECA++G 
Subjt:  DPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGI

Query:  AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIPLD
        AVKP+KGDALLFF+L PNA  D+NSLHG CPV+EGEKWSAT+WIHV SF +       C D N SC     AG    N T  +    D
Subjt:  AVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC---GAAGSVDLNNTLEISIPLD

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase8.7e-7958.78Show/hide
Query:  SYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGKSKLSTVRTSSGMFISKSKDPIVSGIEDKISAWTFLPKENG
        S   S S +VDP+++ Q+SW PRAF+Y+GFL+D ECDHL+ +A+ +L++S  VAD DSG+S+ S VRTSSGMF++K +D IV+ +E K++AWTFLP+ENG
Subjt:  SYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRS-EVADNDSGKSKLSTVRTSSGMFISKSKDPIVSGIEDKISAWTFLPKENG

Query:  EDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFP-LAEKSPHRRAAETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIP
        E +Q+L YE+GQKY+ H+DYF DK  +  GGHR+ATVLMYLS+VTKGGETVFP    K+P  +    D+  S+CA++G AVKP+KGDALLFF+L  N   
Subjt:  EDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFP-LAEKSPHRRAAETDEDLSECARKGIAVKPKKGDALLFFSLEPNAIP

Query:  DTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC
        D NSLHG CPV+EGEKWSAT+WIHV SF K       C D +ESC
Subjt:  DTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.1e-10872.62Show/hide
Query:  RNLLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPI
        R  L I    I SV+ +S+ S   S+S  V+PSKVKQ+S KPRAFVYEGFLT+LECDH+VS+A++ LKRS VADNDSG+SK S VRTSSG FISK KDPI
Subjt:  RNLLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPI

Query:  VSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVK
        VSGIEDKIS WTFLPKENGEDIQVLRYEHGQKY++H+DYF DKVNI  GGHR+AT+LMYLS+VTKGGETVFP AE    R  +E  EDLS+CA++GIAVK
Subjt:  VSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVK

Query:  PKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC
        P+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIHVDSF + +   GNCTD+NESC
Subjt:  PKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAAATTTCGTAATCTGTTATTCATCTTCTTGATTTTGATCTCATCGGTTGTTCGGGAATCAACTTGTTCGTATGCTGGTTCGGCTAGCTCCACCGTAGATCCTAG
TAAAGTGAAGCAGATTTCATGGAAACCGAGAGCTTTTGTATATGAAGGTTTTCTCACGGACCTAGAATGCGACCACCTGGTTTCTATAGCGAGATCCGAGCTAAAGAGAT
CTGAGGTTGCTGATAATGATTCAGGAAAGAGCAAGCTCAGTACTGTTCGAACGAGTTCAGGAATGTTCATTTCTAAAAGCAAGGATCCTATTGTTTCTGGCATAGAGGAC
AAAATTTCTGCATGGACTTTTCTTCCAAAAGAAAATGGGGAGGATATTCAGGTATTGAGATATGAGCATGGGCAGAAATATGAGTCACATTATGATTACTTTGTTGACAA
GGTGAATATTGCCTGGGGAGGACATCGTTTAGCTACAGTCCTTATGTATCTCTCTGATGTGACTAAAGGCGGTGAAACAGTTTTCCCCTTGGCAGAGAAATCTCCCCACC
GGAGGGCTGCTGAAACAGATGAGGATCTCTCAGAGTGTGCAAGGAAAGGAATTGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTTTTCTTTAGTCTTGAACCAAATGCT
ATCCCAGACACAAACAGTCTCCATGGAGGTTGCCCTGTCCTTGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTAGATTCCTTCAGCAAAAACTTAGGAAACAT
TGGGAACTGTACTGATCTAAATGAAAGTTGCGGAGCTGCAGGATCTGTTGATCTCAACAATACACTCGAAATTTCGATACCTTTAGACGTTATTAGTGGAATGGAAAGCA
CTACAACATTGATTGTGAAGCTATCCGAAGCACGGATGGATGAGGCAGTAGCCGTTGGTGCAAGCATGGAGGTGGGGTTGATGCAGAGACAATGGGTTGAGTACACAAGG
TCGTTATTCTTGGAGGATTTCTTGGACAGTCAGTTTATAGAGCTGCAGAAACTGCAAGATGAGGGCAACCCAGATTTCATTGTTGAAGTGGTGTCTCTTTTCTTCGAGGA
TTCTGAGAGGCTTCTCAATGATCTCACCATAGCATTTGATCAACCTGATGTGGACTTCCAAAAGATTGATGGCCATGTTCACCAGCTTAAAGGAAGCAGTTCGAGCATAG
GTGCACAGAGAGTTAAAAATGTCTGTATTGCCATGCGCAGCTTCTGTGAGGAACAGAACATGGATGGGTGCCTGAGATGTCTGCAACAATTAAAGCAAGAGACCTGCCTT
GTGAAAAACAAGCTTGAGAATCTATTCAGGATGGAGCAACAAATTGTGGCTGCTGGTGGGTCAATCCCTGCAACAGAATTGATATTCTAA
mRNA sequenceShow/hide mRNA sequence
GGAAAGAAAGCCTCCGCTCCGTGAAATTCTGGATTTCCAGACTTCCATCAAATTTCCTCCGATAACTCTCTCTATCGCTCTCGCTCTCGCTCTCTTTCTAATTTGATCCG
ACCGACACTATGTTCAAATTTCGTAATCTGTTATTCATCTTCTTGATTTTGATCTCATCGGTTGTTCGGGAATCAACTTGTTCGTATGCTGGTTCGGCTAGCTCCACCGT
AGATCCTAGTAAAGTGAAGCAGATTTCATGGAAACCGAGAGCTTTTGTATATGAAGGTTTTCTCACGGACCTAGAATGCGACCACCTGGTTTCTATAGCGAGATCCGAGC
TAAAGAGATCTGAGGTTGCTGATAATGATTCAGGAAAGAGCAAGCTCAGTACTGTTCGAACGAGTTCAGGAATGTTCATTTCTAAAAGCAAGGATCCTATTGTTTCTGGC
ATAGAGGACAAAATTTCTGCATGGACTTTTCTTCCAAAAGAAAATGGGGAGGATATTCAGGTATTGAGATATGAGCATGGGCAGAAATATGAGTCACATTATGATTACTT
TGTTGACAAGGTGAATATTGCCTGGGGAGGACATCGTTTAGCTACAGTCCTTATGTATCTCTCTGATGTGACTAAAGGCGGTGAAACAGTTTTCCCCTTGGCAGAGAAAT
CTCCCCACCGGAGGGCTGCTGAAACAGATGAGGATCTCTCAGAGTGTGCAAGGAAAGGAATTGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTTTTCTTTAGTCTTGAA
CCAAATGCTATCCCAGACACAAACAGTCTCCATGGAGGTTGCCCTGTCCTTGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTAGATTCCTTCAGCAAAAACTT
AGGAAACATTGGGAACTGTACTGATCTAAATGAAAGTTGCGGAGCTGCAGGATCTGTTGATCTCAACAATACACTCGAAATTTCGATACCTTTAGACGTTATTAGTGGAA
TGGAAAGCACTACAACATTGATTGTGAAGCTATCCGAAGCACGGATGGATGAGGCAGTAGCCGTTGGTGCAAGCATGGAGGTGGGGTTGATGCAGAGACAATGGGTTGAG
TACACAAGGTCGTTATTCTTGGAGGATTTCTTGGACAGTCAGTTTATAGAGCTGCAGAAACTGCAAGATGAGGGCAACCCAGATTTCATTGTTGAAGTGGTGTCTCTTTT
CTTCGAGGATTCTGAGAGGCTTCTCAATGATCTCACCATAGCATTTGATCAACCTGATGTGGACTTCCAAAAGATTGATGGCCATGTTCACCAGCTTAAAGGAAGCAGTT
CGAGCATAGGTGCACAGAGAGTTAAAAATGTCTGTATTGCCATGCGCAGCTTCTGTGAGGAACAGAACATGGATGGGTGCCTGAGATGTCTGCAACAATTAAAGCAAGAG
ACCTGCCTTGTGAAAAACAAGCTTGAGAATCTATTCAGGATGGAGCAACAAATTGTGGCTGCTGGTGGGTCAATCCCTGCAACAGAATTGATATTCTAAGTTGAACCCAA
GGGCCAAGAGAATACTTCTTAGAGTCACAGTTGATGGGAATGTTGTTTTTCCATTCTTTTGGGATCCTTGTGAAATTATTGTTCTTCTGTTTGATGAGATCTGTATGAAG
GTTTAATGTGTGAGGAAGTATTTTCTTTGGATAGGAGTGAGGAAGTATTTTCTAAACAATGCCACTTTCCTGTGTGACGTCAATGGTAAAAGAAGGTTGTATTTAAGATT
CCGTGGAGTTCCACATTGTGAAAAAGACTTGGCTTGGCGTGTGTTTATAAATATTTGAGGACTCTTCATTTTCAAGTTGATTTTAAAGGTAGGTAAAAGCTCTACCAATT
GTGTCTGAGTGCGCCTATTCTGCATTGACAAGACATCAGAGGAGGACTTCTACGGCTGAATTTAGGGGGCGCAATACCAAATGTGTCAATACATATATAATTTCAACTAC
ATAATCCTGATGTGAACATTAATACTAGATCAGAGACTAACCTAAGCAAGTCGTTGTGTCACTATCAAAACAGAAACTTATACCTTGAAAGGAAAGATGATGCATTTTCC
CTGCCCCTCTTTGGAGAAAGATTTTGATAATTTATCTGAAAGAAGGATCTTTAATCAAGAAGAGTGGAACCTTTCTCTTAAGGAGGAAGGTGAAATTCTTGGATGGTTGA
GCATTTCAATGGATACTTGTAAAGAATTGTCATATGGCACCTCAGTGTCAGATATACTTGATGATTTATGTGGAAATTTGAGGATTGTCAAGGAATTTGGATAAAAATAA
ACACCAGGTGTCTAAGTCTATGAACGGATAGGTAGCATATATTATGATATTAAACACCTCATAGCGAGCATTTCAAAGAAATATTTCTTGCACATCCACTTGGCTTGTCC
TAATTTAGCATAAGTTATAAAAACTCGATTAAAACGAAACAAAATGGTATCTTCAGTTAGATGGAAA
Protein sequenceShow/hide protein sequence
MFKFRNLLFIFLILISSVVRESTCSYAGSASSTVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKSKDPIVSGIED
KISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPLAEKSPHRRAAETDEDLSECARKGIAVKPKKGDALLFFSLEPNA
IPDTNSLHGGCPVLEGEKWSATKWIHVDSFSKNLGNIGNCTDLNESCGAAGSVDLNNTLEISIPLDVISGMESTTTLIVKLSEARMDEAVAVGASMEVGLMQRQWVEYTR
SLFLEDFLDSQFIELQKLQDEGNPDFIVEVVSLFFEDSERLLNDLTIAFDQPDVDFQKIDGHVHQLKGSSSSIGAQRVKNVCIAMRSFCEEQNMDGCLRCLQQLKQETCL
VKNKLENLFRMEQQIVAAGGSIPATELIF