; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg15947 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg15947
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationCarg_Chr08:7285977..7289589
RNA-Seq ExpressionCarg15947
SyntenyCarg15947
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593935.1 putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia]2.7e-15982.77Show/hide
Query:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
        MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
Subjt:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS

Query:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV
        KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAE                     
Subjt:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV

Query:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT
                                             KSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT
Subjt:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT

Query:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRS
        KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGY + S
Subjt:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRS

KAG7026278.1 putative prolyl 4-hydroxylase 4 [Cucurbita argyrosperma subsp. argyrosperma]3.4e-210100Show/hide
Query:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
        MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
Subjt:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS

Query:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV
        KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV
Subjt:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV

Query:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT
        CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT
Subjt:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT

Query:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
        KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
Subjt:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC

XP_022930331.1 probable prolyl 4-hydroxylase 4 isoform X4 [Cucurbita moschata]1.4e-16382.96Show/hide
Query:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
        MSRFRS+LFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
Subjt:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS

Query:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV
        KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAE                     
Subjt:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV

Query:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT
                                             KSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDT+SLHGGCPVLEGEKWSAT
Subjt:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT

Query:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
        KWIHVDSFSKNLA+VGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
Subjt:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC

XP_023000081.1 probable prolyl 4-hydroxylase 4 [Cucurbita maxima]4.5e-16282.4Show/hide
Query:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
        MS+FRSLLFIFLISIASVVRESICS ARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
Subjt:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS

Query:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV
        KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAE                     
Subjt:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV

Query:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT
                                             KSPHRRASETDEDLS+CARKGIAVKPKKGDALLFFSLEPNAIPDT+SLHGGCPVLEGEKWSAT
Subjt:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT

Query:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
        KWIHVDSFSKNLA+VGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
Subjt:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC

XP_023514355.1 probable prolyl 4-hydroxylase 4 isoform X1 [Cucurbita pepo subsp. pepo]8.2e-16483.24Show/hide
Query:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
        MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
Subjt:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS

Query:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV
        KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAE                     
Subjt:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV

Query:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT
                                             KSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDT+SLHGGCPVLEGEKWSAT
Subjt:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT

Query:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
        KWIHVDSFSKNLA+VGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
Subjt:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC

TrEMBL top hitse value%identityAlignment
A0A6J1EQM4 Procollagen-proline 4-dioxygenase6.8e-16482.96Show/hide
Query:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
        MSRFRS+LFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
Subjt:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS

Query:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV
        KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAE                     
Subjt:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV

Query:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT
                                             KSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDT+SLHGGCPVLEGEKWSAT
Subjt:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT

Query:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
        KWIHVDSFSKNLA+VGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
Subjt:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC

A0A6J1ER58 Procollagen-proline 4-dioxygenase4.6e-15282.11Show/hide
Query:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
        MSRFRS+LFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
Subjt:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS

Query:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV
        KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAE                     
Subjt:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV

Query:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT
                                             KSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDT+SLHGGCPVLEGEKWSAT
Subjt:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT

Query:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYM
        KWIHVDSFSKNLA+VGNCTDLNESCERWAALGECTKNPEYM
Subjt:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYM

A0A6J1EUU4 Procollagen-proline 4-dioxygenase4.6e-15282.11Show/hide
Query:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
        MSRFRS+LFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
Subjt:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS

Query:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV
        KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAE                     
Subjt:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV

Query:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT
                                             KSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDT+SLHGGCPVLEGEKWSAT
Subjt:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT

Query:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYM
        KWIHVDSFSKNLA+VGNCTDLNESCERWAALGECTKNPEYM
Subjt:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYM

A0A6J1EWM7 Procollagen-proline 4-dioxygenase4.6e-15282.11Show/hide
Query:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
        MSRFRS+LFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
Subjt:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS

Query:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV
        KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAE                     
Subjt:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV

Query:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT
                                             KSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDT+SLHGGCPVLEGEKWSAT
Subjt:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT

Query:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYM
        KWIHVDSFSKNLA+VGNCTDLNESCERWAALGECTKNPEYM
Subjt:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYM

A0A6J1KCK1 Procollagen-proline 4-dioxygenase2.2e-16282.4Show/hide
Query:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
        MS+FRSLLFIFLISIASVVRESICS ARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
Subjt:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS

Query:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV
        KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAE                     
Subjt:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV

Query:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT
                                             KSPHRRASETDEDLS+CARKGIAVKPKKGDALLFFSLEPNAIPDT+SLHGGCPVLEGEKWSAT
Subjt:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT

Query:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
        KWIHVDSFSKNLA+VGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
Subjt:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 66.1e-8548.66Show/hide
Query:  ICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRS-EVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPKE
        I S   S S +VDP+++ Q+SW PRAF+Y+GFL+D ECDHLI +A+ +L++S  VAD +SG+S+ S VRTSSGMF+ K +D IV+ +E K+AAWTFLP+E
Subjt:  ICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRS-EVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPKE

Query:  NGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAVCQQGRWASKESGFRGFHIDWR
        NGE +Q+L YE+GQ+Y+ H+DYF DK  +  GGHR+ATVLMYLS+VTKGGETVFP                                          +W+
Subjt:  NGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAVCQQGRWASKESGFRGFHIDWR

Query:  RKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDSFSKNLADVGNCTDL
                        K+P  +    D+  S CA++G AVKP+KGDALLFF+L  N   D  SLHG CPV+EGEKWSAT+WIHV SF K       C D 
Subjt:  RKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDSFSKNLADVGNCTDL

Query:  NESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
        +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Subjt:  NESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC

F4JAU3 Prolyl 4-hydroxylase 21.9e-11558.94Show/hide
Query:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
        MSR   LLF+ ++ +  +++ S C    S S+ ++PSKVKQ+S KPRAFVYEGFLTDLECDHLIS+A+  L+RS VADN++G+S++S VRTSSG FI K 
Subjt:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS

Query:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV
        KD IVSGIEDK++ WTFLPKENGED+QVLRYEHGQ+Y++H+DYF DKVNIA GGHR+ATVL+YLS+VTKGGETVFP A+                     
Subjt:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV

Query:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT
                                             +   R  SE  +DLSDCA+KGIAVKPKKG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSAT
Subjt:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT

Query:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
        KWIHVDSF K L   GNCTD+NESCERWA LGEC KNPEYMVG+PE+PG CRRSC+ C
Subjt:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC

Q8L970 Probable prolyl 4-hydroxylase 72.5e-9149.7Show/hide
Query:  SASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPKENGEDIQV
        ++S   DP++V Q+SW PR F+YEGFL+D ECDH I +A+ +L++S VADN+SG+S  S VRTSSGMF+ K +D IVS +E K+AAWTFLP+ENGE +Q+
Subjt:  SASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPKENGEDIQV

Query:  LRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAVCQQGRWASKESGFRGFHIDWRRKQVLAR
        L YE+GQ+YE H+DYF D+ N+  GGHR+ATVLMYLS+V KGGETVFPM                            W  K +  +              
Subjt:  LRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAVCQQGRWASKESGFRGFHIDWRRKQVLAR

Query:  TLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDSFSKNLADVGNCTDLNESCERW
                           D+  ++CA++G AVKP+KGDALLFF+L PNA  D+ SLHG CPV+EGEKWSAT+WIHV SF +       C D N SCE+W
Subjt:  TLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDSFSKNLADVGNCTDLNESCERW

Query:  AALGECTKNPEYMVGSPELPGYCRRSCRTC
        A  GEC KNP YMVGS +  GYCR+SC+ C
Subjt:  AALGECTKNPEYMVGSPELPGYCRRSCRTC

Q8LAN3 Probable prolyl 4-hydroxylase 41.3e-11961.25Show/hide
Query:  LFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSG
        L I   +I SV+ +S  S   S+S  V+PSKVKQ+S KPRAFVYEGFLT+LECDH++S+A++ LKRS VADN+SG+SK S VRTSSG FI K KD IVSG
Subjt:  LFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSG

Query:  IEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAVCQQGRWA
        IEDKI+ WTFLPKENGEDIQVLRYEHGQ+Y++H+DYF DKVNI  GGHR+AT+LMYLS+VTKGGETVFP AE     +P++R+                 
Subjt:  IEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAVCQQGRWA

Query:  SKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDS
                                             SE  EDLSDCA++GIAVKP+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIHVDS
Subjt:  SKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDS

Query:  FSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
        F + +   GNCTD+NESCERWA LGECTKNPEYMVG+ ELPGYCRRSC+ C
Subjt:  FSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC

Q9LN20 Probable prolyl 4-hydroxylase 31.0e-6044.19Show/hide
Query:  ISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHY
        +SW+PRAFVY  FL+  EC++LIS+A+  + +S V D+E+GKSK S VRTSSG F+ + +D I+  IE +IA +TF+P ++GE +QVL YE GQ+YE HY
Subjt:  ISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHY

Query:  DYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAVCQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPH
        DYFVD+ N   GG R+AT+LMYLSDV +GGETVFP A ++  +VP                                                 W     
Subjt:  DYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAVCQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPH

Query:  RRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDSF
                +LS+C +KG++VKP+ GDALLF+S+ P+A  D  SLHGGCPV+ G KWS+TKW+HV  +
Subjt:  RRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDSF

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 21.4e-11658.94Show/hide
Query:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS
        MSR   LLF+ ++ +  +++ S C    S S+ ++PSKVKQ+S KPRAFVYEGFLTDLECDHLIS+A+  L+RS VADN++G+S++S VRTSSG FI K 
Subjt:  MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKS

Query:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV
        KD IVSGIEDK++ WTFLPKENGED+QVLRYEHGQ+Y++H+DYF DKVNIA GGHR+ATVL+YLS+VTKGGETVFP A+                     
Subjt:  KDAIVSGIEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAV

Query:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT
                                             +   R  SE  +DLSDCA+KGIAVKPKKG+ALLFF+L+ +AIPD  SLHGGCPV+EGEKWSAT
Subjt:  CQQGRWASKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSAT

Query:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
        KWIHVDSF K L   GNCTD+NESCERWA LGEC KNPEYMVG+PE+PG CRRSC+ C
Subjt:  KWIHVDSFSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase1.8e-9249.7Show/hide
Query:  SASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPKENGEDIQV
        ++S   DP++V Q+SW PR F+YEGFL+D ECDH I +A+ +L++S VADN+SG+S  S VRTSSGMF+ K +D IVS +E K+AAWTFLP+ENGE +Q+
Subjt:  SASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPKENGEDIQV

Query:  LRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAVCQQGRWASKESGFRGFHIDWRRKQVLAR
        L YE+GQ+YE H+DYF D+ N+  GGHR+ATVLMYLS+V KGGETVFPM                            W  K +  +              
Subjt:  LRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAVCQQGRWASKESGFRGFHIDWRRKQVLAR

Query:  TLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDSFSKNLADVGNCTDLNESCERW
                           D+  ++CA++G AVKP+KGDALLFF+L PNA  D+ SLHG CPV+EGEKWSAT+WIHV SF +       C D N SCE+W
Subjt:  TLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDSFSKNLADVGNCTDLNESCERW

Query:  AALGECTKNPEYMVGSPELPGYCRRSCRTC
        A  GEC KNP YMVGS +  GYCR+SC+ C
Subjt:  AALGECTKNPEYMVGSPELPGYCRRSCRTC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase5.6e-8647.04Show/hide
Query:  SASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKS-----KLSTVRTSSGMFIPKSK---DAIVSGIEDKIAAWTFLPK
        ++S   DP++V Q+SW PR F+YEGFL+D ECDH I +A+ +L++S VADN+SG+S      +S VR SS           D IVS +E K+AAWTFLP+
Subjt:  SASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKS-----KLSTVRTSSGMFIPKSK---DAIVSGIEDKIAAWTFLPK

Query:  ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAVCQQGRWASKESGFRGFHIDW
        ENGE +Q+L YE+GQ+YE H+DYF D+ N+  GGHR+ATVLMYLS+V KGGETVFPM                            W  K +  +      
Subjt:  ENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAVCQQGRWASKESGFRGFHIDW

Query:  RRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDSFSKNLADVGNCTD
                                   D+  ++CA++G AVKP+KGDALLFF+L PNA  D+ SLHG CPV+EGEKWSAT+WIHV SF +       C D
Subjt:  RRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDSFSKNLADVGNCTD

Query:  LNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
         N SCE+WA  GEC KNP YMVGS +  GYCR+SC+ C
Subjt:  LNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase4.3e-8648.66Show/hide
Query:  ICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRS-EVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPKE
        I S   S S +VDP+++ Q+SW PRAF+Y+GFL+D ECDHLI +A+ +L++S  VAD +SG+S+ S VRTSSGMF+ K +D IV+ +E K+AAWTFLP+E
Subjt:  ICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRS-EVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIEDKIAAWTFLPKE

Query:  NGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAVCQQGRWASKESGFRGFHIDWR
        NGE +Q+L YE+GQ+Y+ H+DYF DK  +  GGHR+ATVLMYLS+VTKGGETVFP                                          +W+
Subjt:  NGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAVCQQGRWASKESGFRGFHIDWR

Query:  RKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDSFSKNLADVGNCTDL
                        K+P  +    D+  S CA++G AVKP+KGDALLFF+L  N   D  SLHG CPV+EGEKWSAT+WIHV SF K       C D 
Subjt:  RKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDSFSKNLADVGNCTDL

Query:  NESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
        +ESC+ WA  GEC KNP YMVGS    G+CR+SC+ C
Subjt:  NESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein9.2e-12161.25Show/hide
Query:  LFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSG
        L I   +I SV+ +S  S   S+S  V+PSKVKQ+S KPRAFVYEGFLT+LECDH++S+A++ LKRS VADN+SG+SK S VRTSSG FI K KD IVSG
Subjt:  LFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSG

Query:  IEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAVCQQGRWA
        IEDKI+ WTFLPKENGEDIQVLRYEHGQ+Y++H+DYF DKVNI  GGHR+AT+LMYLS+VTKGGETVFP AE     +P++R+                 
Subjt:  IEDKIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAVCQQGRWA

Query:  SKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDS
                                             SE  EDLSDCA++GIAVKP+KGDALLFF+L P+AIPD  SLHGGCPV+EGEKWSATKWIHVDS
Subjt:  SKESGFRGFHIDWRRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDS

Query:  FSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC
        F + +   GNCTD+NESCERWA LGECTKNPEYMVG+ ELPGYCRRSC+ C
Subjt:  FSKNLADVGNCTDLNESCERWAALGECTKNPEYMVGSPELPGYCRRSCRTC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCAGATTTCGCTCTCTGTTATTCATCTTCTTGATTTCGATTGCATCGGTTGTTCGAGAATCCATTTGTTCGCCTGCTCGTTCGGCGAGTACCACCGTAGATCCTAG
TAAAGTGAAGCAGATTTCATGGAAACCAAGAGCTTTTGTATATGAAGGTTTTCTCACGGACCTAGAATGCGACCACCTGATTTCTATTGCGAGATCCGAGCTAAAGAGAT
CTGAGGTTGCTGATAATGAGTCAGGAAAGAGTAAGCTCAGTACTGTTCGAACGAGCTCAGGAATGTTCATTCCTAAGAGCAAGGATGCTATTGTTTCTGGCATAGAGGAT
AAAATTGCTGCGTGGACTTTTCTTCCAAAAGAAAATGGGGAAGACATTCAGGTATTGAGATATGAGCATGGGCAGAGATATGAATCACACTATGATTACTTTGTTGACAA
GGTGAATATTGCCTGGGGAGGACATCGTTTGGCTACTGTCCTCATGTATCTCTCTGATGTGACCAAAGGCGGTGAAACAGTTTTCCCCATGGCAGAGCTAGACACATGCA
ATGTGCCAGCGAAGAGGTTGAGTCCCGAAGGGGGTGGACACGAGGCAGTGTGCCAGCAAGGACGTTGGGCTTCAAAGGAGAGTGGATTTAGGGGGTTCCACATCGATTGG
AGAAGGAAACAAGTGCTAGCGAGGACACTAGACCTTGAAAGAGGGTGGATTAAATCTCCCCACAGGAGGGCTTCTGAAACAGACGAGGATCTCTCAGACTGTGCAAGGAA
AGGAATCGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTCTTCTTTAGCCTTGAACCAAATGCAATCCCGGACACCAGAAGTCTACATGGAGGTTGCCCTGTTCTTGAAG
GAGAAAAATGGTCAGCAACAAAGTGGATTCACGTAGATTCTTTCAGCAAAAACTTAGCAGACGTTGGGAACTGCACTGATCTAAATGAAAGCTGTGAGAGATGGGCCGCC
TTAGGGGAATGCACCAAAAACCCAGAGTACATGGTCGGATCTCCAGAGCTTCCTGGCTACTGTAGGAGGAGTTGCAGGACCTGTTGA
mRNA sequenceShow/hide mRNA sequence
TTCTCTCTCTCTCTCTCTCGCTCTCTTTCTATTTTGATCCAAGCGAAATTATGTCCAGATTTCGCTCTCTGTTATTCATCTTCTTGATTTCGATTGCATCGGTTGTTCGA
GAATCCATTTGTTCGCCTGCTCGTTCGGCGAGTACCACCGTAGATCCTAGTAAAGTGAAGCAGATTTCATGGAAACCAAGAGCTTTTGTATATGAAGGTTTTCTCACGGA
CCTAGAATGCGACCACCTGATTTCTATTGCGAGATCCGAGCTAAAGAGATCTGAGGTTGCTGATAATGAGTCAGGAAAGAGTAAGCTCAGTACTGTTCGAACGAGCTCAG
GAATGTTCATTCCTAAGAGCAAGGATGCTATTGTTTCTGGCATAGAGGATAAAATTGCTGCGTGGACTTTTCTTCCAAAAGAAAATGGGGAAGACATTCAGGTATTGAGA
TATGAGCATGGGCAGAGATATGAATCACACTATGATTACTTTGTTGACAAGGTGAATATTGCCTGGGGAGGACATCGTTTGGCTACTGTCCTCATGTATCTCTCTGATGT
GACCAAAGGCGGTGAAACAGTTTTCCCCATGGCAGAGCTAGACACATGCAATGTGCCAGCGAAGAGGTTGAGTCCCGAAGGGGGTGGACACGAGGCAGTGTGCCAGCAAG
GACGTTGGGCTTCAAAGGAGAGTGGATTTAGGGGGTTCCACATCGATTGGAGAAGGAAACAAGTGCTAGCGAGGACACTAGACCTTGAAAGAGGGTGGATTAAATCTCCC
CACAGGAGGGCTTCTGAAACAGACGAGGATCTCTCAGACTGTGCAAGGAAAGGAATCGCAGTGAAACCAAAGAAAGGCGATGCCCTTCTCTTCTTTAGCCTTGAACCAAA
TGCAATCCCGGACACCAGAAGTCTACATGGAGGTTGCCCTGTTCTTGAAGGAGAAAAATGGTCAGCAACAAAGTGGATTCACGTAGATTCTTTCAGCAAAAACTTAGCAG
ACGTTGGGAACTGCACTGATCTAAATGAAAGCTGTGAGAGATGGGCCGCCTTAGGGGAATGCACCAAAAACCCAGAGTACATGGTCGGATCTCCAGAGCTTCCTGGCTAC
TGTAGGAGGAGTTGCAGGACCTGTTGA
Protein sequenceShow/hide protein sequence
MSRFRSLLFIFLISIASVVRESICSPARSASTTVDPSKVKQISWKPRAFVYEGFLTDLECDHLISIARSELKRSEVADNESGKSKLSTVRTSSGMFIPKSKDAIVSGIED
KIAAWTFLPKENGEDIQVLRYEHGQRYESHYDYFVDKVNIAWGGHRLATVLMYLSDVTKGGETVFPMAELDTCNVPAKRLSPEGGGHEAVCQQGRWASKESGFRGFHIDW
RRKQVLARTLDLERGWIKSPHRRASETDEDLSDCARKGIAVKPKKGDALLFFSLEPNAIPDTRSLHGGCPVLEGEKWSATKWIHVDSFSKNLADVGNCTDLNESCERWAA
LGECTKNPEYMVGSPELPGYCRRSCRTC