CuGenDBv2

Spg005308 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg005308
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: prolyl 4-hydroxylase 1
Genome location: scaffold11:35374323..35380075
RNA-Seq Expression: Spg005308
Synteny: Spg005308
Gene Ontology terms:
  GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
  GO:0000137 - Golgi cis cisterna (cellular component)
  GO:0005783 - endoplasmic reticulum (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
  GO:0005506 - iron ion binding (molecular function)
  GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
  IPR006620 - Prolyl 4-hydroxylase, alpha subunit
  IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
  IPR045054 - Prolyl 4-hydroxylase
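
The genome location field above uses a `scaffold:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the string can be split into its parts in Python. `parse_location` is a hypothetical helper name for illustration, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split 'scaffold:start..end' into (scaffold, start, end, span)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so span = end - start + 1.
    return scaffold, start, end, end - start + 1

print(parse_location("scaffold11:35374323..35380075"))
```

Under that assumption the gene spans 5753 bp on scaffold11, which is considerably longer than the CDS listed further down.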


Homology
GenBank top hits (e value, % identity, alignment)
XP_004152082.1 prolyl 4-hydroxylase 1 [Cucumis sativus] (e value: 1.4e-58; % identity: 76.82)
Query:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS
        AIEKRISVYSQ+P+ENGELIQVLRY  N                    + +  FN+KRGGQR+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLS
Subjt:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS

Query:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP
        VKP KGDAVLFWSMGLDGQSDPKSIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP

XP_016901569.1 PREDICTED: prolyl 4-hydroxylase 1 isoform X4 [Cucumis melo] (e value: 5.2e-58; % identity: 76.16)
Query:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS
        AIEKRISVYSQIP+ENGELIQVLRY  N                    + +  FN+KRGGQR+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLS
Subjt:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS

Query:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP
        VKP KGDA+LFWSMGLDGQSDP SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP

XP_022137963.1 prolyl 4-hydroxylase 1 [Momordica charantia] (e value: 2.1e-59; % identity: 80.13)
Query:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS
        AIEKRISVYSQIPIENGELIQVLRY  N                    + +  FN+KRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS
Subjt:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS

Query:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP
        VKPVKGDAVLFWSMGLDGQSDP SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP

XP_038904320.1 prolyl 4-hydroxylase 1 isoform X1 [Benincasa hispida] (e value: 4.0e-58; % identity: 76.82)
Query:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS
        AIEKRISVYSQIP+ENGELIQVLRY  N                    + +  FN+KRGGQR+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLS
Subjt:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS

Query:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP
        VKP KGDAVLFWSMGLDGQSDP SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP

XP_038904325.1 prolyl 4-hydroxylase 1 isoform X2 [Benincasa hispida] (e value: 4.0e-58; % identity: 76.82)
Query:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS
        AIEKRISVYSQIP+ENGELIQVLRY  N                    + +  FN+KRGGQR+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLS
Subjt:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS

Query:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP
        VKP KGDAVLFWSMGLDGQSDP SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KU17 Fe2OG dioxygenase domain-containing protein (e value: 6.6e-59; % identity: 76.82)
Query:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS
        AIEKRISVYSQ+P+ENGELIQVLRY  N                    + +  FN+KRGGQR+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLS
Subjt:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS

Query:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP
        VKP KGDAVLFWSMGLDGQSDPKSIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP

A0A1S3BY76 prolyl 4-hydroxylase 1 isoform X6 (e value: 2.5e-58; % identity: 76.16)
Query:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS
        AIEKRISVYSQIP+ENGELIQVLRY  N                    + +  FN+KRGGQR+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLS
Subjt:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS

Query:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP
        VKP KGDA+LFWSMGLDGQSDP SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP

A0A1S3BYM5 prolyl 4-hydroxylase 1 isoform X7 (e value: 2.5e-58; % identity: 76.16)
Query:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS
        AIEKRISVYSQIP+ENGELIQVLRY  N                    + +  FN+KRGGQR+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLS
Subjt:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS

Query:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP
        VKP KGDA+LFWSMGLDGQSDP SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP

A0A1S4E011 prolyl 4-hydroxylase 1 isoform X2 (e value: 2.5e-58; % identity: 76.16)
Query:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS
        AIEKRISVYSQIP+ENGELIQVLRY  N                    + +  FN+KRGGQR+ATMLMYLS+N+EGGETYFPKAGSGECSCGGKTVPGLS
Subjt:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS

Query:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP
        VKP KGDA+LFWSMGLDGQSDP SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP

A0A6J1CBS4 prolyl 4-hydroxylase 1 (e value: 1.0e-59; % identity: 80.13)
Query:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS
        AIEKRISVYSQIPIENGELIQVLRY  N                    + +  FN+KRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS
Subjt:  AIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLS

Query:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP
        VKPVKGDAVLFWSMGLDGQSDP SIHGGCEVL+GEKWSATKWMRQKSTLVP
Subjt:  VKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKSTLVP

SwissProt top hits (e value, % identity, alignment)
F4JNU8 Probable prolyl 4-hydroxylase 8 (e value: 2.5e-26; % identity: 48.32)
Query:  IEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSG--------ECSCGG
        IE RIS ++ IP ENGE +QVL Y    V  +     +         +   +FNV++GGQR+AT+LMYLSD  EGGET FP A           E S  G
Subjt:  IEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSG--------ECSCGG

Query:  KTVPGLSVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKW
        K   GLSV P K DA+LFWSM  D   DP S+HGGC V+ G KWS+TKW
Subjt:  KTVPGLSVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKW

F4JZ24 Probable prolyl 4-hydroxylase 10 (e value: 2.6e-28; % identity: 45.57)
Query:  TLWAIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGS-----------
        T+  IEKRIS ++ IP+E+GE +QVL Y    +  K     +         +   ++N + GGQR+AT+LMYLSD  EGGET FP A             
Subjt:  TLWAIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGS-----------

Query:  GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMR
         EC  G     GLSVKP  GDA+LFWSM  D   DP S+HGGC V+ G KWS+TKW+R
Subjt:  GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMR

Q24JN5 Prolyl 4-hydroxylase 5 (e value: 1.1e-26; % identity: 48.32)
Query:  IEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGS--------GECSCGG
        IEKRIS ++ IP+ENGE +QVL Y    V  K     +         +   +FN K GGQR+AT+LMYLSD  +GGET FP A           E S  G
Subjt:  IEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGS--------GECSCGG

Query:  KTVPGLSVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKW
        K   GLSV P K DA+LFW+M  D   DP S+HGGC V+ G KWS+TKW
Subjt:  KTVPGLSVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKW

Q9LN20 Probable prolyl 4-hydroxylase 3 (e value: 4.5e-28; % identity: 48.37)
Query:  IEKRISVYSQIPIENGELIQVLRY--GYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECS---------
        IEKRI+ Y+ IP ++GE +QVL Y  G  +                   +   +FN K GGQR+ATMLMYLSD  EGGET FP A     S         
Subjt:  IEKRISVYSQIPIENGELIQVLRY--GYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECS---------

Query:  CGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWM
        CG K   GLSVKP  GDA+LFWSM  D   DP S+HGGC V+ G KWS+TKWM
Subjt:  CGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWM

Q9ZW86 Prolyl 4-hydroxylase 1 (e value: 9.0e-53; % identity: 69.13)
Query:  AIEKRISVYSQIPIENGELIQVLRY-GYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGL
        AIEKRI+V+SQ+P ENGELIQVLRY    F                   + A  FN+KRGGQRVATMLMYL+D+VEGGETYFP AG G+C+CGGK + G+
Subjt:  AIEKRISVYSQIPIENGELIQVLRY-GYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGL

Query:  SVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKST
        SVKP KGDAVLFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQK+T
Subjt:  SVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKST

Arabidopsis top hits (e value, % identity, alignment)
AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 3.2e-29; % identity: 48.37)
Query:  IEKRISVYSQIPIENGELIQVLRY--GYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECS---------
        IEKRI+ Y+ IP ++GE +QVL Y  G  +                   +   +FN K GGQR+ATMLMYLSD  EGGET FP A     S         
Subjt:  IEKRISVYSQIPIENGELIQVLRY--GYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECS---------

Query:  CGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWM
        CG K   GLSVKP  GDA+LFWSM  D   DP S+HGGC V+ G KWS+TKWM
Subjt:  CGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWM

AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 7.8e-28; % identity: 48.32)
Query:  IEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGS--------GECSCGG
        IEKRIS ++ IP+ENGE +QVL Y    V  K     +         +   +FN K GGQR+AT+LMYLSD  +GGET FP A           E S  G
Subjt:  IEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGS--------GECSCGG

Query:  KTVPGLSVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKW
        K   GLSV P K DA+LFW+M  D   DP S+HGGC V+ G KWS+TKW
Subjt:  KTVPGLSVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKW

AT2G43080.1 P4H isoform 1 (e value: 6.4e-54; % identity: 69.13)
Query:  AIEKRISVYSQIPIENGELIQVLRY-GYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGL
        AIEKRI+V+SQ+P ENGELIQVLRY    F                   + A  FN+KRGGQRVATMLMYL+D+VEGGETYFP AG G+C+CGGK + G+
Subjt:  AIEKRISVYSQIPIENGELIQVLRY-GYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGL

Query:  SVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKST
        SVKP KGDAVLFWSMGLDGQSDP+SIHGGCEVL+GEKWSATKWMRQK+T
Subjt:  SVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQKST

AT4G35810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 1.7e-27; % identity: 48.32)
Query:  IEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSG--------ECSCGG
        IE RIS ++ IP ENGE +QVL Y    V  +     +         +   +FNV++GGQR+AT+LMYLSD  EGGET FP A           E S  G
Subjt:  IEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSG--------ECSCGG

Query:  KTVPGLSVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKW
        K   GLSV P K DA+LFWSM  D   DP S+HGGC V+ G KWS+TKW
Subjt:  KTVPGLSVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKW

AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 1.9e-29; % identity: 45.57)
Query:  TLWAIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGS-----------
        T+  IEKRIS ++ IP+E+GE +QVL Y    +  K     +         +   ++N + GGQR+AT+LMYLSD  EGGET FP A             
Subjt:  TLWAIEKRISVYSQIPIENGELIQVLRYGYNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGS-----------

Query:  GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMR
         EC  G     GLSVKP  GDA+LFWSM  D   DP S+HGGC V+ G KWS+TKW+R
Subjt:  GECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMR


Sequences
CDS sequence
ATGCCCTCGTTGGTGGGCCCGTTTTGTTGCATTCTTTGCTGGAAGACGGAGGAAAACCTGGATCACATTCTTCGTACTTGTGACTATGTGCGCGTGGTGTGGGATTTGTT
CTATGATGCTTTTGGGATGCAGACGTGGAGTTATCGGAATGTTAAGGAGTTGATCGAGGAGTTCTTCTTCCATCCGCCTTTTCGCGAGAAGGGGAGATTTTTGTGGCAAG
CTGGGTTGTGTGCTTTGTTATGGACTTTATGGGCAATTGAAAAAAGAATTTCTGTCTATTCTCAAATACCAATAGAAAATGGAGAGCTCATTCAAGTGTTAAGGTATGGA
TATAATTTTGTTTCTGTGAAGATGCGAACACAAAGAGAGGTTTGTACTTCAGTTTTAAGGCTAACATTTCAAGCAGTACAGTTTAACGTGAAGCGTGGTGGTCAGCGAGT
AGCAACCATGCTTATGTATCTAAGTGACAATGTTGAAGGTGGAGAAACCTACTTTCCGAAGGCTGGTTCTGGTGAGTGTAGTTGTGGCGGGAAGACCGTCCCAGGGCTGT
CAGTCAAACCAGTCAAAGGAGACGCAGTGCTTTTCTGGAGCATGGGGTTGGATGGACAGTCGGATCCTAAGAGCATTCATGGAGGTTGTGAAGTATTGGCAGGCGAAAAA
TGGTCTGCCACAAAATGGATGAGGCAAAAGAGTACTCTGGTACCATAG
mRNA sequence
ATGCCCTCGTTGGTGGGCCCGTTTTGTTGCATTCTTTGCTGGAAGACGGAGGAAAACCTGGATCACATTCTTCGTACTTGTGACTATGTGCGCGTGGTGTGGGATTTGTT
CTATGATGCTTTTGGGATGCAGACGTGGAGTTATCGGAATGTTAAGGAGTTGATCGAGGAGTTCTTCTTCCATCCGCCTTTTCGCGAGAAGGGGAGATTTTTGTGGCAAG
CTGGGTTGTGTGCTTTGTTATGGACTTTATGGGCAATTGAAAAAAGAATTTCTGTCTATTCTCAAATACCAATAGAAAATGGAGAGCTCATTCAAGTGTTAAGGTATGGA
TATAATTTTGTTTCTGTGAAGATGCGAACACAAAGAGAGGTTTGTACTTCAGTTTTAAGGCTAACATTTCAAGCAGTACAGTTTAACGTGAAGCGTGGTGGTCAGCGAGT
AGCAACCATGCTTATGTATCTAAGTGACAATGTTGAAGGTGGAGAAACCTACTTTCCGAAGGCTGGTTCTGGTGAGTGTAGTTGTGGCGGGAAGACCGTCCCAGGGCTGT
CAGTCAAACCAGTCAAAGGAGACGCAGTGCTTTTCTGGAGCATGGGGTTGGATGGACAGTCGGATCCTAAGAGCATTCATGGAGGTTGTGAAGTATTGGCAGGCGAAAAA
TGGTCTGCCACAAAATGGATGAGGCAAAAGAGTACTCTGGTACCATAG
Protein sequence
MPSLVGPFCCILCWKTEENLDHILRTCDYVRVVWDLFYDAFGMQTWSYRNVKELIEEFFFHPPFREKGRFLWQAGLCALLWTLWAIEKRISVYSQIPIENGELIQVLRYG
YNFVSVKMRTQREVCTSVLRLTFQAVQFNVKRGGQRVATMLMYLSDNVEGGETYFPKAGSGECSCGGKTVPGLSVKPVKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEK
WSATKWMRQKSTLVP
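
The CDS and protein sequences can be cross-checked by translation. The sketch below assumes the standard genetic code (the record does not name a translation table) and, to stay short, translates only the final line of the CDS, which starts in frame. `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# Final line of the CDS sequence listed above.
cds_tail = "TGGTCTGCCACAAAATGGATGAGGCAAAAGAGTACTCTGGTACCATAG"
print(translate(cds_tail))  # WSATKWMRQKSTLVP
```

The result matches the last line of the protein sequence, and the trailing `TAG` is the stop codon.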