; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021208 (gene) of Snake gourd v1 genome

Gene IDTan0021208
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationLG11:4869681..4873326
RNA-Seq ExpressionTan0021208
SyntenyTan0021208
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458700.1 PREDICTED: probable prolyl 4-hydroxylase 7 [Cucumis melo]6.9e-15987.97Show/hide
Query:  MDSRRLLTFSLCFLFLFTGFARLPEL----NTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESG
        MDSR  L FSLCFL +FT FARLPE     +++K+ +GSVLRLK DSSPLIFDPTRVTQLSW+PRAFLYKGFLSD+ECDHLIDLAKDKLEKSMVADNESG
Subjt:  MDSRRLLTFSLCFLFLFTGFARLPEL----NTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFL++AQDK+VA +EARI+AW  LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
Subjt:  KSVSSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASK-GCVDENENCPMWAKRGECKKNPTYMV
        SQEKDDSWSDC+RKGYAVKAQKGDALLFFSLHLDATTD RSLHGSCPVIEGEKWSATKWIHVRSFEK  R S+  CVDENENCP WAKRGECKKNPTYMV
Subjt:  SQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASK-GCVDENENCPMWAKRGECKKNPTYMV

Query:  GSEGGLGYCRKSCKAC
        GSEG LGYCRKSCKAC
Subjt:  GSEGGLGYCRKSCKAC

XP_011655982.1 probable prolyl 4-hydroxylase 7 isoform X1 [Cucumis sativus]3.2e-15687.22Show/hide
Query:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVS
        MDSR  L FSLCFL +FT FARLPE  THK+ SGSVLRLK DSSPLIFDPTRVTQLSW+PRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SGKSVS
Subjt:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFL++AQD+VVA +EARI+AW  LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ K
Subjt:  SEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTR--ASKGCVDENENCPMWAKRGECKKNPTYMVGSE
        D+SWSDC+RKGYAVKAQKGDALLFFSL+LDATTD RSLHGSCPVI GEKWSATKWIHVRSFEK T   + +GCVDENENC  WAK+GECKKNPTYMVGS 
Subjt:  DDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTR--ASKGCVDENENCPMWAKRGECKKNPTYMVGSE

Query:  GGLGYCRKSCKAC
        G LGYCRKSCKAC
Subjt:  GGLGYCRKSCKAC

XP_022134044.1 probable prolyl 4-hydroxylase 7 [Momordica charantia]3.3e-16187.86Show/hide
Query:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVS
        MDS R L+FSLCFLF+FT  ARLP++  HKK+SGSVLRLKG+ SPLIFDPTRVTQLSW+PRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADN SGKSVS
Subjt:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFL +AQD++VA +EARI+AW FLPAENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSNVEKGGETIFPNSEFKESQEK
Subjt:  SEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASK--GCVDENENCPMWAKRGECKKNPTYMVGSE
        DDSWSDCARKGYAVKA+KGDALLFFSLHLDATTD +SLHGSCPVIEGEKWSATKWIHVRSFEKPTR S+   CVDENENC  WAKRGECKKNPTYMVGSE
Subjt:  DDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASK--GCVDENENCPMWAKRGECKKNPTYMVGSE

Query:  GGLGYCRKSCKAC
          LGYCRKSC+AC
Subjt:  GGLGYCRKSCKAC

XP_038889686.1 probable prolyl 4-hydroxylase 7 isoform X1 [Benincasa hispida]4.1e-15988.46Show/hide
Query:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVS
        MDSRR L F LCFL +FTGFARLPEL + KK SGSV+RLK DSSPL+FDPTRVTQLSW+PRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVS
Subjt:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFL++AQD++VA IEARISAW  LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK
Subjt:  SEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASK-GCVDENENCPMWAKRGECKKNPTYMVGSEG
        D+SWSDCARKGYAVKA+KGDALLFFSL  DATTD +SLHGSCPVIEGEKWSATKWIHVRSFEK TR S+  CVDENENC +WAKRGECKKNPTYMVGSE 
Subjt:  DDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASK-GCVDENENCPMWAKRGECKKNPTYMVGSEG

Query:  GLGYCRKSCKAC
         LGYCRKSC+AC
Subjt:  GLGYCRKSCKAC

XP_038889687.1 probable prolyl 4-hydroxylase 7 isoform X2 [Benincasa hispida]5.9e-15888.46Show/hide
Query:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVS
        MDSRR L F LCFL +FTGFARLPEL + KK SGSV+RLK DSSPL+FDPTRVTQLSW+PRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVS
Subjt:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFL++AQD++VA IEARISAW  LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK
Subjt:  SEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASK-GCVDENENCPMWAKRGECKKNPTYMVGSEG
        D+SWSDCARKGYAVKA+KGDALLFFSL  DATTD +SLHGSCPVIEGEKWSATKWIHVRSFEK TR S+  CVDENENC +WAKRGECKKNPTYMVGSE 
Subjt:  DDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASK-GCVDENENCPMWAKRGECKKNPTYMVGSEG

Query:  GLGYCRKSCKAC
         LGYCRKSC+AC
Subjt:  GLGYCRKSCKAC

TrEMBL top hitse value%identityAlignment
A0A0A0KS38 Procollagen-proline 4-dioxygenase1.6e-15687.22Show/hide
Query:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVS
        MDSR  L FSLCFL +FT FARLPE  THK+ SGSVLRLK DSSPLIFDPTRVTQLSW+PRAFLYKGFLSD ECDHLIDLAKDKLEKSMVADN+SGKSVS
Subjt:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFL++AQD+VVA +EARI+AW  LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ K
Subjt:  SEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTR--ASKGCVDENENCPMWAKRGECKKNPTYMVGSE
        D+SWSDC+RKGYAVKAQKGDALLFFSL+LDATTD RSLHGSCPVI GEKWSATKWIHVRSFEK T   + +GCVDENENC  WAK+GECKKNPTYMVGS 
Subjt:  DDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTR--ASKGCVDENENCPMWAKRGECKKNPTYMVGSE

Query:  GGLGYCRKSCKAC
        G LGYCRKSCKAC
Subjt:  GGLGYCRKSCKAC

A0A1S3C8G4 Procollagen-proline 4-dioxygenase3.4e-15987.97Show/hide
Query:  MDSRRLLTFSLCFLFLFTGFARLPEL----NTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESG
        MDSR  L FSLCFL +FT FARLPE     +++K+ +GSVLRLK DSSPLIFDPTRVTQLSW+PRAFLYKGFLSD+ECDHLIDLAKDKLEKSMVADNESG
Subjt:  MDSRRLLTFSLCFLFLFTGFARLPEL----NTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESG

Query:  KSVSSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
        KSVSSEVRTSSGMFL++AQDK+VA +EARI+AW  LPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE
Subjt:  KSVSSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKE

Query:  SQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASK-GCVDENENCPMWAKRGECKKNPTYMV
        SQEKDDSWSDC+RKGYAVKAQKGDALLFFSLHLDATTD RSLHGSCPVIEGEKWSATKWIHVRSFEK  R S+  CVDENENCP WAKRGECKKNPTYMV
Subjt:  SQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASK-GCVDENENCPMWAKRGECKKNPTYMV

Query:  GSEGGLGYCRKSCKAC
        GSEG LGYCRKSCKAC
Subjt:  GSEGGLGYCRKSCKAC

A0A6J1BXN9 Procollagen-proline 4-dioxygenase1.6e-16187.86Show/hide
Query:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVS
        MDS R L+FSLCFLF+FT  ARLP++  HKK+SGSVLRLKG+ SPLIFDPTRVTQLSW+PRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADN SGKSVS
Subjt:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFL +AQD++VA +EARI+AW FLPAENGESIQILHYENGQKYEPHFD+FHDKVNQELGGHR+ATVLMYLSNVEKGGETIFPNSEFKESQEK
Subjt:  SEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASK--GCVDENENCPMWAKRGECKKNPTYMVGSE
        DDSWSDCARKGYAVKA+KGDALLFFSLHLDATTD +SLHGSCPVIEGEKWSATKWIHVRSFEKPTR S+   CVDENENC  WAKRGECKKNPTYMVGSE
Subjt:  DDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASK--GCVDENENCPMWAKRGECKKNPTYMVGSE

Query:  GGLGYCRKSCKAC
          LGYCRKSC+AC
Subjt:  GGLGYCRKSCKAC

A0A6J1FJ93 Procollagen-proline 4-dioxygenase4.5e-15688.46Show/hide
Query:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVS
        MDSRR L FSL FL + TGFARLPE  THKKLSGSVL LK DS  LIFDPTRVTQLSW+PRAFLYKGFL+D+ECDHLIDLAKDKLEKSMVADNESGKSVS
Subjt:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFL++AQD++VA IEARISAW FLP ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS F ESQEK
Subjt:  SEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTR-ASKGCVDENENCPMWAKRGECKKNPTYMVGSEG
        DDSWSDCARKGYAVKAQKGDALLFFSLHLDATTD RSLHGSCPVIEGEKWSATKWIHVRSF+K TR +S+ CVDEN+NCP WAKRGEC+KNPTYMVGSEG
Subjt:  DDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTR-ASKGCVDENENCPMWAKRGECKKNPTYMVGSEG

Query:  GLGYCRKSCKAC
         +GYCRKSCKAC
Subjt:  GLGYCRKSCKAC

A0A6J1JWX0 Procollagen-proline 4-dioxygenase7.7e-15688.46Show/hide
Query:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVS
        MDSRR L FSL FL + TGFARLPE  THKKLSGSVL LK DS  LIFDPTRVTQLSW+PRAFLYKGFL+D+ECDHLIDLAKDKLEKSMVADNESGKSVS
Subjt:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVS

Query:  SEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK
        SEVRTSSGMFL++AQD++VA IEARISAW FLP ENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS F ESQEK
Subjt:  SEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEK

Query:  DDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTR-ASKGCVDENENCPMWAKRGECKKNPTYMVGSEG
        DDSWSDCARKGYAVKAQKGDALLFFSLHLDATTD RSLHGSCPVIEGEKWSATKWIHVRSF+K TR +S+ CVDEN+NCP WAKRGEC+KNPTYMVGSEG
Subjt:  DDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTR-ASKGCVDENENCPMWAKRGECKKNPTYMVGSEG

Query:  GLGYCRKSCKAC
         +GYCRKSCKAC
Subjt:  GLGYCRKSCKAC

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 61.6e-11365.38Show/hide
Query:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNESGKSV
        MDS+  L FSL  L +F+  +                     S     DPTR+TQLSW PRAFLYKGFLSD+ECDHLI LAK KLEKSM VAD +SG+S 
Subjt:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNESGKSV

Query:  SSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQE
         SEVRTSSGMFL + QD +VA++EA+++AW FLP ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV KGGET+FPN + K  Q 
Subjt:  SSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQE

Query:  KDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKKNPTYMVGSEG
        KDDSWS CA++GYAVK +KGDALLFF+LHL+ TTD  SLHGSCPVIEGEKWSAT+WIHVRSF K       CVD++E+C  WA  GEC+KNP YMVGSE 
Subjt:  KDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKKNPTYMVGSEG

Query:  GLGYCRKSCKAC
         LG+CRKSCKAC
Subjt:  GLGYCRKSCKAC

F4JAU3 Prolyl 4-hydroxylase 25.3e-9361.17Show/hide
Query:  SSP-LIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQ
        SSP  I +P++V Q+S KPRAF+Y+GFL+D ECDHLI LAK+ L++S VADN++G+S  S+VRTSSG F+ + +D +V+ IE ++S W FLP ENGE +Q
Subjt:  SSP-LIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQ

Query:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSL
        +L YE+GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP++ EF  +   E  D  SDCA+KG AVK +KG+ALLFF+L  DA  D  SL
Subjt:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSL

Query:  HGSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKKNPTYMVGSEGGLGYCRKSCKAC
        HG CPVIEGEKWSATKWIHV SF+K       C D NE+C  WA  GEC KNP YMVG+    G CR+SCKAC
Subjt:  HGSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKKNPTYMVGSEGGLGYCRKSCKAC

Q8L970 Probable prolyl 4-hydroxylase 73.3e-12768.15Show/hide
Query:  MDSRRLLTFSLCFLFLFTGFARLPE---LNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGK
        MDSR  L FSLCFLF     +  P      +     GSV+++K  +S   FDPTRVTQLSW PR FLY+GFLSD+ECDH I LAK KLEKSMVADN+SG+
Subjt:  MDSRRLLTFSLCFLFLFTGFARLPE---LNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGK

Query:  SVSSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKES
        SV SEVRTSSGMFL + QD +V+++EA+++AW FLP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+FP  + K +
Subjt:  SVSSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKES

Query:  QEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKKNPTYMVGS
        Q KDDSW++CA++GYAVK +KGDALLFF+LH +ATTD+ SLHGSCPV+EGEKWSAT+WIHV+SFE+      GC+DEN +C  WAK GEC+KNPTYMVGS
Subjt:  QEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKKNPTYMVGS

Query:  EGGLGYCRKSCKAC
        +   GYCRKSCKAC
Subjt:  EGGLGYCRKSCKAC

Q8LAN3 Probable prolyl 4-hydroxylase 41.9e-9560.66Show/hide
Query:  SSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQI
        SS +  +P++V Q+S KPRAF+Y+GFL++ ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+ + +D +V+ IE +IS W FLP ENGE IQ+
Subjt:  SSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQI

Query:  LHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLH
        L YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E    +   E  +  SDCA++G AVK +KGDALLFF+LH DA  D  SLH
Subjt:  LHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLH

Query:  GSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKKNPTYMVGSEGGLGYCRKSCKAC
        G CPVIEGEKWSATKWIHV SF++    S  C D NE+C  WA  GEC KNP YMVG+    GYCR+SCKAC
Subjt:  GSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKKNPTYMVGSEGGLGYCRKSCKAC

Q9LN20 Probable prolyl 4-hydroxylase 33.0e-6455.77Show/hide
Query:  LSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHF
        LSW+PRAF+Y  FLS +EC++LI LAK  + KS V D+E+GKS  S VRTSSG FL+R +DK++  IE RI+ + F+PA++GE +Q+LHYE GQKYEPH+
Subjt:  LSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHF

Query:  DFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFK-ESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATK
        D+F D+ N + GG R+AT+LMYLS+VE+GGET+FP +     S    +  S+C +KG +VK + GDALLF+S+  DAT D  SLHG CPVI G KWS+TK
Subjt:  DFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFK-ESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATK

Query:  WIHVRSFE
        W+HV  ++
Subjt:  WIHVRSFE

Arabidopsis top hitse value%identityAlignment
AT3G06300.1 P4H isoform 23.7e-9461.17Show/hide
Query:  SSP-LIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQ
        SSP  I +P++V Q+S KPRAF+Y+GFL+D ECDHLI LAK+ L++S VADN++G+S  S+VRTSSG F+ + +D +V+ IE ++S W FLP ENGE +Q
Subjt:  SSP-LIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQ

Query:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSL
        +L YE+GQKY+ HFD+FHDKVN   GGHRIATVL+YLSNV KGGET+FP++ EF  +   E  D  SDCA+KG AVK +KG+ALLFF+L  DA  D  SL
Subjt:  ILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNS-EF--KESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSL

Query:  HGSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKKNPTYMVGSEGGLGYCRKSCKAC
        HG CPVIEGEKWSATKWIHV SF+K       C D NE+C  WA  GEC KNP YMVG+    G CR+SCKAC
Subjt:  HGSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKKNPTYMVGSEGGLGYCRKSCKAC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase2.3e-12868.15Show/hide
Query:  MDSRRLLTFSLCFLFLFTGFARLPE---LNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGK
        MDSR  L FSLCFLF     +  P      +     GSV+++K  +S   FDPTRVTQLSW PR FLY+GFLSD+ECDH I LAK KLEKSMVADN+SG+
Subjt:  MDSRRLLTFSLCFLFLFTGFARLPE---LNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGK

Query:  SVSSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKES
        SV SEVRTSSGMFL + QD +V+++EA+++AW FLP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+FP  + K +
Subjt:  SVSSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKES

Query:  QEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKKNPTYMVGS
        Q KDDSW++CA++GYAVK +KGDALLFF+LH +ATTD+ SLHGSCPV+EGEKWSAT+WIHV+SFE+      GC+DEN +C  WAK GEC+KNPTYMVGS
Subjt:  QEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKKNPTYMVGS

Query:  EGGLGYCRKSCKAC
        +   GYCRKSCKAC
Subjt:  EGGLGYCRKSCKAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase1.4e-12064.6Show/hide
Query:  MDSRRLLTFSLCFLFLFTGFARLPE---LNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGK
        MDSR  L FSLCFLF     +  P      +     GSV+++K  +S   FDPTRVTQLSW PR FLY+GFLSD+ECDH I LAK KLEKSMVADN+SG+
Subjt:  MDSRRLLTFSLCFLFLFTGFARLPE---LNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGK

Query:  SVSSE-----VRTSSGMFLQRAQ---DKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIF
        SV SE     VR SS           D +V+++EA+++AW FLP ENGES+QILHYENGQKYEPHFD+FHD+ N ELGGHRIATVLMYLSNVEKGGET+F
Subjt:  SVSSE-----VRTSSGMFLQRAQ---DKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIF

Query:  PNSEFKESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKK
        P  + K +Q KDDSW++CA++GYAVK +KGDALLFF+LH +ATTD+ SLHGSCPV+EGEKWSAT+WIHV+SFE+      GC+DEN +C  WAK GEC+K
Subjt:  PNSEFKESQEKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKK

Query:  NPTYMVGSEGGLGYCRKSCKAC
        NPTYMVGS+   GYCRKSCKAC
Subjt:  NPTYMVGSEGGLGYCRKSCKAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase1.1e-11465.38Show/hide
Query:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNESGKSV
        MDS+  L FSL  L +F+  +                     S     DPTR+TQLSW PRAFLYKGFLSD+ECDHLI LAK KLEKSM VAD +SG+S 
Subjt:  MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSM-VADNESGKSV

Query:  SSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQE
         SEVRTSSGMFL + QD +VA++EA+++AW FLP ENGE++QILHYENGQKY+PHFD+F+DK   ELGGHRIATVLMYLSNV KGGET+FPN + K  Q 
Subjt:  SSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQE

Query:  KDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKKNPTYMVGSEG
        KDDSWS CA++GYAVK +KGDALLFF+LHL+ TTD  SLHGSCPVIEGEKWSAT+WIHVRSF K       CVD++E+C  WA  GEC+KNP YMVGSE 
Subjt:  KDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKKNPTYMVGSEG

Query:  GLGYCRKSCKAC
         LG+CRKSCKAC
Subjt:  GLGYCRKSCKAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.4e-9660.66Show/hide
Query:  SSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQI
        SS +  +P++V Q+S KPRAF+Y+GFL++ ECDH++ LAK  L++S VADN+SG+S  SEVRTSSG F+ + +D +V+ IE +IS W FLP ENGE IQ+
Subjt:  SSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMFLQRAQDKVVADIEARISAWAFLPAENGESIQI

Query:  LHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLH
        L YE+GQKY+ HFD+FHDKVN   GGHR+AT+LMYLSNV KGGET+FP++E    +   E  +  SDCA++G AVK +KGDALLFF+LH DA  D  SLH
Subjt:  LHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQ---EKDDSWSDCARKGYAVKAQKGDALLFFSLHLDATTDTRSLH

Query:  GSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKKNPTYMVGSEGGLGYCRKSCKAC
        G CPVIEGEKWSATKWIHV SF++    S  C D NE+C  WA  GEC KNP YMVG+    GYCR+SCKAC
Subjt:  GSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKKNPTYMVGSEGGLGYCRKSCKAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCCGTCGGCTTCTCACATTTTCCCTTTGCTTTCTCTTCCTGTTTACCGGCTTCGCTCGCTTGCCTGAATTGAATACGCACAAGAAACTAAGTGGATCTGTGCT
TCGCTTGAAAGGGGATTCATCTCCGCTGATTTTCGATCCAACACGAGTCACCCAGCTCTCCTGGAAACCCAGGGCATTTTTGTATAAGGGATTTTTATCTGATAAGGAAT
GTGATCATCTAATCGATCTGGCTAAGGATAAATTAGAGAAGTCGATGGTAGCAGATAATGAGTCTGGTAAGAGTGTTAGTAGTGAAGTCCGGACGAGTTCTGGCATGTTC
CTTCAGAGAGCCCAGGATAAAGTGGTTGCTGACATTGAGGCCAGAATTTCTGCATGGGCATTCCTTCCAGCAGAAAATGGGGAGTCCATTCAAATTCTGCACTATGAGAA
TGGTCAAAAGTATGAACCGCATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGTTAGGTGGTCACCGAATAGCCACAGTCTTGATGTATTTATCCAATGTTGAAAAGG
GTGGAGAAACCATCTTTCCAAATTCAGAGTTTAAAGAATCTCAAGAAAAGGATGACAGTTGGTCTGATTGTGCTCGAAAGGGTTATGCAGTTAAAGCACAGAAAGGTGAT
GCATTGCTGTTCTTTAGCCTCCATCTCGATGCAACGACAGATACGAGAAGCTTGCACGGTAGTTGCCCTGTGATCGAGGGCGAGAAATGGTCTGCAACCAAGTGGATTCA
TGTGAGATCATTCGAGAAGCCAACTCGTGCAAGTAAGGGTTGCGTGGACGAGAATGAAAATTGCCCTATGTGGGCCAAAAGGGGTGAGTGCAAAAAGAACCCTACGTACA
TGGTGGGTTCAGAAGGTGGTTTAGGATACTGTAGGAAGAGTTGCAAAGCATGTTAA
mRNA sequenceShow/hide mRNA sequence
GCTCATTTTGGCAAAATGGAGACACTTTTACAATATTACAATTTTATCTAAAACATCAGATGCTATGCTATCGTCTGTGGTTAAAAAACAGAGAAAATATAAATATATTT
GCAGTTTCAGGGGTGACACTGGCAACATCGAGAGATAGAAACCCAAAAAAAATTCCCAGGGTCCCGAAAAACAGCCCATTCATTGTCATTTCTCATTCGTTCTCTTCACA
GTTGAATTTTCGTAATAAATTCAAATCCGTTTCTCTTTCTTCTTCCCTTCTCCTTCACGAAAGAACCCTTCCATAGTTGAATTTTTCTCTTTTTCCTTTCTCCGATTTCA
TATCGGAGGAACGAGATTTGGCTCATGGATTCCCGTCGGCTTCTCACATTTTCCCTTTGCTTTCTCTTCCTGTTTACCGGCTTCGCTCGCTTGCCTGAATTGAATACGCA
CAAGAAACTAAGTGGATCTGTGCTTCGCTTGAAAGGGGATTCATCTCCGCTGATTTTCGATCCAACACGAGTCACCCAGCTCTCCTGGAAACCCAGGGCATTTTTGTATA
AGGGATTTTTATCTGATAAGGAATGTGATCATCTAATCGATCTGGCTAAGGATAAATTAGAGAAGTCGATGGTAGCAGATAATGAGTCTGGTAAGAGTGTTAGTAGTGAA
GTCCGGACGAGTTCTGGCATGTTCCTTCAGAGAGCCCAGGATAAAGTGGTTGCTGACATTGAGGCCAGAATTTCTGCATGGGCATTCCTTCCAGCAGAAAATGGGGAGTC
CATTCAAATTCTGCACTATGAGAATGGTCAAAAGTATGAACCGCATTTTGATTTTTTTCACGACAAGGTGAATCAGGAGTTAGGTGGTCACCGAATAGCCACAGTCTTGA
TGTATTTATCCAATGTTGAAAAGGGTGGAGAAACCATCTTTCCAAATTCAGAGTTTAAAGAATCTCAAGAAAAGGATGACAGTTGGTCTGATTGTGCTCGAAAGGGTTAT
GCAGTTAAAGCACAGAAAGGTGATGCATTGCTGTTCTTTAGCCTCCATCTCGATGCAACGACAGATACGAGAAGCTTGCACGGTAGTTGCCCTGTGATCGAGGGCGAGAA
ATGGTCTGCAACCAAGTGGATTCATGTGAGATCATTCGAGAAGCCAACTCGTGCAAGTAAGGGTTGCGTGGACGAGAATGAAAATTGCCCTATGTGGGCCAAAAGGGGTG
AGTGCAAAAAGAACCCTACGTACATGGTGGGTTCAGAAGGTGGTTTAGGATACTGTAGGAAGAGTTGCAAAGCATGTTAAACTAGGAAGAAGAAGAAGAAGAAGTCCACA
TCTCTCTCTCTCTCTCTCTCTCTCGTTTTGCAGAGGTTGAGTGTTGATTCTGTGATGGTTATGTATATAACATTGGGCAGTAACTGGGTATACAACTTAGAAGTGGTGGA
TATTACACCTCTTTGTTCAAACCTTGTATTAGCAATTAGCCAAGTGTTTCATTTGGTAATCGAAACACAATGAGAAATTTTTCTCAGTAATCTCATTTACTATACTATTG
AAGTATTGG
Protein sequenceShow/hide protein sequence
MDSRRLLTFSLCFLFLFTGFARLPELNTHKKLSGSVLRLKGDSSPLIFDPTRVTQLSWKPRAFLYKGFLSDKECDHLIDLAKDKLEKSMVADNESGKSVSSEVRTSSGMF
LQRAQDKVVADIEARISAWAFLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQEKDDSWSDCARKGYAVKAQKGD
ALLFFSLHLDATTDTRSLHGSCPVIEGEKWSATKWIHVRSFEKPTRASKGCVDENENCPMWAKRGECKKNPTYMVGSEGGLGYCRKSCKAC