; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg23177 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg23177
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationCarg_Chr15:7830641..7833224
RNA-Seq ExpressionCarg23177
SyntenyCarg23177
Gene Ontology termsGO:0019511 - peptidyl-proline hydroxylation (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR003582 - ShKT domain
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579383.1 putative prolyl 4-hydroxylase 12, partial [Cucurbita argyrosperma subsp. sororia]2.0e-182100Show/hide
Query:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR
        MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR
Subjt:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR

Query:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF
        NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF
Subjt:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF

Query:  WSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS
        WSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS
Subjt:  WSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS

Query:  PDYYGTCRKSCNACG
        PDYYGTCRKSCNACG
Subjt:  PDYYGTCRKSCNACG

XP_022922237.1 probable prolyl 4-hydroxylase 12 [Cucurbita moschata]2.5e-18098.73Show/hide
Query:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR
        MDSRL FLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR
Subjt:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR

Query:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF
        NTVSTKFLGNSGAILNTTDDII RIENRIAVWTFLPKDHSMPFQIM+YGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF
Subjt:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF

Query:  WSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS
        WSD+RKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS
Subjt:  WSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS

Query:  PDYYGTCRKSCNACG
        PDYYGTCRKSCNACG
Subjt:  PDYYGTCRKSCNACG

XP_022973003.1 probable prolyl 4-hydroxylase 12 [Cucurbita maxima]2.1e-17998.41Show/hide
Query:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR
        MDSRLNFLLLLAAAFSF SCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR
Subjt:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR

Query:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF
        NTVSTKFLGNSGAILNTTDDIIARIENRIAVW FLPKDHSMPFQIMQYGGEEAAG KYFFGNRSAMPSSEPLMATVVLYLSDSA+GGEILFPVSKVKRRF
Subjt:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF

Query:  WSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS
        WSD+RKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS
Subjt:  WSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS

Query:  PDYYGTCRKSCNACG
        PDYYGTCRKSCNACG
Subjt:  PDYYGTCRKSCNACG

XP_023549812.1 probable prolyl 4-hydroxylase 12 [Cucurbita pepo subsp. pepo]1.6e-17997.78Show/hide
Query:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR
        MDSRLNFLLL AAAFSFSSCLAQSNS+SGRKGLRDQMVNSGHLSYSNH ERIDPSRVVQISWQPR FLYKGFLSDEECDHLIALASNSEDKPSR+NAGSR
Subjt:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR

Query:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF
        NTVSTKFLGNSGA+LNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF
Subjt:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF

Query:  WSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS
        WSD+RKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS
Subjt:  WSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS

Query:  PDYYGTCRKSCNACG
        PDYYGTCRKSCNACG
Subjt:  PDYYGTCRKSCNACG

XP_038906497.1 probable prolyl 4-hydroxylase 12 [Benincasa hispida]2.6e-14582.8Show/hide
Query:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR
        MDSRLNFLLLLA AFSFS+CLAQSN ISGRKGLRDQ+V+   LSYSNHS RIDPSRVVQ+SWQPR FLYKGFLSDEECDHLI+LASNSED PS N+AGS 
Subjt:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR

Query:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF
        NTVSTK L +SG ILNT+DDIIARIEN+IAVWTFLPKDH MPFQIMQY GEEA  HKYF+GN SAM SSEPLMATVVLYLSDSA GGE+LFP SKVK +F
Subjt:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF

Query:  WSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS
        WSD+RKKNNFLRPVKGNA+LFFSVHLNASPDKS YH+R+PIL+G+LWVATKFFY+RP  TGN+  VES V D CIDED+SCP+WAAIGEC+RN VFMIGS
Subjt:  WSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS

Query:  PDYYGTCRKSCNAC
        PDYYGTCRKSCNAC
Subjt:  PDYYGTCRKSCNAC

TrEMBL top hitse value%identityAlignment
A0A1S3AT39 Procollagen-proline 4-dioxygenase2.4e-14180.63Show/hide
Query:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR
        MDSRLNFLLL A AFSFS+CLAQSN ISGRKGLRDQ+V+   LSYSN S RIDPSRVVQ+SW+PR FLYKGFLSDEECDHLI+LASNSED PSRN+AGS 
Subjt:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR

Query:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAM-PSSEPLMATVVLYLSDSASGGEILFPVSKVKRR
        NTVST+ L  SG ILNTTDDIIARIENRIAVWT LPKDH MPFQIMQY GEEA  HKYF+GNRSAM  SSEPLMATVVLYLSDSASGGE+LFP SKVK +
Subjt:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAM-PSSEPLMATVVLYLSDSASGGEILFPVSKVKRR

Query:  FWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIG
        FWS +RKK NFLRPVKGNA+LFFSVHLNASPDKS YH R PI +G+LWVATKF Y+RP  TGN+H ++S + D CIDED+SCP+WAAIGEC+RNAVFM+G
Subjt:  FWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIG

Query:  SPDYYGTCRKSCNAC
        SPDYYGTCRKSCNAC
Subjt:  SPDYYGTCRKSCNAC

A0A5A7TKX1 Procollagen-proline 4-dioxygenase1.6e-14080Show/hide
Query:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR
        MDSRLNFLLL A AFSFS+CLAQSN ISGRKGLRDQ+V+   LSYSN S RIDPSRVVQ+SW+PR FLYKGFLSD+ECDHLI+LASNS+D PSRN+AGS 
Subjt:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR

Query:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAM-PSSEPLMATVVLYLSDSASGGEILFPVSKVKRR
        NTVST+ L  SG ILNTTDDIIARIENRIAVWT LPKDH MPFQIMQY GEEA  HKYF+GNRSAM  SSEPLMATVVLYLSDSASGGE+LFP SKVK +
Subjt:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAM-PSSEPLMATVVLYLSDSASGGEILFPVSKVKRR

Query:  FWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIG
        FWS +RKK NFLRPVKGNA+LFFSVHLNASPDKS YH R PI +G+LWVATKF Y+RP  TGN+H ++S + D CIDED+SCP+WAAIGEC+RNAVFM+G
Subjt:  FWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIG

Query:  SPDYYGTCRKSCNAC
        SPDYYGTCRKSCNAC
Subjt:  SPDYYGTCRKSCNAC

A0A6J1E0X9 Procollagen-proline 4-dioxygenase4.6e-14080.57Show/hide
Query:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR
        MDSRL  LLLLA A SF SCLAQSN ISGRKGLRDQ++ S  LSYSNHS RIDPSRVVQ+SW+PR FLYKGFLSDEECDHLI+LA++SEDKPS N+  S 
Subjt:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR

Query:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF
        NTV TK L +SGAILNTTDDIIARIENRIAVWTFLPKD+SMP QI+QYGGEEA  HKY FGNRSAM SSEPLMATVVLYLSDSASGGE+ FP SKVK RF
Subjt:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF

Query:  WSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS
        WSD+RKKNN LRPVKGNAVL FSVHLNASPDKS  H+R+PILDG+LW+ATKFFY+RP  TGN+H  E   D DC DED+SCP+WAAIGEC+RNAVFMIGS
Subjt:  WSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS

Query:  PDYYGTCRKSCNAC
        PDYYGTCRKSCNAC
Subjt:  PDYYGTCRKSCNAC

A0A6J1E2P0 Procollagen-proline 4-dioxygenase1.2e-18098.73Show/hide
Query:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR
        MDSRL FLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR
Subjt:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR

Query:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF
        NTVSTKFLGNSGAILNTTDDII RIENRIAVWTFLPKDHSMPFQIM+YGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF
Subjt:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF

Query:  WSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS
        WSD+RKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS
Subjt:  WSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS

Query:  PDYYGTCRKSCNACG
        PDYYGTCRKSCNACG
Subjt:  PDYYGTCRKSCNACG

A0A6J1IBS3 Procollagen-proline 4-dioxygenase1.0e-17998.41Show/hide
Query:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR
        MDSRLNFLLLLAAAFSF SCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR
Subjt:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSR

Query:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF
        NTVSTKFLGNSGAILNTTDDIIARIENRIAVW FLPKDHSMPFQIMQYGGEEAAG KYFFGNRSAMPSSEPLMATVVLYLSDSA+GGEILFPVSKVKRRF
Subjt:  NTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRF

Query:  WSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS
        WSD+RKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS
Subjt:  WSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGS

Query:  PDYYGTCRKSCNACG
        PDYYGTCRKSCNACG
Subjt:  PDYYGTCRKSCNACG

SwissProt top hitse value%identityAlignment
F4J0A8 Probable prolyl 4-hydroxylase 63.7e-5439.35Show/hide
Query:  SNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPS-RNNAGSRNTVSTKFLGNSGAIL-NTTDDIIARIENRIAVWTFLPKDHSMPF
        S+ S  +DP+R+ Q+SW PRAFLYKGFLSDEECDHLI LA    +K     +  S  +  ++   +SG  L    DDI+A +E ++A WTFLP+++    
Subjt:  SNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPS-RNNAGSRNTVSTKFLGNSGAIL-NTTDDIIARIENRIAVWTFLPKDHSMPF

Query:  QIMQY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFP-----VSKVKRRFWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHS
        QI+ Y  G++   H  +F ++ A+      +ATV++YLS+   GGE +FP       ++K   WS   K+   ++P KG+A+LFF++HLN + D +  H 
Subjt:  QIMQY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFP-----VSKVKRRFWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHS

Query:  RTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNAC
          P+++G+ W AT++ ++R  + G +  V       C+D+ ESC +WA  GEC++N ++M+GS    G CRKSC AC
Subjt:  RTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNAC

F4JAU3 Prolyl 4-hydroxylase 23.1e-4836.11Show/hide
Query:  MVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALA-SNSEDKPSRNNAGSRNTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFL
        ++ S     S+ S  I+PS+V Q+S +PRAF+Y+GFL+D ECDHLI+LA  N +     +N    + VS     +   I    D I++ IE++++ WTFL
Subjt:  MVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALA-SNSEDKPSRNNAGSRNTVSTKFLGNSGAILNTTDDIIARIENRIAVWTFL

Query:  PKDHSMPFQIMQY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFP-VSKVKRRFWSDQR-------KKNNFLRPVKGNAVLFFSVHL
        PK++    Q+++Y  G++   H  +F ++  +      +ATV+LYLS+   GGE +FP   +  RR  S+ +       KK   ++P KGNA+LFF++  
Subjt:  PKDHSMPFQIMQY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFP-VSKVKRRFWSDQR-------KKNNFLRPVKGNAVLFFSVHL

Query:  NASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNAC
        +A PD    H   P+++G+ W ATK+ ++        H      D +C D +ESC +WA +GEC +N  +M+G+P+  G CR+SC AC
Subjt:  NASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNAC

Q8GXT7 Probable prolyl 4-hydroxylase 121.3e-6243.27Show/hide
Query:  FLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNS----GHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNT
        FL+L+    S S       S   RK LRD+ + S       SY   S+ +DP+RV+Q+SW PR FLY+GFLS+EECDHLI+L   + +  S +  G    
Subjt:  FLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNS----GHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNT

Query:  VSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRFWS
                        D ++A IE +++ WTFLP ++    ++  Y  E++     +FG   +    E L+ATVVLYLS++  GGE+LFP S++K +  +
Subjt:  VSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRFWS

Query:  DQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPD
           +  N LRPVKGNA+LFF+  LNAS D    H R P++ G+L VATK  Y +  A       ESG   +C DEDE+C +WA +GECK+N V+MIGSPD
Subjt:  DQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPD

Query:  YYGTCRKSCNAC
        YYGTCRKSCNAC
Subjt:  YYGTCRKSCNAC

Q8L970 Probable prolyl 4-hydroxylase 77.0e-5336.11Show/hide
Query:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSG---HLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNA
        MDSR    + LA +  F   L   +S   R   R      G    +  S  S   DP+RV Q+SW PR FLY+GFLSDEECDH I LA    +K    + 
Subjt:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSG---HLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNA

Query:  GSRNTVSTKFLGNSGAILN-TTDDIIARIENRIAVWTFLPKDHSMPFQIMQY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPV--
         S  +V ++   +SG  L+   DDI++ +E ++A WTFLP+++    QI+ Y  G++   H  +F +++ +      +ATV++YLS+   GGE +FP+  
Subjt:  GSRNTVSTKFLGNSGAILN-TTDDIIARIENRIAVWTFLPKDHSMPFQIMQY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPV--

Query:  ---SKVKRRFWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGEC
           +++K   W++  K+   ++P KG+A+LFF++H NA+ D +  H   P+++G+ W AT++ +++      E A        C+DE+ SC KWA  GEC
Subjt:  ---SKVKRRFWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGEC

Query:  KRNAVFMIGSPDYYGTCRKSCNAC
        ++N  +M+GS   +G CRKSC AC
Subjt:  KRNAVFMIGSPDYYGTCRKSCNAC

Q8LAN3 Probable prolyl 4-hydroxylase 48.3e-4634.41Show/hide
Query:  SNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNTVSTKFLGNSGAILNT-TDDIIARIENRIAVWTFLPKDHSMPFQ
        S+ S  ++PS+V Q+S +PRAF+Y+GFL++ ECDH+++LA  S  + +  +  S  +  ++   +SG  ++   D I++ IE++I+ WTFLPK++    Q
Subjt:  SNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNTVSTKFLGNSGAILNT-TDDIIARIENRIAVWTFLPKDHSMPFQ

Query:  IMQY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRR--------FWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCY
        +++Y  G++   H  +F ++  +      MAT+++YLS+   GGE +FP +++  R          SD  K+   ++P KG+A+LFF++H +A PD    
Subjt:  IMQY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRR--------FWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCY

Query:  HSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNAC
        H   P+++G+ W ATK+ ++    + +     SG   +C D +ESC +WA +GEC +N  +M+G+ +  G CR+SC AC
Subjt:  HSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNAC

Arabidopsis top hitse value%identityAlignment
AT3G28480.1 Oxoglutarate/iron-dependent oxygenase5.0e-5436.11Show/hide
Query:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSG---HLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNA
        MDSR    + LA +  F   L   +S   R   R      G    +  S  S   DP+RV Q+SW PR FLY+GFLSDEECDH I LA    +K    + 
Subjt:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSG---HLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNA

Query:  GSRNTVSTKFLGNSGAILN-TTDDIIARIENRIAVWTFLPKDHSMPFQIMQY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPV--
         S  +V ++   +SG  L+   DDI++ +E ++A WTFLP+++    QI+ Y  G++   H  +F +++ +      +ATV++YLS+   GGE +FP+  
Subjt:  GSRNTVSTKFLGNSGAILN-TTDDIIARIENRIAVWTFLPKDHSMPFQIMQY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPV--

Query:  ---SKVKRRFWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGEC
           +++K   W++  K+   ++P KG+A+LFF++H NA+ D +  H   P+++G+ W AT++ +++      E A        C+DE+ SC KWA  GEC
Subjt:  ---SKVKRRFWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGEC

Query:  KRNAVFMIGSPDYYGTCRKSCNAC
        ++N  +M+GS   +G CRKSC AC
Subjt:  KRNAVFMIGSPDYYGTCRKSCNAC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase1.6e-5235.63Show/hide
Query:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSG---HLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALA------SNSEDK
        MDSR    + LA +  F   L   +S   R   R      G    +  S  S   DP+RV Q+SW PR FLY+GFLSDEECDH I LA      S   D 
Subjt:  MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSG---HLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALA------SNSEDK

Query:  PSRNNAGSRNTV-----STKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSAS
         S  +  S ++V     S+ F+ N  ++    DDI++ +E ++A WTFLP+++    QI+ Y  G++   H  +F +++ +      +ATV++YLS+   
Subjt:  PSRNNAGSRNTV-----STKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSAS

Query:  GGEILFPV-----SKVKRRFWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDES
        GGE +FP+     +++K   W++  K+   ++P KG+A+LFF++H NA+ D +  H   P+++G+ W AT++ +++      E A        C+DE+ S
Subjt:  GGEILFPV-----SKVKRRFWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDES

Query:  CPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNAC
        C KWA  GEC++N  +M+GS   +G CRKSC AC
Subjt:  CPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNAC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase2.7e-5539.35Show/hide
Query:  SNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPS-RNNAGSRNTVSTKFLGNSGAIL-NTTDDIIARIENRIAVWTFLPKDHSMPF
        S+ S  +DP+R+ Q+SW PRAFLYKGFLSDEECDHLI LA    +K     +  S  +  ++   +SG  L    DDI+A +E ++A WTFLP+++    
Subjt:  SNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPS-RNNAGSRNTVSTKFLGNSGAIL-NTTDDIIARIENRIAVWTFLPKDHSMPF

Query:  QIMQY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFP-----VSKVKRRFWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHS
        QI+ Y  G++   H  +F ++ A+      +ATV++YLS+   GGE +FP       ++K   WS   K+   ++P KG+A+LFF++HLN + D +  H 
Subjt:  QIMQY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFP-----VSKVKRRFWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHS

Query:  RTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNAC
          P+++G+ W AT++ ++R  + G +  V       C+D+ ESC +WA  GEC++N ++M+GS    G CRKSC AC
Subjt:  RTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNAC

AT4G25600.1 Oxoglutarate/iron-dependent oxygenase9.1e-6443.27Show/hide
Query:  FLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNS----GHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNT
        FL+L+    S S       S   RK LRD+ + S       SY   S+ +DP+RV+Q+SW PR FLY+GFLS+EECDHLI+L   + +  S +  G    
Subjt:  FLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNS----GHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNT

Query:  VSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRFWS
                        D ++A IE +++ WTFLP ++    ++  Y  E++     +FG   +    E L+ATVVLYLS++  GGE+LFP S++K +  +
Subjt:  VSTKFLGNSGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRFWS

Query:  DQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPD
           +  N LRPVKGNA+LFF+  LNAS D    H R P++ G+L VATK  Y +  A       ESG   +C DEDE+C +WA +GECK+N V+MIGSPD
Subjt:  DQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPD

Query:  YYGTCRKSCNAC
        YYGTCRKSCNAC
Subjt:  YYGTCRKSCNAC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein5.9e-4734.41Show/hide
Query:  SNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNTVSTKFLGNSGAILNT-TDDIIARIENRIAVWTFLPKDHSMPFQ
        S+ S  ++PS+V Q+S +PRAF+Y+GFL++ ECDH+++LA  S  + +  +  S  +  ++   +SG  ++   D I++ IE++I+ WTFLPK++    Q
Subjt:  SNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNTVSTKFLGNSGAILNT-TDDIIARIENRIAVWTFLPKDHSMPFQ

Query:  IMQY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRR--------FWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCY
        +++Y  G++   H  +F ++  +      MAT+++YLS+   GGE +FP +++  R          SD  K+   ++P KG+A+LFF++H +A PD    
Subjt:  IMQY-GGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRR--------FWSDQRKKNNFLRPVKGNAVLFFSVHLNASPDKSCY

Query:  HSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNAC
        H   P+++G+ W ATK+ ++    + +     SG   +C D +ESC +WA +GEC +N  +M+G+ +  G CR+SC AC
Subjt:  HSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTCGTCTCAACTTTTTGCTTCTTTTAGCGGCCGCATTTTCATTCTCGAGCTGCCTTGCACAAAGCAATTCGATTAGTGGCCGTAAGGGTTTAAGGGACCAAAT
GGTTAACAGTGGACATTTGAGCTACTCAAATCATTCTGAAAGAATCGACCCATCACGAGTTGTCCAAATCTCTTGGCAACCAAGGGCCTTCTTGTATAAAGGCTTTCTCT
CAGATGAGGAGTGTGATCACCTTATTGCTTTGGCTTCAAATTCTGAAGATAAACCTTCTAGGAACAATGCTGGTTCCAGGAACACTGTCTCAACCAAATTTCTAGGCAAT
TCAGGAGCTATTTTAAACACAACAGATGATATCATTGCCAGGATTGAAAATAGAATTGCGGTGTGGACTTTTCTCCCAAAAGATCATAGCATGCCTTTCCAGATTATGCA
ATACGGGGGTGAAGAAGCAGCAGGGCATAAGTACTTTTTTGGCAACAGATCTGCAATGCCATCCAGTGAACCGTTGATGGCCACGGTAGTTTTGTATCTATCAGATTCGG
CCAGCGGTGGCGAGATTCTGTTCCCAGTATCAAAGGTAAAGAGAAGATTTTGGTCAGACCAGAGAAAGAAAAACAACTTTCTGAGACCAGTGAAAGGCAATGCAGTTCTT
TTTTTCTCTGTTCATCTTAATGCTTCTCCAGACAAGAGTTGCTACCATTCCCGAACGCCAATACTCGATGGGAAATTGTGGGTTGCTACAAAATTCTTCTACATAAGACC
AGCAGCCACTGGGAATGAACACGCAGTTGAATCCGGTGTAGACGACGACTGCATTGATGAAGATGAAAGCTGCCCCAAATGGGCTGCCATCGGCGAATGCAAACGAAACG
CGGTGTTCATGATCGGTTCTCCAGATTACTATGGCACATGTAGAAAAAGCTGCAACGCATGTGGATGA
mRNA sequenceShow/hide mRNA sequence
GATTCATGGTAGTACAGAGGAGACCATGGATTTGTTCAGCAGCAGCTTCCTGAAAATCTGTATCCACAACTTCTAATTCTTCATACCTTTTCAACTTCCTCTTCCTCGTC
TCTTCTTTTTGGATCTTCGTCTCACCCATCCATGGATTCTCGTCTCAACTTTTTGCTTCTTTTAGCGGCCGCATTTTCATTCTCGAGCTGCCTTGCACAAAGCAATTCGA
TTAGTGGCCGTAAGGGTTTAAGGGACCAAATGGTTAACAGTGGACATTTGAGCTACTCAAATCATTCTGAAAGAATCGACCCATCACGAGTTGTCCAAATCTCTTGGCAA
CCAAGGGCCTTCTTGTATAAAGGCTTTCTCTCAGATGAGGAGTGTGATCACCTTATTGCTTTGGCTTCAAATTCTGAAGATAAACCTTCTAGGAACAATGCTGGTTCCAG
GAACACTGTCTCAACCAAATTTCTAGGCAATTCAGGAGCTATTTTAAACACAACAGATGATATCATTGCCAGGATTGAAAATAGAATTGCGGTGTGGACTTTTCTCCCAA
AAGATCATAGCATGCCTTTCCAGATTATGCAATACGGGGGTGAAGAAGCAGCAGGGCATAAGTACTTTTTTGGCAACAGATCTGCAATGCCATCCAGTGAACCGTTGATG
GCCACGGTAGTTTTGTATCTATCAGATTCGGCCAGCGGTGGCGAGATTCTGTTCCCAGTATCAAAGGTAAAGAGAAGATTTTGGTCAGACCAGAGAAAGAAAAACAACTT
TCTGAGACCAGTGAAAGGCAATGCAGTTCTTTTTTTCTCTGTTCATCTTAATGCTTCTCCAGACAAGAGTTGCTACCATTCCCGAACGCCAATACTCGATGGGAAATTGT
GGGTTGCTACAAAATTCTTCTACATAAGACCAGCAGCCACTGGGAATGAACACGCAGTTGAATCCGGTGTAGACGACGACTGCATTGATGAAGATGAAAGCTGCCCCAAA
TGGGCTGCCATCGGCGAATGCAAACGAAACGCGGTGTTCATGATCGGTTCTCCAGATTACTATGGCACATGTAGAAAAAGCTGCAACGCATGTGGATGA
Protein sequenceShow/hide protein sequence
MDSRLNFLLLLAAAFSFSSCLAQSNSISGRKGLRDQMVNSGHLSYSNHSERIDPSRVVQISWQPRAFLYKGFLSDEECDHLIALASNSEDKPSRNNAGSRNTVSTKFLGN
SGAILNTTDDIIARIENRIAVWTFLPKDHSMPFQIMQYGGEEAAGHKYFFGNRSAMPSSEPLMATVVLYLSDSASGGEILFPVSKVKRRFWSDQRKKNNFLRPVKGNAVL
FFSVHLNASPDKSCYHSRTPILDGKLWVATKFFYIRPAATGNEHAVESGVDDDCIDEDESCPKWAAIGECKRNAVFMIGSPDYYGTCRKSCNACG