; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC01G004780 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC01G004780
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionProcollagen-proline 4-dioxygenase
Genome locationCicolChr01:4785804..4807204
RNA-Seq ExpressionCcUC01G004780
SyntenyCcUC01G004780
Gene Ontology termsGO:0016042 - lipid catabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR002921 - Fungal lipase-like domain
IPR005592 - Mono-/di-acylglycerol lipase, N-terminal
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR007749 - Protein of unknown function DUF677
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4271435.1 unnamed protein product [Prunus armeniaca]3.1e-26857.74Show/hide
Query:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY
        MSILCG P++ECV CL C RW WKRCLHTAGHDSETWG AT +EFEP+PR+CRYILAVYEDD+R+PLWEP GGYGI PDWL++KKTY+DTQG+APPYILY
Subjt:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY

Query:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN
        LDHDHADIVLA RGLN+A+ESDYAVL+DN+LGKKKFDGGYVHNGLLKAA WVLD E E LKDLV++YP+YTLTF GHSLGSGVAA+LT+VVVQ+R++L N
Subjt:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL
        IDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ        TPLEDIFKSLF                                          LPCL
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL

Query:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK
        LC+RC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFR GRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL+LM + D++ME+PP+QK
Subjt:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK

Query:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKKKETWDELIERLYDKDDSRHTVLKKSLSKCWIIFEDVDFL
        MERQ+TLA+EH+EEY+AALQRAVTLAVPHAY+ S YGTF + D  EEE S  SSG SS    +     + R      SR  +  +  +K           
Subjt:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKKKETWDELIERLYDKDDSRHTVLKKSLSKCWIIFEDVDFL

Query:  GFNCLKMIENMPRSSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSWRPRVFLYKG
                                       ++ +HF                                    +S HS RIDPSR VQ+SWRPRVFLY+G
Subjt:  GFNCLKMIENMPRSSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSWRPRVFLYKG

Query:  FLSDEECDHLISLASNSEDN------PSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEP
        FLSDEECDHL+SLA   E+N        GNT + ++  S  + LN  D+I++RIE RI+ WTFLPK++S   Q+ +   EEAE    F+GN+S +  SEP
Subjt:  FLSDEECDHLISLASNSEDN------PSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEP

Query:  LMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKNNFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTIESDV
        L+ATV+LY+S+ +RGG +LFPES++ S+ WS+  K ++ L+P KGNA+LFF++  NASPDKSS H+R P+L GE+W ATKF Y +   G  +K + +S+ 
Subjt:  LMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKNNFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTIESDV

Query:  DGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY
          C DED +CP WA+IGEC+RN VFM+GSPDYY
Subjt:  DGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY

CAB4301873.1 unnamed protein product [Prunus armeniaca]4.8e-26957.86Show/hide
Query:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY
        MSILCG P++ECV CL C RW WKRCLHTAGHDSETWG AT +EFEP+PR+CRYILAVYEDD+R+PLWEP GGYGI PDWL++KKTY+DTQG+APPYILY
Subjt:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY

Query:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN
        LDHDHADIVLA RGLN+A+ESDYAVL+DN+LGKKKFDGGYVHNGLLKAA WVLD E E LKDLV++YP+YTLTF GHSLGSGVAA+LT+VVVQ+R++L N
Subjt:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL
        IDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ        TPLEDIFKSLF                                          LPCL
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL

Query:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK
        LC+RC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFR GRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL+LM + D++ME+PP+QK
Subjt:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK

Query:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKKKETWDELIERLYDKDDSRHTVLKKSLSKCWIIFEDVDFL
        MERQ+TLA+EH+EEY+AALQRAVTLAVPHAY+ S YGTF + D  EEE S  SSG SS    +     + R      SR  +  +  +K           
Subjt:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKKKETWDELIERLYDKDDSRHTVLKKSLSKCWIIFEDVDFL

Query:  GFNCLKMIENMPRSSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSWRPRVFLYKG
                                       ++ +HF                                    +S HS RIDPSR VQ+SWRPRVFLY+G
Subjt:  GFNCLKMIENMPRSSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSWRPRVFLYKG

Query:  FLSDEECDHLISLASNSEDN------PSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEP
        FLSDEECDHL+SLA   E+N        GNT + ++ KS  + LN  D+I++RIE RI+ WTFLPK++S   Q+ +   EEAE    F+GN+S +  SEP
Subjt:  FLSDEECDHLISLASNSEDN------PSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEP

Query:  LMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKNNFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTIESDV
        L+ATV+LY+S+ +RGG +LFPES++ S+ WS+  K ++ L+P KGNA+LFF++  NASPDKSS H+R P+L GE+W ATKF Y +   G  +K + +S+ 
Subjt:  LMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKNNFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTIESDV

Query:  DGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY
          C DED +CP WA+IGEC+RN VFM+GSPDYY
Subjt:  DGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY

PON44192.1 Mono-/di-acylglycerol lipase [Trema orientale]1.2e-26758.63Show/hide
Query:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY
        MSILCG+P+LECV CL CARW WKRCLHTAGHDSETWG AT +EFEP+PRIC YILAVYEDD+R PLWEP  GYGINPDWL++K+TY+DT+G+APPYILY
Subjt:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY

Query:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN
        LDHDHADIVLA RGLN+AKESDYAVLLDN+LGK+KFDGGYVHNGLLKAAGWVL TE++ LKDLV++YP+YTLTFAGHSLGSGVAA+LT+V VQNR+KL N
Subjt:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL
        IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLF                                          LPCL
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL

Query:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK
        LCLRC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFR GRFPPVV+TAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ ALELM + D +ME+P +Q+
Subjt:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK

Query:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPA-SSGGSSRKK--KETWDELIERLYDKDDSRHTVLKK---SLSKCWIIF
        MERQ+TLA+E SEEYKAALQRAVTLAVPHAY+ S YGTF          +P  SS GSSRK   KETWDELIERL+DKDDS H +  +   S SK   IF
Subjt:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPA-SSGGSSRKK--KETWDELIERLYDKDDSRHTVLKK---SLSKCWIIF

Query:  EDVDFLGFNCLKMIENMPRSSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSWRPR
            F GF   ++      S  F G+      L  S    L F  L          L    L S   G + Q  ER                        
Subjt:  EDVDFLGFNCLKMIENMPRSSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSWRPR

Query:  VFLYKGFLSDEECDHLISLASNSEDNPSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEP
                         +  S+ + + SG+T+  ++LKSSG      DDI++ IE RI+ WTFLPK++    Q++ Y  E++E    ++GN S +   +P
Subjt:  VFLYKGFLSDEECDHLISLASNSEDNPSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEP

Query:  LMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKN-NFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTIESD
        L+ATVVLYLS+ ++GG +LFP+S+V  K WS+  K + N  RP+KGNA+LFF+++ N + D SS H R P+L GE+W ATKFF ++ T G  +K ++ESD
Subjt:  LMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKN-NFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTIESD

Query:  VDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY
         + C D+D++CP WAA+GEC+RN VFM+GSPDYY
Subjt:  VDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY

PON51727.1 Mono-/di-acylglycerol lipase [Parasponia andersonii]1.3e-26657.83Show/hide
Query:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY
        MSILCG+P+LECV CL CARW WKRCLHTAGHDSETWG AT +EFEP+PRIC YILAVYEDD+R+PLWEP  GYGINPDWL++K+TY+DT+G+APPYILY
Subjt:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY

Query:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN
        LDHDHADIVLA RGLN+AKESDYAVLLDN+LGK+KFDGGYVHNGLLKAAGWVL TE++ LKDLV+RYP+YTLTFAGHSLGSGVAA+LT+V VQNR+KL N
Subjt:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL
        IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLF                                          LPCL
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL

Query:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK
        LCLRC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFR GRFPPVV+TAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ ALEL+ + D +ME+P +Q+
Subjt:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK

Query:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKK---KETWDELIERLYDKDDSRHTVLKK---SLSKCWIIF
        MERQ+TLA+E SEEYKAALQRAVTLAVPHAY+ S YGTF          +P  S   S +K   KETWDELIERL+DKDDS H    +   S SK  I+ 
Subjt:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKK---KETWDELIERLYDKDDSRHTVLKK---SLSKCWIIF

Query:  EDVDFLGFNCLKMIENMPR---SSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSW
         +    GF+     EN      S  F G+      L+ S    L F  L     S  T L    L S   G + Q  ER                     
Subjt:  EDVDFLGFNCLKMIENMPR---SSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSW

Query:  RPRVFLYKGFLSDEECDHLISLASNSEDNPSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSS
                            +  S+ + + SG+T+   +L SSG      DD+++ IE RI+ WTFLPK++    Q++ Y  E++E    ++GN S +  
Subjt:  RPRVFLYKGFLSDEECDHLISLASNSEDNPSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSS

Query:  SEPLMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKNNFL-RPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTI
        S+PL+ATVVLYLS+ ++GG +LFP+S+V  K WS+  K +  + RP+KGNA+LFF+++ N + D SS H R P++ GE+W ATKFF ++ T G  +K ++
Subjt:  SEPLMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKNNFL-RPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTI

Query:  ESDVDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY
        ESD + C D+D++CP WAA+GEC+RN VFM+GSPDYY
Subjt:  ESDVDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY

RXH95088.1 hypothetical protein DVH24_024772 [Malus domestica]3.7e-26957.95Show/hide
Query:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY
        MSILC  P+LECV CL C RW WKRCLHTAGHDSETWG +T +EFEP+PR+CRYILAVYEDD+R PLWEP GGYGINPDWL++KKTY+DT G APPYILY
Subjt:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY

Query:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN
        LDH+HADIVLA RGLN+A+ESDYAVL+DN+LG++KFDGGYVHNGLLK+A WV+D E E LKDLV+ YP+YTLTFAGHSLGSGVAA+LT+VVV+NR++L +
Subjt:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL
        IDRKR+R YAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIF                                             +LPC+
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL

Query:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK
        LCLRC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL+LM + D +ME+P +Q+
Subjt:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK

Query:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKKKETWDELIERLYDKDDSRHTVLKKSLSKCWIIFEDVDFL
        MERQ+TLA+EH+EEYKAALQRAVTLAVPHAY+ SPYGTF + D  EE+ S  SSG SS                                          
Subjt:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKKKETWDELIERLYDKDDSRHTVLKKSLSKCWIIFEDVDFL

Query:  GFNCLKMIENMPRSSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLR-DQLVERPL---SYSNHSGRIDPSRVVQVSWRPRVF
         F   K      +S +F      S     SM S     LLLS+  SF +  A+ +    RK LR +Q ++  +    +S HS RIDPSRVVQ+SW+PR  
Subjt:  GFNCLKMIENMPRSSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLR-DQLVERPL---SYSNHSGRIDPSRVVQVSWRPRVF

Query:  LYKGFLSDEECDHLISLASNSED------NPSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMS
              SDEECDHL+SLA   ED      +  GNT + +++KS  + L+  D++++RIE RI+ WTFLPK++S   Q+  +  EE +  + ++GN+S + 
Subjt:  LYKGFLSDEECDHLISLASNSED------NPSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMS

Query:  SSEPLMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKNNFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTI
         +EPL+ATV+LYLS+ +RGG +LFPES++ SK  S+ R+ ++ LRPVKGNA+LFF++H NASPDKSS HTR P+L GE+W ATKF + +   G  +K + 
Subjt:  SSEPLMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKNNFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTI

Query:  ESDVDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY
        +S    C DED +CP+WA++GEC+RN VFM+GSPDYY
Subjt:  ESDVDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY

TrEMBL top hitse value%identityAlignment
A0A2P5B5Y0 Procollagen-proline 4-dioxygenase5.7e-26858.63Show/hide
Query:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY
        MSILCG+P+LECV CL CARW WKRCLHTAGHDSETWG AT +EFEP+PRIC YILAVYEDD+R PLWEP  GYGINPDWL++K+TY+DT+G+APPYILY
Subjt:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY

Query:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN
        LDHDHADIVLA RGLN+AKESDYAVLLDN+LGK+KFDGGYVHNGLLKAAGWVL TE++ LKDLV++YP+YTLTFAGHSLGSGVAA+LT+V VQNR+KL N
Subjt:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL
        IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLF                                          LPCL
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL

Query:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK
        LCLRC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFR GRFPPVV+TAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ ALELM + D +ME+P +Q+
Subjt:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK

Query:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPA-SSGGSSRKK--KETWDELIERLYDKDDSRHTVLKK---SLSKCWIIF
        MERQ+TLA+E SEEYKAALQRAVTLAVPHAY+ S YGTF          +P  SS GSSRK   KETWDELIERL+DKDDS H +  +   S SK   IF
Subjt:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPA-SSGGSSRKK--KETWDELIERLYDKDDSRHTVLKK---SLSKCWIIF

Query:  EDVDFLGFNCLKMIENMPRSSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSWRPR
            F GF   ++      S  F G+      L  S    L F  L          L    L S   G + Q  ER                        
Subjt:  EDVDFLGFNCLKMIENMPRSSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSWRPR

Query:  VFLYKGFLSDEECDHLISLASNSEDNPSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEP
                         +  S+ + + SG+T+  ++LKSSG      DDI++ IE RI+ WTFLPK++    Q++ Y  E++E    ++GN S +   +P
Subjt:  VFLYKGFLSDEECDHLISLASNSEDNPSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEP

Query:  LMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKN-NFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTIESD
        L+ATVVLYLS+ ++GG +LFP+S+V  K WS+  K + N  RP+KGNA+LFF+++ N + D SS H R P+L GE+W ATKFF ++ T G  +K ++ESD
Subjt:  LMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKN-NFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTIESD

Query:  VDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY
         + C D+D++CP WAA+GEC+RN VFM+GSPDYY
Subjt:  VDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY

A0A2P5BSE2 Procollagen-proline 4-dioxygenase6.3e-26757.83Show/hide
Query:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY
        MSILCG+P+LECV CL CARW WKRCLHTAGHDSETWG AT +EFEP+PRIC YILAVYEDD+R+PLWEP  GYGINPDWL++K+TY+DT+G+APPYILY
Subjt:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY

Query:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN
        LDHDHADIVLA RGLN+AKESDYAVLLDN+LGK+KFDGGYVHNGLLKAAGWVL TE++ LKDLV+RYP+YTLTFAGHSLGSGVAA+LT+V VQNR+KL N
Subjt:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL
        IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLF                                          LPCL
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL

Query:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK
        LCLRC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFR GRFPPVV+TAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ ALEL+ + D +ME+P +Q+
Subjt:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK

Query:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKK---KETWDELIERLYDKDDSRHTVLKK---SLSKCWIIF
        MERQ+TLA+E SEEYKAALQRAVTLAVPHAY+ S YGTF          +P  S   S +K   KETWDELIERL+DKDDS H    +   S SK  I+ 
Subjt:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKK---KETWDELIERLYDKDDSRHTVLKK---SLSKCWIIF

Query:  EDVDFLGFNCLKMIENMPR---SSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSW
         +    GF+     EN      S  F G+      L+ S    L F  L     S  T L    L S   G + Q  ER                     
Subjt:  EDVDFLGFNCLKMIENMPR---SSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSW

Query:  RPRVFLYKGFLSDEECDHLISLASNSEDNPSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSS
                            +  S+ + + SG+T+   +L SSG      DD+++ IE RI+ WTFLPK++    Q++ Y  E++E    ++GN S +  
Subjt:  RPRVFLYKGFLSDEECDHLISLASNSEDNPSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSS

Query:  SEPLMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKNNFL-RPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTI
        S+PL+ATVVLYLS+ ++GG +LFP+S+V  K WS+  K +  + RP+KGNA+LFF+++ N + D SS H R P++ GE+W ATKFF ++ T G  +K ++
Subjt:  SEPLMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKNNFL-RPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTI

Query:  ESDVDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY
        ESD + C D+D++CP WAA+GEC+RN VFM+GSPDYY
Subjt:  ESDVDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY

A0A498JHB5 Procollagen-proline 4-dioxygenase1.8e-26957.95Show/hide
Query:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY
        MSILC  P+LECV CL C RW WKRCLHTAGHDSETWG +T +EFEP+PR+CRYILAVYEDD+R PLWEP GGYGINPDWL++KKTY+DT G APPYILY
Subjt:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY

Query:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN
        LDH+HADIVLA RGLN+A+ESDYAVL+DN+LG++KFDGGYVHNGLLK+A WV+D E E LKDLV+ YP+YTLTFAGHSLGSGVAA+LT+VVV+NR++L +
Subjt:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL
        IDRKR+R YAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIF                                             +LPC+
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL

Query:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK
        LCLRC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL+LM + D +ME+P +Q+
Subjt:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK

Query:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKKKETWDELIERLYDKDDSRHTVLKKSLSKCWIIFEDVDFL
        MERQ+TLA+EH+EEYKAALQRAVTLAVPHAY+ SPYGTF + D  EE+ S  SSG SS                                          
Subjt:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKKKETWDELIERLYDKDDSRHTVLKKSLSKCWIIFEDVDFL

Query:  GFNCLKMIENMPRSSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLR-DQLVERPL---SYSNHSGRIDPSRVVQVSWRPRVF
         F   K      +S +F      S     SM S     LLLS+  SF +  A+ +    RK LR +Q ++  +    +S HS RIDPSRVVQ+SW+PR  
Subjt:  GFNCLKMIENMPRSSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLR-DQLVERPL---SYSNHSGRIDPSRVVQVSWRPRVF

Query:  LYKGFLSDEECDHLISLASNSED------NPSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMS
              SDEECDHL+SLA   ED      +  GNT + +++KS  + L+  D++++RIE RI+ WTFLPK++S   Q+  +  EE +  + ++GN+S + 
Subjt:  LYKGFLSDEECDHLISLASNSED------NPSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMS

Query:  SSEPLMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKNNFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTI
         +EPL+ATV+LYLS+ +RGG +LFPES++ SK  S+ R+ ++ LRPVKGNA+LFF++H NASPDKSS HTR P+L GE+W ATKF + +   G  +K + 
Subjt:  SSEPLMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKNNFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTI

Query:  ESDVDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY
        +S    C DED +CP+WA++GEC+RN VFM+GSPDYY
Subjt:  ESDVDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY

A0A6J5U8N9 Procollagen-proline 4-dioxygenase1.5e-26857.74Show/hide
Query:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY
        MSILCG P++ECV CL C RW WKRCLHTAGHDSETWG AT +EFEP+PR+CRYILAVYEDD+R+PLWEP GGYGI PDWL++KKTY+DTQG+APPYILY
Subjt:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY

Query:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN
        LDHDHADIVLA RGLN+A+ESDYAVL+DN+LGKKKFDGGYVHNGLLKAA WVLD E E LKDLV++YP+YTLTF GHSLGSGVAA+LT+VVVQ+R++L N
Subjt:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL
        IDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ        TPLEDIFKSLF                                          LPCL
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL

Query:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK
        LC+RC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFR GRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL+LM + D++ME+PP+QK
Subjt:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK

Query:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKKKETWDELIERLYDKDDSRHTVLKKSLSKCWIIFEDVDFL
        MERQ+TLA+EH+EEY+AALQRAVTLAVPHAY+ S YGTF + D  EEE S  SSG SS    +     + R      SR  +  +  +K           
Subjt:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKKKETWDELIERLYDKDDSRHTVLKKSLSKCWIIFEDVDFL

Query:  GFNCLKMIENMPRSSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSWRPRVFLYKG
                                       ++ +HF                                    +S HS RIDPSR VQ+SWRPRVFLY+G
Subjt:  GFNCLKMIENMPRSSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSWRPRVFLYKG

Query:  FLSDEECDHLISLASNSEDN------PSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEP
        FLSDEECDHL+SLA   E+N        GNT + ++  S  + LN  D+I++RIE RI+ WTFLPK++S   Q+ +   EEAE    F+GN+S +  SEP
Subjt:  FLSDEECDHLISLASNSEDN------PSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEP

Query:  LMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKNNFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTIESDV
        L+ATV+LY+S+ +RGG +LFPES++ S+ WS+  K ++ L+P KGNA+LFF++  NASPDKSS H+R P+L GE+W ATKF Y +   G  +K + +S+ 
Subjt:  LMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKNNFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTIESDV

Query:  DGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY
          C DED +CP WA+IGEC+RN VFM+GSPDYY
Subjt:  DGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY

A0A6J5WND9 Procollagen-proline 4-dioxygenase2.3e-26957.86Show/hide
Query:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY
        MSILCG P++ECV CL C RW WKRCLHTAGHDSETWG AT +EFEP+PR+CRYILAVYEDD+R+PLWEP GGYGI PDWL++KKTY+DTQG+APPYILY
Subjt:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY

Query:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN
        LDHDHADIVLA RGLN+A+ESDYAVL+DN+LGKKKFDGGYVHNGLLKAA WVLD E E LKDLV++YP+YTLTF GHSLGSGVAA+LT+VVVQ+R++L N
Subjt:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL
        IDRKR+R YAIAPARC+SLNLAVRYADVINSVVLQ        TPLEDIFKSLF                                          LPCL
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL

Query:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK
        LC+RC+RDTC+ E+KMLKDPRRLYAPGRLYHIVERKPFR GRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL+LM + D++ME+PP+QK
Subjt:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK

Query:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKKKETWDELIERLYDKDDSRHTVLKKSLSKCWIIFEDVDFL
        MERQ+TLA+EH+EEY+AALQRAVTLAVPHAY+ S YGTF + D  EEE S  SSG SS    +     + R      SR  +  +  +K           
Subjt:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKKKETWDELIERLYDKDDSRHTVLKKSLSKCWIIFEDVDFL

Query:  GFNCLKMIENMPRSSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSWRPRVFLYKG
                                       ++ +HF                                    +S HS RIDPSR VQ+SWRPRVFLY+G
Subjt:  GFNCLKMIENMPRSSNFLGVMKFSNILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSWRPRVFLYKG

Query:  FLSDEECDHLISLASNSEDN------PSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEP
        FLSDEECDHL+SLA   E+N        GNT + ++ KS  + LN  D+I++RIE RI+ WTFLPK++S   Q+ +   EEAE    F+GN+S +  SEP
Subjt:  FLSDEECDHLISLASNSEDN------PSGNTVSTKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEP

Query:  LMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKNNFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTIESDV
        L+ATV+LY+S+ +RGG +LFPES++ S+ WS+  K ++ L+P KGNA+LFF++  NASPDKSS H+R P+L GE+W ATKF Y +   G  +K + +S+ 
Subjt:  LMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKNNFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTIESDV

Query:  DGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY
          C DED +CP WA+IGEC+RN VFM+GSPDYY
Subjt:  DGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY

SwissProt top hitse value%identityAlignment
A2XCJ1 UPF0496 protein 37.7e-4438.44Show/hide
Query:  DVREEYVNAFRTESYIDFWTRVL--ALNKGDDLITQLSVES--TTSTRLSSYRLFVEHLLDPPQSTIKRILTSP---HIGLNSYSLLLNYFSHTANASLL
        D REEY +AFRTESY DFW RVL   L  G  L+ +         S RL SYRLF EHLL+P Q  +   L SP    +  +   LL  Y++ TANAS L
Subjt:  DVREEYVNAFRTESYIDFWTRVL--ALNKGDDLITQLSVES--TTSTRLSSYRLFVEHLLDPPQSTIKRILTSP---HIGLNSYSLLLNYFSHTANASLL

Query:  CSRLLKHIDHLRLKLHPLEINLQSLENKEEFHHESHFKQLLIHLVEFSNTHNPFIPSIEQVQIIQNGCSKLLKRLEFSRDKAQAKLKRV-RYFQHNSAGF
        CS LLK I+H+RL+  PL+  L+ L +      +     L                S  +++ +Q G   LL+ L+  R KA+ +++ V R  +  S  F
Subjt:  CSRLLKHIDHLRLKLHPLEINLQSLENKEEFHHESHFKQLLIHLVEFSNTHNPFIPSIEQVQIIQNGCSKLLKRLEFSRDKAQAKLKRV-RYFQHNSAGF

Query:  L--VAITASLTIIIVTHGIALFVAAPGFLVGAIKLATR---SRKLAKEVARLNVTAKGTYTLNRDFDTIGRLLARLSHELEHMRVMARFWLDK-----GE
        +  VA+ A +   I  H +A F A P  ++    L  R    R   + + +L   AKGTY LNRD +TI RL+AR+  E EHM  + R  ++        
Subjt:  L--VAITASLTIIIVTHGIALFVAAPGFLVGAIKLATR---SRKLAKEVARLNVTAKGTYTLNRDFDTIGRLLARLSHELEHMRVMARFWLDK-----GE

Query:  DRCRAIDELVRQLNQSHVKFSQQLDELEEHLYLCFMTINRARNLVV
         + R + E++RQL+++   F QQLDELEEHL+LCFMTIN+AR +V+
Subjt:  DRCRAIDELVRQLNQSHVKFSQQLDELEEHLYLCFMTINRARNLVV

F4J0A8 Probable prolyl 4-hydroxylase 65.2e-4839.39Show/hide
Query:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNS-------EDNPSGNTVSTKVLKSSGVIL-NTTDDIIARIENRIALWTFLPKDHSMPF
        S+ S  +DP+R+ Q+SW PR FLYKGFLSDEECDHLI LA           D  SG +  ++V  SSG+ L    DDI+A +E ++A WTFLP+++    
Subjt:  SNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNS-------EDNPSGNTVSTKVLKSSGVIL-NTTDDIIARIENRIALWTFLPKDHSMPF

Query:  QIMQYRG---EEAEHKYFYGNRSAMSSSEPLMATVVLYLSHSSRGGVMLFPESK-----VNSKFWSNRRKKNNFLRPVKGNAVLFFSVHLNASPDKSSYH
        QI+ Y      +    YFY ++ A+      +ATV++YLS+ ++GG  +FP  K     +    WS   K+   ++P KG+A+LFF++HLN + D +S H
Subjt:  QIMQYRG---EEAEHKYFYGNRSAMSSSEPLMATVVLYLSHSSRGGVMLFPESK-----VNSKFWSNRRKKNNFLRPVKGNAVLFFSVHLNASPDKSSYH

Query:  TRSPILNGELWVATKFFYLRPTTGNKDKHTIESDVDGCIDEDKSCPQWAAIGECERNAVFMIGS
           P++ GE W AT++ ++R + G K           C+D+ +SC +WA  GECE+N ++M+GS
Subjt:  TRSPILNGELWVATKFFYLRPTTGNKDKHTIESDVDGCIDEDKSCPQWAAIGECERNAVFMIGS

Q8GXT7 Probable prolyl 4-hydroxylase 124.1e-5340.67Show/hide
Query:  SRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLV-----ERPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGNTVS
        SR+  +L+++++ S     +  +    RK LRD+ +     +   SY   S  +DP+RV+Q+SW PRVFLY+GFLS+EECDHLISL             +
Subjt:  SRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLV-----ERPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGNTVS

Query:  TKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEPLMATVVLYLSHSSRGGVMLFPESKVNSKFWSNR
        T+V           D ++A IE +++ WTFLP ++    ++  Y  E++  K  ++G   +    E L+ATVVLYLS++++GG +LFP S++  K  ++ 
Subjt:  TKVLKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKY-FYGNRSAMSSSEPLMATVVLYLSHSSRGGVMLFPESKVNSKFWSNR

Query:  RKKNNFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTIESDVDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY
         +  N LRPVKGNA+LFF+  LNAS D  S H R P++ GEL VATK  Y       K +  IE   + C DED++C +WA +GEC++N V+MIGSPDYY
Subjt:  RKKNNFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTIESDVDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYY

Q8L970 Probable prolyl 4-hydroxylase 72.1e-4937.22Show/hide
Query:  MDSRLHFLLLLSIAFSFSTCL---AQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA------SNSEDNP
        MDSR+   L  S+ F F+  L   A +  ++     RD  V + +  S  S   DP+RV Q+SW PRVFLY+GFLSDEECDH I LA      S   DN 
Subjt:  MDSRLHFLLLLSIAFSFSTCL---AQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLA------SNSEDNP

Query:  SGNTVSTKVLKSSGVILN-TTDDIIARIENRIALWTFLPKDHSMPFQIMQY-RGEEAE-HKYFYGNRSAMSSSEPLMATVVLYLSHSSRGGVMLFP----
        SG +V ++V  SSG+ L+   DDI++ +E ++A WTFLP+++    QI+ Y  G++ E H  ++ +++ +      +ATV++YLS+  +GG  +FP    
Subjt:  SGNTVSTKVLKSSGVILN-TTDDIIARIENRIALWTFLPKDHSMPFQIMQY-RGEEAE-HKYFYGNRSAMSSSEPLMATVVLYLSHSSRGGVMLFP----

Query:  -ESKVNSKFWSNRRKKNNFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTIESDVDGCIDEDKSCPQWAAIGECE
          +++    W+   K+   ++P KG+A+LFF++H NA+ D +S H   P++ GE W AT++ +++      +K +      GC+DE+ SC +WA  GEC+
Subjt:  -ESKVNSKFWSNRRKKNNFLRPVKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTIESDVDGCIDEDKSCPQWAAIGECE

Query:  RNAVFMIGS
        +N  +M+GS
Subjt:  RNAVFMIGS

Q9SMU4 UPF0496 protein At3g490701.9e-8248.33Show/hide
Query:  SGPHITSHPVDIDVREEYVNAFRTESYIDFWTRVLALNKGDDLITQLS-----VESTTSTRLSSYRLFVEHLLDPPQSTIKRILTSPHIGLNSYSLLLNY
        SG        D+DVREEY NAFRTESY  FWTRV+ L++    ++  S       S+TS RL SYRLF  +LLDP  +TI RIL    +G ++ +LL +Y
Subjt:  SGPHITSHPVDIDVREEYVNAFRTESYIDFWTRVLALNKGDDLITQLS-----VESTTSTRLSSYRLFVEHLLDPPQSTIKRILTSPHIGLNSYSLLLNY

Query:  FSHTANASLLCSRLLKHIDHLRLKLHPLEINLQSLENKEEFHHESHFKQLLI-HLVEFSNTHNPFIPSIEQVQIIQNGCSKLLKRLEFSRDKAQAKLKRV
        F  TANA LLC++LLK+I HLR K   L         K +FH E+H    LI    E S   +PFI S  ++Q+I++GC  LLKRLE  RDK +AKLK +
Subjt:  FSHTANASLLCSRLLKHIDHLRLKLHPLEINLQSLENKEEFHHESHFKQLLI-HLVEFSNTHNPFIPSIEQVQIIQNGCSKLLKRLEFSRDKAQAKLKRV

Query:  RYFQHNSAGFLVAITASLTIIIVTHGIALFVAAPGFLVGAIKLATRSRKLAKEVARLNVTAKGTYTLNRDFDTIGRLLARLSHELEHMRVMARFWLDKGE
            H+S   ++A+T +L + I +H  ALF+AAP  L    K A    KL K  ARL+V AKGTY L+RD DTI RL+ R++ E+ H+R MA FW+ +G 
Subjt:  RYFQHNSAGFLVAITASLTIIIVTHGIALFVAAPGFLVGAIKLATRSRKLAKEVARLNVTAKGTYTLNRDFDTIGRLLARLSHELEHMRVMARFWLDKGE

Query:  DRCRAIDELVRQLNQSHVKFSQQLDELEEHLYLCFMTINRARNLVVKEVLDSGQPIKISY
         R R  +E+ R+L +    FS++LDELEEH+YLCFMTINRARNL+VKE+LDS  P   S+
Subjt:  DRCRAIDELVRQLNQSHVKFSQQLDELEEHLYLCFMTINRARNLVVKEVLDSGQPIKISY

Arabidopsis top hitse value%identityAlignment
AT3G49050.1 alpha/beta-Hydrolases superfamily protein4.3e-19165.37Show/hide
Query:  MSILCG-VPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYIL
        MSILCG  P+LECV CLGCARW +KRCL+TAGHDSE WG AT DEFEP+PR CRYILAVYEDDIR PLWEP  GYGINPDWLL+KKTY+DTQGRAP YIL
Subjt:  MSILCG-VPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYIL

Query:  YLDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLE
        YLDH H DIV+AIRGLN+AKESDYA+LLDN+LG++KFDGGYVHNGL+K+AG+VLD E + LK+LVK+YP YTLTFAGHSLGSGVA ML L+VV++ E+L 
Subjt:  YLDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLE

Query:  NIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPC
        NIDRKR+RC+AIAPARCMSLNLAVRYADVINSV+LQDDFLPRTATPLEDIFKS+F                                          LPC
Subjt:  NIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPC

Query:  LLCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQ
        LLC+RC++DTCV E KMLKDPRRLYAPGR+YHIVERKP R GR+PPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIE+EA+ AL LM +++K ME+P +Q
Subjt:  LLCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQ

Query:  KMERQKTLAREHSEEYKAALQRAVTLAVPHAYAIS-PYGTFSQTDEGEEEKSPA------------------SSGGSS--------RKKKETWDELIERL
        +MERQ++LAREH+ EY+AAL+RAVTL VPHA +++  YGTF +T E E E+                     SS  SS        R ++ +WDELIE L
Subjt:  KMERQKTLAREHSEEYKAALQRAVTLAVPHAYAIS-PYGTFSQTDEGEEEKSPA------------------SSGGSS--------RKKKETWDELIERL

Query:  YDKDDSRHTVLKKS
        +++D+S +   +KS
Subjt:  YDKDDSRHTVLKKS

AT3G49070.1 Protein of unknown function (DUF677)1.3e-8348.33Show/hide
Query:  SGPHITSHPVDIDVREEYVNAFRTESYIDFWTRVLALNKGDDLITQLS-----VESTTSTRLSSYRLFVEHLLDPPQSTIKRILTSPHIGLNSYSLLLNY
        SG        D+DVREEY NAFRTESY  FWTRV+ L++    ++  S       S+TS RL SYRLF  +LLDP  +TI RIL    +G ++ +LL +Y
Subjt:  SGPHITSHPVDIDVREEYVNAFRTESYIDFWTRVLALNKGDDLITQLS-----VESTTSTRLSSYRLFVEHLLDPPQSTIKRILTSPHIGLNSYSLLLNY

Query:  FSHTANASLLCSRLLKHIDHLRLKLHPLEINLQSLENKEEFHHESHFKQLLI-HLVEFSNTHNPFIPSIEQVQIIQNGCSKLLKRLEFSRDKAQAKLKRV
        F  TANA LLC++LLK+I HLR K   L         K +FH E+H    LI    E S   +PFI S  ++Q+I++GC  LLKRLE  RDK +AKLK +
Subjt:  FSHTANASLLCSRLLKHIDHLRLKLHPLEINLQSLENKEEFHHESHFKQLLI-HLVEFSNTHNPFIPSIEQVQIIQNGCSKLLKRLEFSRDKAQAKLKRV

Query:  RYFQHNSAGFLVAITASLTIIIVTHGIALFVAAPGFLVGAIKLATRSRKLAKEVARLNVTAKGTYTLNRDFDTIGRLLARLSHELEHMRVMARFWLDKGE
            H+S   ++A+T +L + I +H  ALF+AAP  L    K A    KL K  ARL+V AKGTY L+RD DTI RL+ R++ E+ H+R MA FW+ +G 
Subjt:  RYFQHNSAGFLVAITASLTIIIVTHGIALFVAAPGFLVGAIKLATRSRKLAKEVARLNVTAKGTYTLNRDFDTIGRLLARLSHELEHMRVMARFWLDKGE

Query:  DRCRAIDELVRQLNQSHVKFSQQLDELEEHLYLCFMTINRARNLVVKEVLDSGQPIKISY
         R R  +E+ R+L +    FS++LDELEEH+YLCFMTINRARNL+VKE+LDS  P   S+
Subjt:  DRCRAIDELVRQLNQSHVKFSQQLDELEEHLYLCFMTINRARNLVVKEVLDSGQPIKISY

AT4G00500.1 alpha/beta-Hydrolases superfamily protein1.2e-16156.45Show/hide
Query:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY
        MSILC VP+LECV CLGC  W+WK+CL++AGH+SE WG AT DEFEPIPRICR ILAVYE+++  P+W P  GYGI+P+ +++KK Y  T+GR  PY++Y
Subjt:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY

Query:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN
        LDH++ D+VLAIRGLN+AKE DYAVLLDN+LG+ KFDGGYVHNGLLKAA WV + E+  L++L++  P Y+LTF GHSLG+GV ++L L V+QNR +L N
Subjt:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL
        I+RKRIRC+AIAP RCMSL+LAV YADVINSVVLQDDFLPRT T LE++FKS+                                           LPCL
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL

Query:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK
        LCL C++DT   E++ LKD RRLYAPGRLYHIV RKP R GR+PPVV+TAVPVDGRFE IVLSCNAT+DHAIIWIE+E++ AL+LM ++D+VM++P +QK
Subjt:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK

Query:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEE------EKSPASSGGSSRKKKETWDELIERLYD-KDDSRHTVLKKSLSK
        + RQK++  +H EEY+AA+ +A +L +P + + S YGTF  T+EGE       E SP  SG S +  +  WD+ I+  +   D+S H + K   S+
Subjt:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEE------EKSPASSGGSSRKKKETWDELIERLYD-KDDSRHTVLKKSLSK

AT4G00500.2 alpha/beta-Hydrolases superfamily protein1.2e-16156.45Show/hide
Query:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY
        MSILC VP+LECV CLGC  W+WK+CL++AGH+SE WG AT DEFEPIPRICR ILAVYE+++  P+W P  GYGI+P+ +++KK Y  T+GR  PY++Y
Subjt:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILY

Query:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN
        LDH++ D+VLAIRGLN+AKE DYAVLLDN+LG+ KFDGGYVHNGLLKAA WV + E+  L++L++  P Y+LTF GHSLG+GV ++L L V+QNR +L N
Subjt:  LDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLEN

Query:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL
        I+RKRIRC+AIAP RCMSL+LAV YADVINSVVLQDDFLPRT T LE++FKS+                                           LPCL
Subjt:  IDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCL

Query:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK
        LCL C++DT   E++ LKD RRLYAPGRLYHIV RKP R GR+PPVV+TAVPVDGRFE IVLSCNAT+DHAIIWIE+E++ AL+LM ++D+VM++P +QK
Subjt:  LCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQK

Query:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEE------EKSPASSGGSSRKKKETWDELIERLYD-KDDSRHTVLKKSLSK
        + RQK++  +H EEY+AA+ +A +L +P + + S YGTF  T+EGE       E SP  SG S +  +  WD+ I+  +   D+S H + K   S+
Subjt:  MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEE------EKSPASSGGSSRKKKETWDELIERLYD-KDDSRHTVLKKSLSK

AT5G37710.1 alpha/beta-Hydrolases superfamily protein1.4e-13852.44Show/hide
Query:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEP-VGGYGINPDWLLMKKTYKDTQGRAPPYIL
        MS+ CG   LECV C+G +RW WKRC H    DS TW  ATP+EFEPIPRI R ILAVYE D+R P   P +G + +NP+W++ + T++ TQGR+PPYI+
Subjt:  MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEP-VGGYGINPDWLLMKKTYKDTQGRAPPYIL

Query:  YLDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETL-KDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKL
        Y+DHDH +IVLAIRGLN+AKESDY +LLDN+LG+K   GGYVH GLLK+A WVL+ E+ETL +   +   +Y L FAGHSLGSGVAA++ ++VV     +
Subjt:  YLDHDHADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETL-KDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKL

Query:  ENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLP
         +I R ++RC+A+APARCMSLNLAV+YADVI+SV+LQDDFLPRTATPLEDIFKS+F                                          LP
Subjt:  ENIDRKRIRCYAIAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLP

Query:  CLLCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHK--DDKVMEMP
        CLL L C+RDT + E + L+DPRRLYAPGR+YHIVERK     RFPP V+TA+PVDGRFEHIVLS NATSDHAI+WIE+EA+ AL+++ +   + V+ M 
Subjt:  CLLCLRCVRDTCVSEDKMLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHK--DDKVMEMP

Query:  PQQK-MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKKKETWDELIERLYDKDDSRHTVLKKSL
        P++K MER  TL +EH    K AL+RAV+L +PHA +         T E EEE +   +    + KK+ WDE++++L+ + +S   VL  ++
Subjt:  PQQK-MERQKTLAREHSEEYKAALQRAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKKKETWDELIERLYDKDDSRHTVLKKSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTATCTTATGTGGCGTGCCTATCCTTGAGTGTGTATGCTGTCTAGGATGTGCTCGTTGGGTCTGGAAACGTTGTCTCCATACAGCTGGTCATGACAGTGAA
ACTTGGGGATTTGCTACTCCTGATGAGTTTGAGCCTATTCCCCGCATTTGTCGATATATCCTGGCTGTGTATGAAGATGACATTCGAAAACCCCTTTGGGAACCG
GTTGGTGGGTATGGAATCAATCCAGATTGGTTACTCATGAAGAAGACATACAAAGATACACAAGGGCGGGCTCCTCCATATATTCTATATCTTGATCACGATCAC
GCCGATATTGTTCTTGCCATCAGGGGACTTAACATGGCAAAGGAGAGTGATTATGCAGTTTTATTGGACAACAGGCTGGGAAAGAAGAAATTTGATGGTGGATAT
GTTCACAATGGGCTTCTGAAGGCAGCTGGGTGGGTTTTGGACACTGAGAATGAAACTTTAAAGGATTTGGTGAAGAGGTATCCGGATTATACCTTGACTTTTGCA
GGGCATTCTCTTGGCTCCGGAGTAGCAGCCATGCTAACTTTAGTAGTAGTACAGAATCGCGAAAAATTGGAAAATATTGATCGGAAACGGATAAGGTGTTATGCT
ATTGCTCCTGCCAGGTGCATGTCCCTAAATTTGGCTGTTAGATATGCAGATGTCATCAACTCTGTTGTTCTTCAGGATGACTTTTTACCCAGGACAGCCACACCC
TTGGAAGACATTTTTAAGTCACTCTTCTGGTATGGTTCAGGATTCATGACCCTTCACGGTTTAACTCTTGTTATGTTGTTGACAACTCATTTGCAAAGTAGATAT
TGCTGCAACCATGAATATTTCATAAATTGGTTCTCTTCTCCTTGTAGTTTGCCGTGCCTTCTATGCCTCAGATGTGTTCGGGATACGTGTGTATCAGAGGACAAG
ATGCTCAAAGATCCAAGGAGACTTTATGCACCTGGTCGACTCTATCACATTGTCGAGCGAAAACCTTTCAGGTGTGGAAGGTTTCCGCCAGTCGTGAAGACGGCT
GTTCCAGTGGATGGGAGGTTTGAACACATAGTTCTTTCTTGTAATGCAACTTCAGACCATGCTATCATTTGGATAGAGAAAGAAGCCAAATGGGCCCTGGAATTA
ATGCACAAAGATGATAAGGTCATGGAGATGCCACCCCAACAGAAGATGGAGAGGCAAAAGACTTTAGCCAGAGAGCACAGTGAAGAGTACAAGGCTGCATTGCAG
CGGGCCGTGACATTAGCTGTGCCACATGCATATGCGATTTCCCCATACGGAACCTTCAGCCAAACAGATGAAGGGGAAGAAGAAAAATCACCGGCATCGAGTGGA
GGGTCGTCGAGGAAGAAGAAAGAAACTTGGGATGAACTGATTGAGCGTCTCTATGACAAGGATGATTCAAGACACACAGTGTTGAAGAAATCACTTAGTAAATGT
TGGATTATCTTTGAGGATGTTGATTTTCTTGGCTTCAATTGTTTGAAGATGATCGAAAATATGCCGCGTTCATCCAATTTTTTAGGAGTGATGAAGTTCAGCAAT
ATTCTTGATCCATCAATGGATTCTCGTCTTCATTTTTTGCTTCTTTTATCAATTGCATTTTCATTCTCAACCTGCCTTGCACAAAGCAATTTGATTAGTGGCCGG
AAGGGTTTAAGGGACCAATTGGTTGAAAGACCTTTGAGCTACTCAAATCATTCAGGAAGAATTGACCCATCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGGTT
TTCTTGTATAAAGGTTTTCTCTCAGATGAGGAGTGTGATCATCTTATTTCTTTGGCTTCAAATTCAGAAGACAATCCTTCCGGGAACACTGTCTCAACCAAAGTG
CTAAAGAGTTCAGGAGTCATTTTAAACACAACAGATGATATCATTGCAAGAATTGAAAATCGAATTGCACTGTGGACTTTTCTCCCAAAAGATCATAGCATGCCT
TTCCAGATCATGCAATACAGGGGTGAAGAAGCAGAGCACAAGTACTTTTATGGTAACAGATCTGCAATGTCGTCCAGTGAGCCTTTGATGGCCACAGTAGTTTTG
TATCTCTCACATTCTTCTCGCGGTGGCGTGATGCTCTTTCCAGAGTCAAAGGTAAACAGCAAATTTTGGTCAAACCGGAGAAAGAAAAACAACTTTCTGAGACCA
GTGAAAGGCAATGCAGTACTTTTTTTCTCTGTGCATCTTAATGCTTCTCCAGACAAGAGTAGCTACCACACTCGATCCCCAATACTCAATGGGGAATTGTGGGTT
GCTACAAAATTCTTCTACTTAAGACCAACCACTGGGAATAAAGATAAACACACAATTGAATCTGATGTAGACGGTTGCATTGATGAAGATAAAAGCTGCCCCCAA
TGGGCTGCCATTGGCGAATGCGAAAGAAACGCTGTATTCATGATCGGTTCTCCAGATTACTATGACAGTGGGCCACACATTACAAGTCATCCAGTAGACATAGAT
GTGCGGGAGGAGTACGTGAATGCTTTTCGCACCGAATCATACATCGACTTTTGGACACGTGTCCTTGCCTTAAACAAGGGAGACGATCTCATAACCCAACTTTCA
GTAGAGTCCACAACTTCAACTCGTCTCTCATCTTATAGGCTATTTGTTGAACATCTACTTGATCCACCTCAATCCACAATTAAAAGAATCCTAACTTCGCCCCAT
ATTGGACTCAATTCCTACTCCCTTCTCTTAAATTACTTCTCTCATACCGCCAATGCCTCTCTTTTATGCAGTCGTTTACTAAAACACATCGACCACTTACGTCTC
AAACTCCATCCCTTAGAAATCAACCTTCAGTCCTTAGAAAACAAGGAAGAATTTCATCATGAGTCTCACTTCAAACAACTCTTAATTCATTTGGTAGAATTCTCC
AACACCCACAACCCATTTATACCATCTATAGAACAAGTTCAAATCATCCAAAATGGATGCTCAAAATTGTTAAAGCGACTTGAGTTTAGCCGTGACAAGGCTCAA
GCAAAACTCAAGAGAGTTAGATACTTTCAACACAATTCAGCTGGCTTCTTGGTGGCTATAACCGCATCACTTACTATAATAATTGTGACTCATGGAATTGCATTA
TTTGTTGCTGCACCTGGCTTCCTTGTGGGTGCTATAAAGTTGGCTACTAGGTCGAGGAAGCTAGCGAAGGAAGTTGCTCGACTCAACGTTACTGCCAAAGGGACT
TACACTTTGAATAGGGATTTCGATACAATTGGTAGGCTCTTGGCTCGATTGAGTCATGAACTCGAACATATGAGAGTGATGGCAAGATTTTGGCTTGACAAAGGA
GAAGATAGATGCCGAGCCATTGATGAACTAGTGCGTCAGTTAAATCAAAGTCATGTGAAATTTAGCCAACAATTAGATGAGCTTGAGGAGCATTTGTATTTGTGT
TTTATGACCATAAATCGAGCTAGAAATCTTGTAGTGAAAGAGGTTTTGGATTCGGGTCAACCTATAAAGATTTCGTATTTATGA
mRNA sequenceShow/hide mRNA sequence
ATCCAATTTTCCCCTAAAATTCACACAATCTCCCAAATTCGTAACCCCAGTTCCAAATACAGTCGTTGACTTCCGGGCATCAATGTCTATCTTATGTGGCGTGCC
TATCCTTGAGTGTGTATGCTGTCTAGGATGTGCTCGTTGGGTCTGGAAACGTTGTCTCCATACAGCTGGTCATGACAGTGAAACTTGGGGATTTGCTACTCCTGA
TGAGTTTGAGCCTATTCCCCGCATTTGTCGATATATCCTGGCTGTGTATGAAGATGACATTCGAAAACCCCTTTGGGAACCGGTTGGTGGGTATGGAATCAATCC
AGATTGGTTACTCATGAAGAAGACATACAAAGATACACAAGGGCGGGCTCCTCCATATATTCTATATCTTGATCACGATCACGCCGATATTGTTCTTGCCATCAG
GGGACTTAACATGGCAAAGGAGAGTGATTATGCAGTTTTATTGGACAACAGGCTGGGAAAGAAGAAATTTGATGGTGGATATGTTCACAATGGGCTTCTGAAGGC
AGCTGGGTGGGTTTTGGACACTGAGAATGAAACTTTAAAGGATTTGGTGAAGAGGTATCCGGATTATACCTTGACTTTTGCAGGGCATTCTCTTGGCTCCGGAGT
AGCAGCCATGCTAACTTTAGTAGTAGTACAGAATCGCGAAAAATTGGAAAATATTGATCGGAAACGGATAAGGTGTTATGCTATTGCTCCTGCCAGGTGCATGTC
CCTAAATTTGGCTGTTAGATATGCAGATGTCATCAACTCTGTTGTTCTTCAGGATGACTTTTTACCCAGGACAGCCACACCCTTGGAAGACATTTTTAAGTCACT
CTTCTGGTATGGTTCAGGATTCATGACCCTTCACGGTTTAACTCTTGTTATGTTGTTGACAACTCATTTGCAAAGTAGATATTGCTGCAACCATGAATATTTCAT
AAATTGGTTCTCTTCTCCTTGTAGTTTGCCGTGCCTTCTATGCCTCAGATGTGTTCGGGATACGTGTGTATCAGAGGACAAGATGCTCAAAGATCCAAGGAGACT
TTATGCACCTGGTCGACTCTATCACATTGTCGAGCGAAAACCTTTCAGGTGTGGAAGGTTTCCGCCAGTCGTGAAGACGGCTGTTCCAGTGGATGGGAGGTTTGA
ACACATAGTTCTTTCTTGTAATGCAACTTCAGACCATGCTATCATTTGGATAGAGAAAGAAGCCAAATGGGCCCTGGAATTAATGCACAAAGATGATAAGGTCAT
GGAGATGCCACCCCAACAGAAGATGGAGAGGCAAAAGACTTTAGCCAGAGAGCACAGTGAAGAGTACAAGGCTGCATTGCAGCGGGCCGTGACATTAGCTGTGCC
ACATGCATATGCGATTTCCCCATACGGAACCTTCAGCCAAACAGATGAAGGGGAAGAAGAAAAATCACCGGCATCGAGTGGAGGGTCGTCGAGGAAGAAGAAAGA
AACTTGGGATGAACTGATTGAGCGTCTCTATGACAAGGATGATTCAAGACACACAGTGTTGAAGAAATCACTTAGTAAATGTTGGATTATCTTTGAGGATGTTGA
TTTTCTTGGCTTCAATTGTTTGAAGATGATCGAAAATATGCCGCGTTCATCCAATTTTTTAGGAGTGATGAAGTTCAGCAATATTCTTGATCCATCAATGGATTC
TCGTCTTCATTTTTTGCTTCTTTTATCAATTGCATTTTCATTCTCAACCTGCCTTGCACAAAGCAATTTGATTAGTGGCCGGAAGGGTTTAAGGGACCAATTGGT
TGAAAGACCTTTGAGCTACTCAAATCATTCAGGAAGAATTGACCCATCAAGAGTTGTCCAAGTCTCTTGGCGACCAAGGGTTTTCTTGTATAAAGGTTTTCTCTC
AGATGAGGAGTGTGATCATCTTATTTCTTTGGCTTCAAATTCAGAAGACAATCCTTCCGGGAACACTGTCTCAACCAAAGTGCTAAAGAGTTCAGGAGTCATTTT
AAACACAACAGATGATATCATTGCAAGAATTGAAAATCGAATTGCACTGTGGACTTTTCTCCCAAAAGATCATAGCATGCCTTTCCAGATCATGCAATACAGGGG
TGAAGAAGCAGAGCACAAGTACTTTTATGGTAACAGATCTGCAATGTCGTCCAGTGAGCCTTTGATGGCCACAGTAGTTTTGTATCTCTCACATTCTTCTCGCGG
TGGCGTGATGCTCTTTCCAGAGTCAAAGGTAAACAGCAAATTTTGGTCAAACCGGAGAAAGAAAAACAACTTTCTGAGACCAGTGAAAGGCAATGCAGTACTTTT
TTTCTCTGTGCATCTTAATGCTTCTCCAGACAAGAGTAGCTACCACACTCGATCCCCAATACTCAATGGGGAATTGTGGGTTGCTACAAAATTCTTCTACTTAAG
ACCAACCACTGGGAATAAAGATAAACACACAATTGAATCTGATGTAGACGGTTGCATTGATGAAGATAAAAGCTGCCCCCAATGGGCTGCCATTGGCGAATGCGA
AAGAAACGCTGTATTCATGATCGGTTCTCCAGATTACTATGACAGTGGGCCACACATTACAAGTCATCCAGTAGACATAGATGTGCGGGAGGAGTACGTGAATGC
TTTTCGCACCGAATCATACATCGACTTTTGGACACGTGTCCTTGCCTTAAACAAGGGAGACGATCTCATAACCCAACTTTCAGTAGAGTCCACAACTTCAACTCG
TCTCTCATCTTATAGGCTATTTGTTGAACATCTACTTGATCCACCTCAATCCACAATTAAAAGAATCCTAACTTCGCCCCATATTGGACTCAATTCCTACTCCCT
TCTCTTAAATTACTTCTCTCATACCGCCAATGCCTCTCTTTTATGCAGTCGTTTACTAAAACACATCGACCACTTACGTCTCAAACTCCATCCCTTAGAAATCAA
CCTTCAGTCCTTAGAAAACAAGGAAGAATTTCATCATGAGTCTCACTTCAAACAACTCTTAATTCATTTGGTAGAATTCTCCAACACCCACAACCCATTTATACC
ATCTATAGAACAAGTTCAAATCATCCAAAATGGATGCTCAAAATTGTTAAAGCGACTTGAGTTTAGCCGTGACAAGGCTCAAGCAAAACTCAAGAGAGTTAGATA
CTTTCAACACAATTCAGCTGGCTTCTTGGTGGCTATAACCGCATCACTTACTATAATAATTGTGACTCATGGAATTGCATTATTTGTTGCTGCACCTGGCTTCCT
TGTGGGTGCTATAAAGTTGGCTACTAGGTCGAGGAAGCTAGCGAAGGAAGTTGCTCGACTCAACGTTACTGCCAAAGGGACTTACACTTTGAATAGGGATTTCGA
TACAATTGGTAGGCTCTTGGCTCGATTGAGTCATGAACTCGAACATATGAGAGTGATGGCAAGATTTTGGCTTGACAAAGGAGAAGATAGATGCCGAGCCATTGA
TGAACTAGTGCGTCAGTTAAATCAAAGTCATGTGAAATTTAGCCAACAATTAGATGAGCTTGAGGAGCATTTGTATTTGTGTTTTATGACCATAAATCGAGCTAG
AAATCTTGTAGTGAAAGAGGTTTTGGATTCGGGTCAACCTATAAAGATTTCGTATTTATGATCATGGAAAAGGAAAGAAGAAG
Protein sequenceShow/hide protein sequence
MSILCGVPILECVCCLGCARWVWKRCLHTAGHDSETWGFATPDEFEPIPRICRYILAVYEDDIRKPLWEPVGGYGINPDWLLMKKTYKDTQGRAPPYILYLDHDH
ADIVLAIRGLNMAKESDYAVLLDNRLGKKKFDGGYVHNGLLKAAGWVLDTENETLKDLVKRYPDYTLTFAGHSLGSGVAAMLTLVVVQNREKLENIDRKRIRCYA
IAPARCMSLNLAVRYADVINSVVLQDDFLPRTATPLEDIFKSLFWYGSGFMTLHGLTLVMLLTTHLQSRYCCNHEYFINWFSSPCSLPCLLCLRCVRDTCVSEDK
MLKDPRRLYAPGRLYHIVERKPFRCGRFPPVVKTAVPVDGRFEHIVLSCNATSDHAIIWIEKEAKWALELMHKDDKVMEMPPQQKMERQKTLAREHSEEYKAALQ
RAVTLAVPHAYAISPYGTFSQTDEGEEEKSPASSGGSSRKKKETWDELIERLYDKDDSRHTVLKKSLSKCWIIFEDVDFLGFNCLKMIENMPRSSNFLGVMKFSN
ILDPSMDSRLHFLLLLSIAFSFSTCLAQSNLISGRKGLRDQLVERPLSYSNHSGRIDPSRVVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSGNTVSTKV
LKSSGVILNTTDDIIARIENRIALWTFLPKDHSMPFQIMQYRGEEAEHKYFYGNRSAMSSSEPLMATVVLYLSHSSRGGVMLFPESKVNSKFWSNRRKKNNFLRP
VKGNAVLFFSVHLNASPDKSSYHTRSPILNGELWVATKFFYLRPTTGNKDKHTIESDVDGCIDEDKSCPQWAAIGECERNAVFMIGSPDYYDSGPHITSHPVDID
VREEYVNAFRTESYIDFWTRVLALNKGDDLITQLSVESTTSTRLSSYRLFVEHLLDPPQSTIKRILTSPHIGLNSYSLLLNYFSHTANASLLCSRLLKHIDHLRL
KLHPLEINLQSLENKEEFHHESHFKQLLIHLVEFSNTHNPFIPSIEQVQIIQNGCSKLLKRLEFSRDKAQAKLKRVRYFQHNSAGFLVAITASLTIIIVTHGIAL
FVAAPGFLVGAIKLATRSRKLAKEVARLNVTAKGTYTLNRDFDTIGRLLARLSHELEHMRVMARFWLDKGEDRCRAIDELVRQLNQSHVKFSQQLDELEEHLYLC
FMTINRARNLVVKEVLDSGQPIKISYL