CuGenDBv2

CmaCh06G014510 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh06G014510
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: Procollagen-proline 4-dioxygenase
Genome location: Cma_Chr06:9282589..9289802
RNA-Seq Expression: CmaCh06G014510
Synteny: CmaCh06G014510
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR003582 - ShKT domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR044862 - Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domain
IPR045054 - Prolyl 4-hydroxylase


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6597483.1 putative prolyl 4-hydroxylase 4, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 3.1e-172; identity: 99.67%)
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAH+DYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

RXH80310.1 hypothetical protein DVH24_041457 [Malus domestica] (e-value: 8.7e-167; identity: 39.29%)
Query:  MQGRESGSSGFDLPDRVLQVLPSDPFEQLDVARKITSIALSTRVSMLESESSVLRSKIAEKDEIVADLRFQIESLNDSLSETVDKLARADEEKESLEKEN
        M  +ESGSS FDLP+ VL+VLP DPFEQLDVARKITS+ALSTRVS LESESS LR K+AEKD ++ADL+ Q+ESL+ SLSE+ DKLA A++EKE L KE 
Subjt:  MQGRESGSSGFDLPDRVLQVLPSDPFEQLDVARKITSIALSTRVSMLESESSVLRSKIAEKDEIVADLRFQIESLNDSLSETVDKLARADEEKESLEKEN

Query:  ASLSNTVKKLRGDVAKLEVFRKTLMLSLQEEGDIPTEVPEMVAKIPSQPSTVSQIKEDVSS---------------LPSSRYSSAQSQTSEVGNSLAEDH
        A L+NTV+KL  DV+KLEVFRKTLM SL E+ + P+   ++VAK    P+   Q +E  SS               LP SR SS QS T   G++  ED 
Subjt:  ASLSNTVKKLRGDVAKLEVFRKTLMLSLQEEGDIPTEVPEMVAKIPSQPSTVSQIKEDVSS---------------LPSSRYSSAQSQTSEVGNSLAEDH

Query:  DFDGIRPRIAPGLLLALQLSTPRLTPHNSSPSLSASISPKRTSRPVSPRRHSMSFSTSRNVFEDRSSVCSSAPSISSDKGQARVDGKELFRQVRTRLSYD
        D D  RPRIA  LLLA Q STPR TP  S P  SAS SP RTS+P SP+RHSMSF+TSR +F++RSSV SS     S+ G+ RVDGKE FRQVR+RLSY+
Subjt:  DFDGIRPRIAPGLLLALQLSTPRLTPHNSSPSLSASISPKRTSRPVSPRRHSMSFSTSRNVFEDRSSVCSSAPSISSDKGQARVDGKELFRQVRTRLSYD

Query:  QFAAFLSNVKDLNSHKQTKE--------------------------------------------------------------------------------
        QF AFL+NVK+LN HKQTKE                                                                                
Subjt:  QFAAFLSNVKDLNSHKQTKE--------------------------------------------------------------------------------

Query:  --------------------------------------------------------EMLKKSP-------------------------------------
                                                                 +LKK P                                     
Subjt:  --------------------------------------------------------EMLKKSP-------------------------------------

Query:  ------------------------------------------------------LKKLNQKCSQT-----------------------------------
                                                              +K+L +   +                                    
Subjt:  ------------------------------------------------------LKKLNQKCSQT-----------------------------------

Query:  -----------------------------------LAYLLSGSLASTVHD--------------------------PNTLSH----------------FF
                                           L YL  G     +H                           P++ +H                FF
Subjt:  -----------------------------------LAYLLSGSLASTVHD--------------------------PNTLSH----------------FF

Query:  -------DTDTV-----------------------------------TIIAISSSIYRDPIQS----------------------------------MAK
                TD                                     T +  ++  ++  +Q                                   M +
Subjt:  -------DTDTV-----------------------------------TIIAISSSIYRDPIQS----------------------------------MAK

Query:  FCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAK
         C   LL +L   +S+    +SSS A S +  VNP+KVKQISW+PRAFVYEG LTD ECDHLIS+AK+ELKRSAVAD+LSG+SK+SEVRTSSG FI KAK
Subjt:  FCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAK

Query:  DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGI
        DPIV+GIEDK++ WTFLPKENGEDIQVLRYE GQKY+ H+DYF DKVNIARGGHR+ATVLMYL++V KGGETVFP AE   RR+A+E +  LS+CAKKGI
Subjt:  DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGI

Query:  AVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        AVKP++GDALLFFSL P+AVPD  SLH GCPVIEGEKWSATKWIHVDSFD  +     C D N SCERWA LGECT N EYMVG+PELPGYCR+SCK
Subjt:  AVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

XP_022954026.1 probable prolyl 4-hydroxylase 4 [Cucurbita moschata] (e-value: 6.9e-172; identity: 99.34%)
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAH+DYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFF+LHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

XP_022973641.1 probable prolyl 4-hydroxylase 4 [Cucurbita maxima] (e-value: 1.4e-172; identity: 100%)
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

XP_023539189.1 probable prolyl 4-hydroxylase 4 [Cucurbita pepo subsp. pepo] (e-value: 9.0e-172; identity: 99.34%)
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYD H+DYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3AWU7 Procollagen-proline 4-dioxygenase (e-value: 5.5e-159; identity: 91.39%)
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MA+F   NLLF+ +++IS LLRRAS+SYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS+VAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF DKVNIARGGHRMATVLMYLS+VEKGGETVFP+AEESQRRQASETN+DLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNA+PDT SLHGGCPVIEGEKWSATKWIHVDSFDTI  DHT+C D N SCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
         C
Subjt:  VC

A0A498IAV7 Uncharacterized protein (e-value: 4.2e-167; identity: 39.29%)
Query:  MQGRESGSSGFDLPDRVLQVLPSDPFEQLDVARKITSIALSTRVSMLESESSVLRSKIAEKDEIVADLRFQIESLNDSLSETVDKLARADEEKESLEKEN
        M  +ESGSS FDLP+ VL+VLP DPFEQLDVARKITS+ALSTRVS LESESS LR K+AEKD ++ADL+ Q+ESL+ SLSE+ DKLA A++EKE L KE 
Subjt:  MQGRESGSSGFDLPDRVLQVLPSDPFEQLDVARKITSIALSTRVSMLESESSVLRSKIAEKDEIVADLRFQIESLNDSLSETVDKLARADEEKESLEKEN

Query:  ASLSNTVKKLRGDVAKLEVFRKTLMLSLQEEGDIPTEVPEMVAKIPSQPSTVSQIKEDVSS---------------LPSSRYSSAQSQTSEVGNSLAEDH
        A L+NTV+KL  DV+KLEVFRKTLM SL E+ + P+   ++VAK    P+   Q +E  SS               LP SR SS QS T   G++  ED 
Subjt:  ASLSNTVKKLRGDVAKLEVFRKTLMLSLQEEGDIPTEVPEMVAKIPSQPSTVSQIKEDVSS---------------LPSSRYSSAQSQTSEVGNSLAEDH

Query:  DFDGIRPRIAPGLLLALQLSTPRLTPHNSSPSLSASISPKRTSRPVSPRRHSMSFSTSRNVFEDRSSVCSSAPSISSDKGQARVDGKELFRQVRTRLSYD
        D D  RPRIA  LLLA Q STPR TP  S P  SAS SP RTS+P SP+RHSMSF+TSR +F++RSSV SS     S+ G+ RVDGKE FRQVR+RLSY+
Subjt:  DFDGIRPRIAPGLLLALQLSTPRLTPHNSSPSLSASISPKRTSRPVSPRRHSMSFSTSRNVFEDRSSVCSSAPSISSDKGQARVDGKELFRQVRTRLSYD

Query:  QFAAFLSNVKDLNSHKQTKE--------------------------------------------------------------------------------
        QF AFL+NVK+LN HKQTKE                                                                                
Subjt:  QFAAFLSNVKDLNSHKQTKE--------------------------------------------------------------------------------

Query:  --------------------------------------------------------EMLKKSP-------------------------------------
                                                                 +LKK P                                     
Subjt:  --------------------------------------------------------EMLKKSP-------------------------------------

Query:  ------------------------------------------------------LKKLNQKCSQT-----------------------------------
                                                              +K+L +   +                                    
Subjt:  ------------------------------------------------------LKKLNQKCSQT-----------------------------------

Query:  -----------------------------------LAYLLSGSLASTVHD--------------------------PNTLSH----------------FF
                                           L YL  G     +H                           P++ +H                FF
Subjt:  -----------------------------------LAYLLSGSLASTVHD--------------------------PNTLSH----------------FF

Query:  -------DTDTV-----------------------------------TIIAISSSIYRDPIQS----------------------------------MAK
                TD                                     T +  ++  ++  +Q                                   M +
Subjt:  -------DTDTV-----------------------------------TIIAISSSIYRDPIQS----------------------------------MAK

Query:  FCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAK
         C   LL +L   +S+    +SSS A S +  VNP+KVKQISW+PRAFVYEG LTD ECDHLIS+AK+ELKRSAVAD+LSG+SK+SEVRTSSG FI KAK
Subjt:  FCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAK

Query:  DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGI
        DPIV+GIEDK++ WTFLPKENGEDIQVLRYE GQKY+ H+DYF DKVNIARGGHR+ATVLMYL++V KGGETVFP AE   RR+A+E +  LS+CAKKGI
Subjt:  DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGI

Query:  AVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        AVKP++GDALLFFSL P+AVPD  SLH GCPVIEGEKWSATKWIHVDSFD  +     C D N SCERWA LGECT N EYMVG+PELPGYCR+SCK
Subjt:  AVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

A0A6J1C7M6 Procollagen-proline 4-dioxygenase (e-value: 2.6e-164; identity: 94.04%)
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MA+FCSC+LLF  S+SISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDH+ISLAKAELKRSAVAD+LSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK
        KAKDPI+SGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYF+DKVNIARGGHRMATVLMYLSNVEKGGETVFP+AEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPR+GDALLFFSLHPNAVPDT SLHGGCPVIEGEKWSATKWIHVDSFDTI+ DHT+C D +ASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

A0A6J1GPQ8 Procollagen-proline 4-dioxygenase (e-value: 3.3e-172; identity: 99.34%)
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAH+DYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFF+LHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

A0A6J1I971 Procollagen-proline 4-dioxygenase (e-value: 6.7e-173; identity: 100%)
Query:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
        MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH
Subjt:  MAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIH

Query:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK
        KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK
Subjt:  KAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAK

Query:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
        KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK
Subjt:  KGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCK

Query:  VC
        VC
Subjt:  VC

SwissProt top hits (e-value, % identity, alignment)
F4J0A8 Probable prolyl 4-hydroxylase 6 (e-value: 5.1e-93; identity: 59.73%)
Query:  FILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS-AVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG
        + L+ S+SLLL     S   S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHLI LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ 
Subjt:  FILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS-AVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGIAVKPRK
        +E K+AAWTFLP+ENGE +Q+L YE GQKYD HFDYF DK  +  GGHR+ATVLMYLSNV KGGETVFPN    + +     ++  S CAK+G AVKPRK
Subjt:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        GDALLFF+LH N   D  SLHG CPVIEGEKWSAT+WIHV SF         CVD++ SC+ WA+ GEC  NP YMVGS    G+CRKSCK C
Subjt:  GDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

F4JAU3 Prolyl 4-hydroxylase 2 (e-value: 7.8e-126; identity: 73.88%)
Query:  ILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGIE
        +L ++I L+L ++S+    S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLISLAK  L+RSAVAD+ +GES+VS+VRTSSG FI K KDPIVSGIE
Subjt:  ILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGIE

Query:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGIAVKPRKGD
        DK++ WTFLPKENGED+QVLRYE+GQKYDAHFDYF DKVNIARGGHR+ATVL+YLSNV KGGETVFP+A+E  RR  SE  +DLSDCAKKGIAVKP+KG+
Subjt:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGIAVKPRKGD

Query:  ALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        ALLFF+L  +A+PD  SLHGGCPVIEGEKWSATKWIHVDSFD I++   +C D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCK C
Subjt:  ALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

Q8L970 Probable prolyl 4-hydroxylase 7 (e-value: 2.8e-99; identity: 58.63%)
Query:  CSCNLLFILSISISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSG
        C    L ++S + +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VAD+ SGES  SEVRTSSG
Subjt:  CSCNLLFILSISISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSG

Query:  AFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNED-L
         F+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLSNVEKGGETVFP      + +A++  +D  
Subjt:  AFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNED-L

Query:  SDCAKKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYC
        ++CAK+G AVKPRKGDALLFF+LHPNA  D+ SLHG CPV+EGEKWSAT+WIHV SF+   +  + C+D N SCE+WA+ GEC  NP YMVGS +  GYC
Subjt:  SDCAKKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYC

Query:  RKSCKVC
        RKSCK C
Subjt:  RKSCKVC

Q8LAN3 Probable prolyl 4-hydroxylase 4 (e-value: 8.9e-130; identity: 76.79%)
Query:  LFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG
        L I   +I  +L ++S+S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++SLAKA LKRSAVAD+ SGESK SEVRTSSG FI K KDPIVSG
Subjt:  LFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGIAVKPRK
        IEDKI+ WTFLPKENGEDIQVLRYE+GQKYDAHFDYF DKVNI RGGHRMAT+LMYLSNV KGGETVFP+AE   RR  SE  EDLSDCAK+GIAVKPRK
Subjt:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        GDALLFF+LHP+A+PD  SLHGGCPVIEGEKWSATKWIHVDSFD IV+   +C D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCK C
Subjt:  GDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

Q93W28 Uncharacterized protein At4g15545 (e-value: 8.1e-75; identity: 57.47%)
Query:  SGSSGFDLPDRVLQVLPSDPFEQLDVARKITSIALSTRVSMLESESSVLRSKIAEKDEIVADLRFQIESLNDSLSETVDKLARADEEKESLEKENASLSN
        +GS  FDLPD +LQVLPSDPFEQLDVARKITSIALSTRVS LESESS LR  +AEK++   +L+  +ESL  SLS+   KL+ AD EKE+L +ENASLSN
Subjt:  SGSSGFDLPDRVLQVLPSDPFEQLDVARKITSIALSTRVSMLESESSVLRSKIAEKDEIVADLRFQIESLNDSLSETVDKLARADEEKESLEKENASLSN

Query:  TVKKLRGDVAKLEVFRKTLMLSLQEEGDIPTEVPEMVAKIPSQPSTVSQIKEDVSSLPSSRYSSAQS-QTSEVGNSLAEDHDFDGIRPRIAPGLLLALQL
        TVK+L+ DV+KLE FRKTLM+SLQ++ D      +++AK    P+      +D +    SR+SS QS Q SE     A D++ D  +P ++  L L  Q 
Subjt:  TVKKLRGDVAKLEVFRKTLMLSLQEEGDIPTEVPEMVAKIPSQPSTVSQIKEDVSSLPSSRYSSAQS-QTSEVGNSLAEDHDFDGIRPRIAPGLLLALQL

Query:  STPRLTPHNSSPSLSASISPKRTSRPVSPRRHSMSFSTSRNVFED-RSSVCSSAPSISSDKGQARVDGKELFRQVRTRLSYDQFAAFLSNVKDLNSHKQT
        +TPRLTP  S P LSAS +PK TSRP+SPRRHS+SF+T+R +F+D RSS+  S P   S   + RVDGKE FRQVR+RLSY+QF AFL NVKDLN+HKQT
Subjt:  STPRLTPHNSSPSLSASISPKRTSRPVSPRRHSMSFSTSRNVFED-RSSVCSSAPSISSDKGQARVDGKELFRQVRTRLSYDQFAAFLSNVKDLNSHKQT

Query:  KEEMLKKS
        +EE L+K+
Subjt:  KEEMLKKS

Arabidopsis top hits (e-value, % identity, alignment)
AT3G06300.1 P4H isoform 2 (e-value: 5.5e-127; identity: 73.88%)
Query:  ILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGIE
        +L ++I L+L ++S+    S SSI+NP+KVKQ+S  PRAFVYEGFLTDLECDHLISLAK  L+RSAVAD+ +GES+VS+VRTSSG FI K KDPIVSGIE
Subjt:  ILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGIE

Query:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGIAVKPRKGD
        DK++ WTFLPKENGED+QVLRYE+GQKYDAHFDYF DKVNIARGGHR+ATVL+YLSNV KGGETVFP+A+E  RR  SE  +DLSDCAKKGIAVKP+KG+
Subjt:  DKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGIAVKPRKGD

Query:  ALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        ALLFF+L  +A+PD  SLHGGCPVIEGEKWSATKWIHVDSFD I++   +C D N SCERWA LGEC  NPEYMVG+PE+PG CR+SCK C
Subjt:  ALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

AT3G28480.1 Oxoglutarate/iron-dependent oxygenase (e-value: 2.0e-100; identity: 58.63%)
Query:  CSCNLLFILSISISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSG
        C    L ++S + +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VAD+ SGES  SEVRTSSG
Subjt:  CSCNLLFILSISISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSG

Query:  AFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNED-L
         F+ K +D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLSNVEKGGETVFP      + +A++  +D  
Subjt:  AFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNED-L

Query:  SDCAKKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYC
        ++CAK+G AVKPRKGDALLFF+LHPNA  D+ SLHG CPV+EGEKWSAT+WIHV SF+   +  + C+D N SCE+WA+ GEC  NP YMVGS +  GYC
Subjt:  SDCAKKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYC

Query:  RKSCKVC
        RKSCK C
Subjt:  RKSCKVC

AT3G28480.2 Oxoglutarate/iron-dependent oxygenase (e-value: 8.0e-94; identity: 55.87%)
Query:  CSCNLLFILSISISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGES-----KVSEV
        C    L ++S + +  L R+S++  G       SASS   +P +V Q+SW+PR F+YEGFL+D ECDH I LAK +L++S VAD+ SGES      VS V
Subjt:  CSCNLLFILSISISLLLRRASSSYAG-------SASSI-VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGES-----KVSEV

Query:  RTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQA
        R SS    +      D IVS +E K+AAWTFLP+ENGE +Q+L YE GQKY+ HFDYF D+ N+  GGHR+ATVLMYLSNVEKGGETVFP      + +A
Subjt:  RTSSGAFIHKAK---DPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQA

Query:  SETNED-LSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVG
        ++  +D  ++CAK+G AVKPRKGDALLFF+LHPNA  D+ SLHG CPV+EGEKWSAT+WIHV SF+   +  + C+D N SCE+WA+ GEC  NP YMVG
Subjt:  SETNED-LSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVG

Query:  SPELPGYCRKSCKVC
        S +  GYCRKSCK C
Subjt:  SPELPGYCRKSCKVC

AT3G28490.1 Oxoglutarate/iron-dependent oxygenase (e-value: 3.6e-94; identity: 59.73%)
Query:  FILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS-AVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG
        + L+ S+SLLL     S   S S  V+P ++ Q+SW+PRAF+Y+GFL+D ECDHLI LAK +L++S  VAD  SGES+ SEVRTSSG F+ K +D IV+ 
Subjt:  FILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRS-AVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGIAVKPRK
        +E K+AAWTFLP+ENGE +Q+L YE GQKYD HFDYF DK  +  GGHR+ATVLMYLSNV KGGETVFPN    + +     ++  S CAK+G AVKPRK
Subjt:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        GDALLFF+LH N   D  SLHG CPVIEGEKWSAT+WIHV SF         CVD++ SC+ WA+ GEC  NP YMVGS    G+CRKSCK C
Subjt:  GDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC

AT5G18900.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e-value: 6.3e-131; identity: 76.79%)
Query:  LFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG
        L I   +I  +L ++S+S   S+S  VNP+KVKQ+S  PRAFVYEGFLT+LECDH++SLAKA LKRSAVAD+ SGESK SEVRTSSG FI K KDPIVSG
Subjt:  LFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSG

Query:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGIAVKPRK
        IEDKI+ WTFLPKENGEDIQVLRYE+GQKYDAHFDYF DKVNI RGGHRMAT+LMYLSNV KGGETVFP+AE   RR  SE  EDLSDCAK+GIAVKPRK
Subjt:  IEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHRMATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGIAVKPRK

Query:  GDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC
        GDALLFF+LHP+A+PD  SLHGGCPVIEGEKWSATKWIHVDSFD IV+   +C D N SCERWA LGECT NPEYMVG+ ELPGYCR+SCK C
Subjt:  GDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCVDNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC


Sequences
CDS sequence
ATGCAAGGGAGAGAATCTGGCAGTTCCGGCTTTGATCTTCCTGATAGAGTTTTGCAAGTGCTTCCTTCTGATCCTTTTGAACAGCTCGATGTCGCCCGCAAAATC
ACTTCCATCGCTCTTTCGACTCGCGTCTCGATGCTTGAATCAGAGTCCTCTGTTCTGCGCTCTAAAATTGCTGAGAAAGATGAGATTGTAGCAGACTTAAGGTTC
CAGATCGAATCGCTCAACGATTCTCTCTCCGAGACCGTCGATAAACTCGCCAGGGCCGACGAGGAGAAGGAGAGCTTGGAGAAAGAGAACGCCTCGTTGTCGAAC
ACCGTGAAGAAGCTGAGGGGAGATGTTGCGAAGTTGGAGGTGTTTAGGAAGACACTGATGCTATCACTTCAGGAGGAAGGAGATATCCCTACAGAAGTTCCAGAA
ATGGTCGCTAAGATACCGAGCCAACCATCTACTGTTTCCCAGATTAAAGAAGATGTTTCATCCTTGCCGTCCTCCAGATACTCATCGGCTCAGAGCCAGACTTCT
GAAGTGGGAAATTCATTAGCAGAGGATCATGATTTCGATGGAATTCGACCTCGTATTGCACCTGGCCTCCTGTTAGCCTTACAACTAAGTACACCTCGGCTTACT
CCCCATAATTCTTCTCCTTCACTATCAGCATCAATATCTCCAAAGAGAACATCTAGACCTGTGTCACCTAGGCGACATTCAATGTCCTTTTCAACCTCCAGGAAC
GTTTTTGAAGATAGATCTTCAGTATGTTCTTCCGCGCCCTCAATTTCCTCCGACAAAGGGCAAGCTCGGGTCGATGGGAAGGAACTTTTCCGACAAGTCAGGACC
CGTCTGTCTTATGATCAGTTTGCTGCATTTTTATCAAATGTGAAGGATCTAAATTCCCACAAGCAAACGAAAGAGGAGATGCTTAAGAAGTCTCCATTGAAGAAA
CTAAACCAGAAATGCAGCCAAACGTTGGCATATCTGCTATCAGGTTCGTTGGCATCAACTGTACATGATCCAAACACTTTGTCTCATTTTTTCGATACGGATACA
GTAACGATCATAGCGATCTCCTCTAGTATATACAGAGATCCGATTCAGTCAATGGCGAAATTTTGTAGTTGCAATCTGCTGTTTATCCTCTCGATATCGATCTCG
TTGCTTCTCCGGCGAGCTTCAAGCTCTTATGCAGGTTCCGCTAGCTCAATCGTCAATCCTGCAAAAGTCAAACAGATTTCATGGAGTCCCCGGGCTTTCGTGTAT
GAAGGTTTTCTCACGGACTTAGAATGCGATCATCTCATCTCGCTTGCTAAAGCGGAGTTGAAGAGATCTGCTGTTGCGGATCATTTGTCTGGAGAGAGCAAGGTC
AGCGAGGTCCGAACTAGCTCTGGGGCATTTATTCATAAAGCCAAGGATCCGATCGTTTCTGGTATTGAAGACAAAATTGCAGCATGGACATTCCTGCCAAAAGAA
AATGGAGAAGACATTCAAGTGTTGAGATATGAATATGGGCAGAAGTATGATGCCCATTTCGATTACTTTACTGACAAGGTTAATATTGCCCGAGGTGGACACCGA
ATGGCAACTGTTCTTATGTATCTTTCAAATGTAGAAAAAGGTGGTGAAACTGTGTTTCCTAATGCCGAGGAATCTCAAAGACGGCAGGCTTCTGAAACAAATGAA
GATCTTTCAGACTGTGCAAAGAAAGGGATAGCAGTGAAACCACGGAAGGGTGATGCTCTTCTCTTCTTCAGTCTTCATCCAAATGCTGTTCCAGACACAAAAAGT
CTGCATGGAGGTTGCCCTGTGATTGAAGGAGAGAAATGGTCAGCAACAAAGTGGATTCATGTCGATTCTTTCGACACGATCGTGAGTGATCATACGAGTTGCGTT
GATAATAATGCAAGTTGTGAGAGATGGGCTGAACTCGGTGAGTGCACGAATAACCCGGAGTATATGGTGGGATCTCCTGAGCTTCCTGGCTATTGCAGGAAAAGT
TGCAAGGTTTGTTGA
mRNA sequence
CGCAGACCCTTTGTTCAACGCCACGTGTCTCAGTTCCAAAGGAACACTCTATTTTCCCTCAATTCTGTGTTTCAATCTTCTCAATTTGTCTTTCTTCTCCGCTTT
GCTACTGTTCATGATTCTTCCTTGTTCATCTGGTTCGATCATAGCTTATCTACCGATGGCGGACGGGAAATGAAGGCTTGTAATTGAAGTTACGAGAAGCTACGT
AGAGAGAGAGAGAGAAACAGAGACAGCGTTGAAAATGCAAGGGAGAGAATCTGGCAGTTCCGGCTTTGATCTTCCTGATAGAGTTTTGCAAGTGCTTCCTTCTGA
TCCTTTTGAACAGCTCGATGTCGCCCGCAAAATCACTTCCATCGCTCTTTCGACTCGCGTCTCGATGCTTGAATCAGAGTCCTCTGTTCTGCGCTCTAAAATTGC
TGAGAAAGATGAGATTGTAGCAGACTTAAGGTTCCAGATCGAATCGCTCAACGATTCTCTCTCCGAGACCGTCGATAAACTCGCCAGGGCCGACGAGGAGAAGGA
GAGCTTGGAGAAAGAGAACGCCTCGTTGTCGAACACCGTGAAGAAGCTGAGGGGAGATGTTGCGAAGTTGGAGGTGTTTAGGAAGACACTGATGCTATCACTTCA
GGAGGAAGGAGATATCCCTACAGAAGTTCCAGAAATGGTCGCTAAGATACCGAGCCAACCATCTACTGTTTCCCAGATTAAAGAAGATGTTTCATCCTTGCCGTC
CTCCAGATACTCATCGGCTCAGAGCCAGACTTCTGAAGTGGGAAATTCATTAGCAGAGGATCATGATTTCGATGGAATTCGACCTCGTATTGCACCTGGCCTCCT
GTTAGCCTTACAACTAAGTACACCTCGGCTTACTCCCCATAATTCTTCTCCTTCACTATCAGCATCAATATCTCCAAAGAGAACATCTAGACCTGTGTCACCTAG
GCGACATTCAATGTCCTTTTCAACCTCCAGGAACGTTTTTGAAGATAGATCTTCAGTATGTTCTTCCGCGCCCTCAATTTCCTCCGACAAAGGGCAAGCTCGGGT
CGATGGGAAGGAACTTTTCCGACAAGTCAGGACCCGTCTGTCTTATGATCAGTTTGCTGCATTTTTATCAAATGTGAAGGATCTAAATTCCCACAAGCAAACGAA
AGAGGAGATGCTTAAGAAGTCTCCATTGAAGAAACTAAACCAGAAATGCAGCCAAACGTTGGCATATCTGCTATCAGGTTCGTTGGCATCAACTGTACATGATCC
AAACACTTTGTCTCATTTTTTCGATACGGATACAGTAACGATCATAGCGATCTCCTCTAGTATATACAGAGATCCGATTCAGTCAATGGCGAAATTTTGTAGTTG
CAATCTGCTGTTTATCCTCTCGATATCGATCTCGTTGCTTCTCCGGCGAGCTTCAAGCTCTTATGCAGGTTCCGCTAGCTCAATCGTCAATCCTGCAAAAGTCAA
ACAGATTTCATGGAGTCCCCGGGCTTTCGTGTATGAAGGTTTTCTCACGGACTTAGAATGCGATCATCTCATCTCGCTTGCTAAAGCGGAGTTGAAGAGATCTGC
TGTTGCGGATCATTTGTCTGGAGAGAGCAAGGTCAGCGAGGTCCGAACTAGCTCTGGGGCATTTATTCATAAAGCCAAGGATCCGATCGTTTCTGGTATTGAAGA
CAAAATTGCAGCATGGACATTCCTGCCAAAAGAAAATGGAGAAGACATTCAAGTGTTGAGATATGAATATGGGCAGAAGTATGATGCCCATTTCGATTACTTTAC
TGACAAGGTTAATATTGCCCGAGGTGGACACCGAATGGCAACTGTTCTTATGTATCTTTCAAATGTAGAAAAAGGTGGTGAAACTGTGTTTCCTAATGCCGAGGA
ATCTCAAAGACGGCAGGCTTCTGAAACAAATGAAGATCTTTCAGACTGTGCAAAGAAAGGGATAGCAGTGAAACCACGGAAGGGTGATGCTCTTCTCTTCTTCAG
TCTTCATCCAAATGCTGTTCCAGACACAAAAAGTCTGCATGGAGGTTGCCCTGTGATTGAAGGAGAGAAATGGTCAGCAACAAAGTGGATTCATGTCGATTCTTT
CGACACGATCGTGAGTGATCATACGAGTTGCGTTGATAATAATGCAAGTTGTGAGAGATGGGCTGAACTCGGTGAGTGCACGAATAACCCGGAGTATATGGTGGG
ATCTCCTGAGCTTCCTGGCTATTGCAGGAAAAGTTGCAAGGTTTGTTGA
Protein sequence
MQGRESGSSGFDLPDRVLQVLPSDPFEQLDVARKITSIALSTRVSMLESESSVLRSKIAEKDEIVADLRFQIESLNDSLSETVDKLARADEEKESLEKENASLSN
TVKKLRGDVAKLEVFRKTLMLSLQEEGDIPTEVPEMVAKIPSQPSTVSQIKEDVSSLPSSRYSSAQSQTSEVGNSLAEDHDFDGIRPRIAPGLLLALQLSTPRLT
PHNSSPSLSASISPKRTSRPVSPRRHSMSFSTSRNVFEDRSSVCSSAPSISSDKGQARVDGKELFRQVRTRLSYDQFAAFLSNVKDLNSHKQTKEEMLKKSPLKK
LNQKCSQTLAYLLSGSLASTVHDPNTLSHFFDTDTVTIIAISSSIYRDPIQSMAKFCSCNLLFILSISISLLLRRASSSYAGSASSIVNPAKVKQISWSPRAFVY
EGFLTDLECDHLISLAKAELKRSAVADHLSGESKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKENGEDIQVLRYEYGQKYDAHFDYFTDKVNIARGGHR
MATVLMYLSNVEKGGETVFPNAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAVPDTKSLHGGCPVIEGEKWSATKWIHVDSFDTIVSDHTSCV
DNNASCERWAELGECTNNPEYMVGSPELPGYCRKSCKVC