CuGenDBv2

HG10011491 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10011491
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Chr01:6550488..6553006
RNA-Seq Expression: HG10011491
Synteny: HG10011491
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
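
The genome location field above uses a `chromosome:start..end` notation. As a minimal sketch of how such a string might be split into its parts and turned into a span length, the snippet below may help. The function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr01:6550488..6553006' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr01:6550488..6553006")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 2519 bp on Chr01; with 0-based, half-open coordinates the length would be one less.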


Homology
GenBank top hits (e value, % identity, alignment)
KAG7015541.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 3.9e-165; % identity: 65.92)
Query:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFLRLYTTPASDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSC-RSF
        M I  LSRNLR+FS PYFYSIYSFA HSPFPSSSPS H  RF+R YT P SDSAA S L+CNSTSQSVPSLCLFERCNG T DLNLGLFR  YRSC RSF
Subjt:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFLRLYTTPASDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSC-RSF

Query:  SSSSYEKPQCGT----------------------------------------------------------------------------------------
        SS S+EKPQ GT                                                                                        
Subjt:  SSSSYEKPQCGT----------------------------------------------------------------------------------------

Query:  -----------------------------------DEDSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQ
                                           DEDS+KI VERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERR SFYNIIVREQ
Subjt:  -----------------------------------DEDSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQ

Query:  CRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAA
        CRRNDFEGARCTLDEMRQ GCSPDVGILNYLLSSLCKNDK  EA NLFEEMLERDCPPNSLTFEVIICH CEIGNIESAL+FLDMMVSRGLEPRLSTHAA
Subjt:  CRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAA

Query:  FVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSN
        FVK YFNS+RYEEAYRY +DSSLKH MA+NATYSLLATLHEKRGNLVDAQK+L ELIDAGLRPNFPVYMRVLKKLQ+QG+ED ANDLKGK SN
Subjt:  FVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSN

XP_022932243.1 pentatricopeptide repeat-containing protein At1g05670, mitochondrial-like [Cucurbita moschata] (e value: 1.4e-165; % identity: 66.13)
Query:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFLRLYTTPASDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSC-RSF
        M I  LSRNLR+FS PYFYSIYSFA HSPFPSSSPS H  RF+R YT P SDSAA S L+CNSTSQSVPSLCLFERCNG T DLNLGLFR  YRSC RSF
Subjt:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFLRLYTTPASDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSC-RSF

Query:  SSSSYEKPQCGT----------------------------------------------------------------------------------------
        SS S+EKPQ GT                                                                                        
Subjt:  SSSSYEKPQCGT----------------------------------------------------------------------------------------

Query:  -----------------------------------DEDSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQ
                                           DEDS+KI VERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERR SFYNIIVREQ
Subjt:  -----------------------------------DEDSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQ

Query:  CRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAA
        CRRNDFEGARCTLDEMRQ GCSPDVGILNYLLSSLCKNDK  EA NLFEEMLERDCPPNSLTFEVIICHLCEIGNIESAL+FLDMMVSRGLEPRLSTHAA
Subjt:  CRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAA

Query:  FVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSN
        FVK YFNS+RYEEAYRY +DSSLKH MA+NATYSLLATLHEKRGNLVDAQK+L ELIDAGLRPNFPVYMRVLKKLQ+QG+ED ANDLKGK SN
Subjt:  FVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSN

XP_023007595.1 pentatricopeptide repeat-containing protein At1g09820-like [Cucurbita maxima] (e value: 6.1e-166; % identity: 66.53)
Query:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFLRLYTTPASDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSC-RSF
        M I  LSRNLR+FS PYFYSIYSFA HSPFPSSSPS H  RF+R YT P SDSAA S L+CNST QSVPSLCLFERCNG T DLNLGLFR  YRSC RSF
Subjt:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFLRLYTTPASDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSC-RSF

Query:  SSSSYEKPQCGTDE--------------------------------------------------------------------------------------
        SS S+EKPQ GT E                                                                                      
Subjt:  SSSSYEKPQCGTDE--------------------------------------------------------------------------------------

Query:  -------------------------------------DSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQ
                                             DS+KISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERR SFYNIIVREQ
Subjt:  -------------------------------------DSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQ

Query:  CRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAA
        CRRNDFEGARCTLDEMRQ GCSPDVGILNYLLSSLCKNDKF EAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESAL+FLD MVSRGLEPRLSTHAA
Subjt:  CRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAA

Query:  FVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSN
        FVK YFNS+RYEEAYRY VDSSLKH MA+NATYSLLATLHEKRGNLVDAQKIL ELIDAGLRPNFPVYMRVLKKLQ+QG+ED ANDLKGK SN
Subjt:  FVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSN

XP_038906258.1 pentatricopeptide repeat-containing protein At1g09820-like isoform X1 [Benincasa hispida] (e value: 1.1e-164; % identity: 66)
Query:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFLRLYTTPA-SDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSCRSF
        MEI CLSRNLRNFS+PYFYSIYSF  HSPFPSSSPS H +RF R YTTPA SDS A SSLICNS+SQSVPSLCLFERCNGGTLDLN GLFRH YRS RSF
Subjt:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFLRLYTTPA-SDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSCRSF

Query:  SSSSYEKPQCGT----------------------------------------------------------------------------------------
        SSSS+EKPQ GT                                                                                        
Subjt:  SSSSYEKPQCGT----------------------------------------------------------------------------------------

Query:  -----------------------------------DEDSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQ
                                           +EDS+KISVERLVKLLNEVGGSCRISGVMALIEMFCS GSFG+AKFVIEITERRASF NIIVREQ
Subjt:  -----------------------------------DEDSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQ

Query:  CRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAA
        CRR++FEGA+CTLDEMRQ G  PDVGILNYLLSSLCKNDKFGEAH LFEEMLE DCPPNSLTFEVIICHLCEIGNIESA S+LDMMVSRGLEPR STHAA
Subjt:  CRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAA

Query:  FVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNACMLGK
        FVK YFNSRRYEEAYRY VDSSLKH MA+N TYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFP+YMRVLKKLQLQGKED ANDLKGK +N    LGK
Subjt:  FVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNACMLGK

XP_038906259.1 pentatricopeptide repeat-containing protein At1g52640, mitochondrial-like isoform X2 [Benincasa hispida] (e value: 1.1e-164; % identity: 65.87)
Query:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFLRLYTTPA-SDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSCRSF
        MEI CLSRNLRNFS+PYFYSIYSF  HSPFPSSSPS H +RF R YTTPA SDS A SSLICNS+SQSVPSLCLFERCNGGTLDLN GLFRH YRS RSF
Subjt:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFLRLYTTPA-SDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSCRSF

Query:  SSSSYEKPQCGT----------------------------------------------------------------------------------------
        SSSS+EKPQ GT                                                                                        
Subjt:  SSSSYEKPQCGT----------------------------------------------------------------------------------------

Query:  -----------------------------------DEDSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQ
                                           +EDS+KISVERLVKLLNEVGGSCRISGVMALIEMFCS GSFG+AKFVIEITERRASF NIIVREQ
Subjt:  -----------------------------------DEDSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQ

Query:  CRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAA
        CRR++FEGA+CTLDEMRQ G  PDVGILNYLLSSLCKNDKFGEAH LFEEMLE DCPPNSLTFEVIICHLCEIGNIESA S+LDMMVSRGLEPR STHAA
Subjt:  CRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAA

Query:  FVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNACMLGK
        FVK YFNSRRYEEAYRY VDSSLKH MA+N TYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFP+YMRVLKKLQLQGKED ANDLKGK +N+  M  K
Subjt:  FVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNACMLGK

Query:  P
        P
Subjt:  P
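
The % identity column reported for each hit can be read as the share of alignment columns in which query and subject carry the same residue. The sketch below illustrates that calculation on a short made-up pair of aligned strings (not taken from this record). The function name `percent_identity` is hypothetical, and using the full alignment length, gaps included, as the denominator is an assumption about how the values on this page were derived.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent of aligned columns where both sequences have the same residue.

    Both strings must already be aligned (same length, '-' marking gaps).
    ASSUMPTION: the denominator is the full alignment length, gaps included.
    """
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have equal length")
    if not query:
        raise ValueError("alignment is empty")
    identical = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return 100.0 * identical / len(query)


# Illustrative toy alignment: 7 identical columns out of 10.
print(percent_identity("MEIPC-LSRN", "MEISCALSKN"))
```

Columns containing a gap never count as identical, so long gapped stretches lower the reported identity even when the aligned residues match closely.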

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L3E7 Uncharacterized protein (e value: 7.3e-149; % identity: 59.36)
Query:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFLRLYTTPASDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSCRSFS
        MEIPCLSRNLRNFS+PYFYSIYSFA  SPF         YRF R YT  ASDSAAGSSLICNSTSQSVPSLCLFERCNGGT DLNL LFRH YRSCR+FS
Subjt:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFLRLYTTPASDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSCRSFS

Query:  SSSYEKPQCGT-----------------------------------------------------------------------------------------
        S S EK QCGT                                                                                         
Subjt:  SSSYEKPQCGT-----------------------------------------------------------------------------------------

Query:  ---------------------------------DEDSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQCR
                                         +E S+K+SV +LVKLLNE GG+CR+SG+MALIEMFCSLGSFGMAKFVIEITE+R+SFY IIVRE+C+
Subjt:  ---------------------------------DEDSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQCR

Query:  RNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAAFV
        + DFEGARCTLDEMRQ GC PD GILNYLLSSLCKNDKFGEAHNL EEMLE++C PNSLTFE+IICHLC+IGNIESAL +LDMMV+ GL PRLSTHAAFV
Subjt:  RNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAAFV

Query:  KCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNACMLG
        K YF+S+RYEEAY+YAVDSSLK+   +NATYSLLATLHEKRGNLVDAQKILSEL+DAGL+P+F VY R+LKKLQ+QG+ D ANDLK K+SN +   G
Subjt:  KCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNACMLG

A0A5D3D9T8 Pentatricopeptide repeat-containing protein (e value: 2.8e-148; % identity: 60.04)
Query:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFL-RLYTTPASDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSCRSF
        MEIPCLSRNLRNFS+PYFYSIYSFA +SPFP        YRF  R YTTPASDSAAGSSLICNSTSQSV SLCLFERCNG T DLNL LFRH YRSCRSF
Subjt:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFL-RLYTTPASDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSCRSF

Query:  SSSSYEKPQCGT----------------------------------------------------------------------------------------
        SS S +K Q GT                                                                                        
Subjt:  SSSSYEKPQCGT----------------------------------------------------------------------------------------

Query:  ----------------------------------DEDSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQC
                                          +EDS+K SV +LVK+LN+  G+CRISGV ALIEMFCS+GS  MAKFVIEITE+R+SFY IIVREQC
Subjt:  ----------------------------------DEDSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQC

Query:  RRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAAF
        +R DFEGARCTLDEMRQ GC PD GI NYLLSSLCKNDKFGEAHNLFEEMLE++CPPNSL+FEVIICHLC+IGNIESAL FLD MV+RGL+PRLSTHA F
Subjt:  RRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAAF

Query:  VKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNACMLG
        VK YF SRRYEEAY+YAVDSS K+ M +NATYSLLATLHEKRGNLVDAQKILSEL+DAGL+PNF V  RVLKKLQ+QG+ED ANDLKGKLSN    +G
Subjt:  VKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNACMLG

A0A6J1CJ38 pentatricopeptide repeat-containing protein At1g62930, chloroplastic-like (e value: 6.0e-143; % identity: 58.11)
Query:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTH--------------CYRFLRLYTTPASDS------AAGSSLICNSTSQSVPSLCLFERCNGG
        MEI  L R LRNFS+P  Y IYSF+S SP+PS S  +H               + F   Y+  ASDS      AAG SLICN+ S S PSLCLF RCNGG
Subjt:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTH--------------CYRFLRLYTTPASDS------AAGSSLICNSTSQSVPSLCLFERCNGG

Query:  TLDLNLGLFRHR-YRSCRSFSSSSY------EKPQCGT--------------------------------------------------------------
         LDLNLGL + R Y +CRSF SSS+      EKPQCGT                                                              
Subjt:  TLDLNLGLFRHR-YRSCRSFSSSSY------EKPQCGT--------------------------------------------------------------

Query:  -------------------------------------------------------------DEDSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLG
                                                                     +E S+KISVER++KLLNEVGGSCRISGVM+LIEMFCS G
Subjt:  -------------------------------------------------------------DEDSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLG

Query:  SFGMAKFVIEITERRASFYNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIG
        S+GMAKFVIEITERRASFYNIIVREQCRRNDFEGARCTL+EMRQ GCSPDVGILNYLLS LCKND+F EA ++FE ML++DCPPNSLTFEVIICHLCEIG
Subjt:  SFGMAKFVIEITERRASFYNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIG

Query:  NIESALSFLDMMVSRGLEPRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKK
         IESALSFLDMMVSRGLEPRLSTHAAFVK YFNS+RYEEAYRYAVDSS KHA A+NATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVY RV KK
Subjt:  NIESALSFLDMMVSRGLEPRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKK

Query:  LQLQGKEDWANDLKGKLS
        LQLQGKED ANDLKGK S
Subjt:  LQLQGKEDWANDLKGKLS

A0A6J1F148 pentatricopeptide repeat-containing protein At1g05670, mitochondrial-like (e value: 6.6e-166; % identity: 66.13)
Query:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFLRLYTTPASDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSC-RSF
        M I  LSRNLR+FS PYFYSIYSFA HSPFPSSSPS H  RF+R YT P SDSAA S L+CNSTSQSVPSLCLFERCNG T DLNLGLFR  YRSC RSF
Subjt:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFLRLYTTPASDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSC-RSF

Query:  SSSSYEKPQCGT----------------------------------------------------------------------------------------
        SS S+EKPQ GT                                                                                        
Subjt:  SSSSYEKPQCGT----------------------------------------------------------------------------------------

Query:  -----------------------------------DEDSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQ
                                           DEDS+KI VERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERR SFYNIIVREQ
Subjt:  -----------------------------------DEDSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQ

Query:  CRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAA
        CRRNDFEGARCTLDEMRQ GCSPDVGILNYLLSSLCKNDK  EA NLFEEMLERDCPPNSLTFEVIICHLCEIGNIESAL+FLDMMVSRGLEPRLSTHAA
Subjt:  CRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAA

Query:  FVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSN
        FVK YFNS+RYEEAYRY +DSSLKH MA+NATYSLLATLHEKRGNLVDAQK+L ELIDAGLRPNFPVYMRVLKKLQ+QG+ED ANDLKGK SN
Subjt:  FVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSN

A0A6J1L3E8 pentatricopeptide repeat-containing protein At1g09820-like (e value: 2.9e-166; % identity: 66.53)
Query:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFLRLYTTPASDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSC-RSF
        M I  LSRNLR+FS PYFYSIYSFA HSPFPSSSPS H  RF+R YT P SDSAA S L+CNST QSVPSLCLFERCNG T DLNLGLFR  YRSC RSF
Subjt:  MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFLRLYTTPASDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSC-RSF

Query:  SSSSYEKPQCGTDE--------------------------------------------------------------------------------------
        SS S+EKPQ GT E                                                                                      
Subjt:  SSSSYEKPQCGTDE--------------------------------------------------------------------------------------

Query:  -------------------------------------DSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQ
                                             DS+KISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERR SFYNIIVREQ
Subjt:  -------------------------------------DSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQ

Query:  CRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAA
        CRRNDFEGARCTLDEMRQ GCSPDVGILNYLLSSLCKNDKF EAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESAL+FLD MVSRGLEPRLSTHAA
Subjt:  CRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAA

Query:  FVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSN
        FVK YFNS+RYEEAYRY VDSSLKH MA+NATYSLLATLHEKRGNLVDAQKIL ELIDAGLRPNFPVYMRVLKKLQ+QG+ED ANDLKGK SN
Subjt:  FVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSN

SwissProt top hits (e value, % identity, alignment)
O49436 Pentatricopeptide repeat-containing protein At4g20090 (e value: 3.4e-18; % identity: 27.41)
Query:  CRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAA
        C+    + A   LDEM+  GCSP   I N L+  LCK         L + M  + C PN +T+  +I  LC  G ++ A+S L+ MVS    P   T+  
Subjt:  CRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAA

Query:  FVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNACM
         +      RR  +A R       +        YS+L +   K G   +A  +  ++ + G +PN  VY  ++  L  +GK + A ++  ++  + C+
Subjt:  FVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNACM

Q3EDF8 Pentatricopeptide repeat-containing protein At1g09900 (e value: 3.6e-20; % identity: 26.17)
Query:  RLVKLLNEVGGSCRISGVM---ALIEMFCSLGSFGMAKFVIE--ITERRASFYNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDK
        +  K+L  + GS  +  V+    +I  +C  G    A  V++          YN I+R  C     + A   LD M Q  C PDV     L+ + C++  
Subjt:  RLVKLLNEVGGSCRISGVM---ALIEMFCSLGSFGMAKFVIE--ITERRASFYNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDK

Query:  FGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLH
         G A  L +EM +R C P+ +T+ V++  +C+ G ++ A+ FL+ M S G +P + TH   ++   ++ R+ +A +   D   K       T+++L    
Subjt:  FGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLH

Query:  EKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNAC
         ++G L  A  IL ++   G +PN   Y  +L     + K D A +   ++ +  C
Subjt:  EKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNAC

Q9FLJ4 Pentatricopeptide repeat-containing protein At5g61400 (e value: 5.2e-19; % identity: 27.01)
Query:  VGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLE
        V G C+   ++    +F  +  FG+        +     YN ++   C+  +   A   L EM     SPDV     L++ LC  D+  EA+ LF++M  
Subjt:  VGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLE

Query:  RDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKIL
            P+S T+  +I   C+  N+E AL     M + G+EP + T +  +  Y N R  + A     + ++K  +    TY+ L   H K  N+ +A ++ 
Subjt:  RDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKIL

Query:  SELIDAGLRPN
        S++++AG+ PN
Subjt:  SELIDAGLRPN

Q9LPX2 Pentatricopeptide repeat-containing protein At1g12775, mitochondrial (e value: 1.2e-18; % identity: 26.24)
Query:  YNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLE
        YN ++   C    ++     L +M +   SP+V   + L+ S  K  K  EA  L +EM++R   PN++T+  +I   C+   +E A+  +D+M+S+G +
Subjt:  YNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLE

Query:  PRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLS
        P + T    +  Y  + R ++      + SL+  +A   TY+ L     + G L  A+K+  E++   +RP+   Y  +L  L   G+ + A ++ GK+ 
Subjt:  PRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLS

Query:  NN
         +
Subjt:  NN

Q9LSL9 Pentatricopeptide repeat-containing protein At5g65560 (e value: 6.2e-20; % identity: 25.74)
Query:  ALIEMFCSLGSFGMAKFVIEITERRASF-----YNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPN
        +LI+  C  G+F  A  ++ +   R        Y  ++   C+    E A    D + Q G +P+V +   L+   CK  K  EAH + E+ML ++C PN
Subjt:  ALIEMFCSLGSFGMAKFVIEITERRASF-----YNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPN

Query:  SLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDA
        SLTF  +I  LC  G ++ A    + MV  GL+P +ST    +        ++ AY                TY+     + + G L+DA+ +++++ + 
Subjt:  SLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDA

Query:  GLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNAC
        G+ P+   Y  ++K     G+ ++A D+  ++ +  C
Subjt:  GLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNAC

Arabidopsis top hits (e value, % identity, alignment)
AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 2.6e-21; % identity: 26.17)
Query:  RLVKLLNEVGGSCRISGVM---ALIEMFCSLGSFGMAKFVIE--ITERRASFYNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDK
        +  K+L  + GS  +  V+    +I  +C  G    A  V++          YN I+R  C     + A   LD M Q  C PDV     L+ + C++  
Subjt:  RLVKLLNEVGGSCRISGVM---ALIEMFCSLGSFGMAKFVIE--ITERRASFYNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDK

Query:  FGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLH
         G A  L +EM +R C P+ +T+ V++  +C+ G ++ A+ FL+ M S G +P + TH   ++   ++ R+ +A +   D   K       T+++L    
Subjt:  FGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLH

Query:  EKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNAC
         ++G L  A  IL ++   G +PN   Y  +L     + K D A +   ++ +  C
Subjt:  EKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNAC

AT1G12775.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 8.3e-20; % identity: 26.24)
Query:  YNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLE
        YN ++   C    ++     L +M +   SP+V   + L+ S  K  K  EA  L +EM++R   PN++T+  +I   C+   +E A+  +D+M+S+G +
Subjt:  YNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLE

Query:  PRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLS
        P + T    +  Y  + R ++      + SL+  +A   TY+ L     + G L  A+K+  E++   +RP+   Y  +L  L   G+ + A ++ GK+ 
Subjt:  PRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLS

Query:  NN
         +
Subjt:  NN

AT4G20090.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 2.4e-19; % identity: 27.41)
Query:  CRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAA
        C+    + A   LDEM+  GCSP   I N L+  LCK         L + M  + C PN +T+  +I  LC  G ++ A+S L+ MVS    P   T+  
Subjt:  CRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAA

Query:  FVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNACM
         +      RR  +A R       +        YS+L +   K G   +A  +  ++ + G +PN  VY  ++  L  +GK + A ++  ++  + C+
Subjt:  FVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNACM

AT5G61400.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 3.7e-20; % identity: 27.01)
Query:  VGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLE
        V G C+   ++    +F  +  FG+        +     YN ++   C+  +   A   L EM     SPDV     L++ LC  D+  EA+ LF++M  
Subjt:  VGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLE

Query:  RDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKIL
            P+S T+  +I   C+  N+E AL     M + G+EP + T +  +  Y N R  + A     + ++K  +    TY+ L   H K  N+ +A ++ 
Subjt:  RDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKIL

Query:  SELIDAGLRPN
        S++++AG+ PN
Subjt:  SELIDAGLRPN

AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 4.4e-21; % identity: 25.74)
Query:  ALIEMFCSLGSFGMAKFVIEITERRASF-----YNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPN
        +LI+  C  G+F  A  ++ +   R        Y  ++   C+    E A    D + Q G +P+V +   L+   CK  K  EAH + E+ML ++C PN
Subjt:  ALIEMFCSLGSFGMAKFVIEITERRASF-----YNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEAHNLFEEMLERDCPPN

Query:  SLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDA
        SLTF  +I  LC  G ++ A    + MV  GL+P +ST    +        ++ AY                TY+     + + G L+DA+ +++++ + 
Subjt:  SLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILSELIDA

Query:  GLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNAC
        G+ P+   Y  ++K     G+ ++A D+  ++ +  C
Subjt:  GLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNAC


Sequences
CDS sequence
ATGGAGATTCCTTGTCTTTCTCGAAATCTACGTAATTTCTCCAGCCCTTACTTCTACTCGATTTACTCATTTGCTTCTCATTCTCCATTTCCTTCATCCTCGCCTTCCAC
ACATTGTTACCGTTTTCTTCGATTGTATACTACACCAGCATCCGATTCTGCTGCTGGTTCTTCCTTAATCTGCAATTCAACTTCTCAATCTGTGCCTTCATTGTGTTTGT
TTGAGAGATGCAATGGTGGAACATTGGATTTGAATCTTGGATTGTTTCGACACCGCTACAGAAGCTGCCGTTCATTTTCCTCCTCTTCGTATGAGAAGCCCCAGTGTGGG
ACTGATGAAGATTCACTTAAGATTTCTGTTGAAAGACTGGTGAAGTTGTTGAATGAAGTAGGAGGATCTTGTAGAATATCTGGGGTCATGGCATTGATTGAGATGTTCTG
TTCTCTTGGTTCTTTTGGAATGGCCAAGTTTGTAATTGAGATAACTGAGAGAAGAGCATCTTTCTACAACATCATTGTTCGGGAACAATGTCGACGAAACGATTTCGAAG
GAGCTAGATGTACACTTGATGAGATGAGGCAACATGGTTGTAGCCCAGATGTGGGGATTCTCAATTATCTGCTCAGTAGTTTATGCAAGAATGACAAATTTGGTGAAGCT
CATAACTTGTTTGAAGAAATGCTCGAAAGAGATTGTCCTCCAAACTCCTTGACATTTGAAGTCATTATCTGCCATCTCTGTGAAATTGGTAATATTGAATCAGCACTCAG
CTTTCTTGACATGATGGTGTCAAGGGGTCTTGAGCCTCGCCTTTCGACACACGCTGCCTTCGTGAAATGCTACTTCAATTCACGGAGATATGAGGAAGCGTATCGGTATG
CGGTTGATTCTAGCTTGAAACATGCCATGGCAAAGAATGCAACATATAGCTTGCTTGCAACTCTTCATGAGAAGAGAGGAAACTTAGTTGATGCTCAAAAAATTCTTTCT
GAGTTGATAGATGCAGGTCTTAGACCAAACTTTCCCGTGTATATGAGAGTTCTGAAGAAGCTTCAGCTTCAAGGCAAGGAAGATTGGGCCAATGATTTGAAGGGAAAACT
CTCCAATAACGCGTGTATGCTAGGGAAGCCGAAGACCAGGCGACGGGGGTCACTAGAAACTTTTGGATGTCCACAACGATGGACTCCCTCCAAGGTGAGAAAGAGCAAAG
TTGATTGTACCAGTGAAGTTGTTGTTGGCAAGGAGTTTCATGGTCTTCATTCCATTTGA
mRNA sequence
ATGGAGATTCCTTGTCTTTCTCGAAATCTACGTAATTTCTCCAGCCCTTACTTCTACTCGATTTACTCATTTGCTTCTCATTCTCCATTTCCTTCATCCTCGCCTTCCAC
ACATTGTTACCGTTTTCTTCGATTGTATACTACACCAGCATCCGATTCTGCTGCTGGTTCTTCCTTAATCTGCAATTCAACTTCTCAATCTGTGCCTTCATTGTGTTTGT
TTGAGAGATGCAATGGTGGAACATTGGATTTGAATCTTGGATTGTTTCGACACCGCTACAGAAGCTGCCGTTCATTTTCCTCCTCTTCGTATGAGAAGCCCCAGTGTGGG
ACTGATGAAGATTCACTTAAGATTTCTGTTGAAAGACTGGTGAAGTTGTTGAATGAAGTAGGAGGATCTTGTAGAATATCTGGGGTCATGGCATTGATTGAGATGTTCTG
TTCTCTTGGTTCTTTTGGAATGGCCAAGTTTGTAATTGAGATAACTGAGAGAAGAGCATCTTTCTACAACATCATTGTTCGGGAACAATGTCGACGAAACGATTTCGAAG
GAGCTAGATGTACACTTGATGAGATGAGGCAACATGGTTGTAGCCCAGATGTGGGGATTCTCAATTATCTGCTCAGTAGTTTATGCAAGAATGACAAATTTGGTGAAGCT
CATAACTTGTTTGAAGAAATGCTCGAAAGAGATTGTCCTCCAAACTCCTTGACATTTGAAGTCATTATCTGCCATCTCTGTGAAATTGGTAATATTGAATCAGCACTCAG
CTTTCTTGACATGATGGTGTCAAGGGGTCTTGAGCCTCGCCTTTCGACACACGCTGCCTTCGTGAAATGCTACTTCAATTCACGGAGATATGAGGAAGCGTATCGGTATG
CGGTTGATTCTAGCTTGAAACATGCCATGGCAAAGAATGCAACATATAGCTTGCTTGCAACTCTTCATGAGAAGAGAGGAAACTTAGTTGATGCTCAAAAAATTCTTTCT
GAGTTGATAGATGCAGGTCTTAGACCAAACTTTCCCGTGTATATGAGAGTTCTGAAGAAGCTTCAGCTTCAAGGCAAGGAAGATTGGGCCAATGATTTGAAGGGAAAACT
CTCCAATAACGCGTGTATGCTAGGGAAGCCGAAGACCAGGCGACGGGGGTCACTAGAAACTTTTGGATGTCCACAACGATGGACTCCCTCCAAGGTGAGAAAGAGCAAAG
TTGATTGTACCAGTGAAGTTGTTGTTGGCAAGGAGTTTCATGGTCTTCATTCCATTTGA
Protein sequence
MEIPCLSRNLRNFSSPYFYSIYSFASHSPFPSSSPSTHCYRFLRLYTTPASDSAAGSSLICNSTSQSVPSLCLFERCNGGTLDLNLGLFRHRYRSCRSFSSSSYEKPQCG
TDEDSLKISVERLVKLLNEVGGSCRISGVMALIEMFCSLGSFGMAKFVIEITERRASFYNIIVREQCRRNDFEGARCTLDEMRQHGCSPDVGILNYLLSSLCKNDKFGEA
HNLFEEMLERDCPPNSLTFEVIICHLCEIGNIESALSFLDMMVSRGLEPRLSTHAAFVKCYFNSRRYEEAYRYAVDSSLKHAMAKNATYSLLATLHEKRGNLVDAQKILS
ELIDAGLRPNFPVYMRVLKKLQLQGKEDWANDLKGKLSNNACMLGKPKTRRRGSLETFGCPQRWTPSKVRKSKVDCTSEVVVGKEFHGLHSI
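
The protein sequence above should correspond to the CDS translated codon by codon. As a minimal sketch of that relationship, the snippet below translates a nucleotide string with the standard genetic code and checks the first and last codons of this record's CDS against the listed protein. The function name `translate` is hypothetical, and the use of the standard nuclear code is an assumption; the page does not state which translation table was applied.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Walk the sequence in whole codons; trailing 1-2 bases are ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons and the final 4 codons of the CDS listed on this page.
print(translate("ATGGAGATTCCTTGTCTTTCTCGAAATCTACGTAATTTC"))
print(translate("CATTCCATTTGA"))
```

The two fragments correspond to the start (`MEIPCLSRNLRNF`) and the end (`HSI`, followed by the TGA stop) of the protein sequence shown above.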