CuGenDBv2

CmaCh04G009070 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh04G009070
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like
Genome location: Cma_Chr04:4716319..4720443
RNA-Seq Expression: CmaCh04G009070
Synteny: CmaCh04G009070
Gene Ontology terms:
    GO:0009664 - plant-type cell wall organization (biological process)
    GO:0005576 - extracellular region (cellular component)
    GO:0005739 - mitochondrion (cellular component)
    GO:0005515 - protein binding (molecular function)
InterPro domains:
    IPR002885 - Pentatricopeptide repeat
    IPR002963 - Expansin
    IPR007112 - Expansin/pollen allergen, DPBB domain
    IPR007117 - Expansin, cellulose-binding-like domain
    IPR007118 - Expansin/Lol pI
    IPR009009 - RlpA-like protein, double-psi beta-barrel domain
    IPR011990 - Tetratricopeptide-like helical domain superfamily
    IPR036749 - Expansin, cellulose-binding-like domain superfamily
    IPR036908 - RlpA-like domain superfamily
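
The genome location string above can be split into sequence name, start, and end with a few lines of Python. This is only a minimal sketch: the function name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("Cma_Chr04:4716319..4720443")
# Span in base pairs, ASSUMING 1-based inclusive coordinates.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene model spans 4125 bp on Cma_Chr04.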


Homology
GenBank top hits (e value, % identity, alignment)
KAG6600732.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia] (e value: 7.9e-289; % identity: 96.88)
Query:  FFLKLEVSGGPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDL
        FFLKLEVSG PALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFE+RRIVRDL
Subjt:  FFLKLEVSGGPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDL

Query:  RNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDI
        RNCRRYGQAL+VSEWMRSKGLFSFTTRDFAVQLDLIGRV+G+DSAEKYF SVSNQEE+GKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDI
Subjt:  RNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDI

Query:  MCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDAL
        MCLYLNTGQVDKVPNVLSEMK NGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDAL
Subjt:  MCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDAL

Query:  GFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRI
        GFNHLISLYTSLG KDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEW SSCECYDFRVPNVLLIGYS++GLIERAEKMLQNIISDGRI
Subjt:  GFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRI

Query:  PPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKNNDETTADA
        PPPNSWGIIAAGYLEKQN E+AFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKNNDETTADA
Subjt:  PPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKNNDETTADA

Query:  LKESQPCLAQLD
        LK+SQPCLAQ+D
Subjt:  LKESQPCLAQLD

KAG7031372.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 3.3e-279; % identity: 96.95)
Query:  MFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRS
        MFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFE+RRIVRDLRNCRRYGQAL+VSEWMRS
Subjt:  MFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRS

Query:  KGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLS
        KGLFSFTTRDFAVQLDLIGRV+G+DSAEKYF SVSNQEE+GKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLS
Subjt:  KGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLS

Query:  EMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDALGFNHLISLYTSLGCKDEV
        EMK+NGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDALGFNHLISLYTSLG KDEV
Subjt:  EMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDALGFNHLISLYTSLGCKDEV

Query:  MRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQN
        MRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLV+EW SSCECYDFRVPNVLLIGYS++GLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQN
Subjt:  MRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQN

Query:  LEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKNNDETTADALKESQPCLAQ
         E+AFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKNNDETTADALK+SQPCLAQ
Subjt:  LEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKNNDETTADALKESQPCLAQ

XP_022942045.1 pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like [Cucurbita moschata] (e value: 1.3e-288; % identity: 96.68)
Query:  FFLKLEVSGGPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDL
        FFLKLEVSG PALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFE+RRIVRDL
Subjt:  FFLKLEVSGGPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDL

Query:  RNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDI
        RNCRRYGQAL+VSEWMRSKGLFSFTTRDFAVQLDLIGRV+G+DSAEKYFSSVSNQEE+GKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDI
Subjt:  RNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDI

Query:  MCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDAL
        MCLYLNTGQVDKVPNVLSEMK+NGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHE+AMSYLRKCEDKVNQDAL
Subjt:  MCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDAL

Query:  GFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRI
        GFNHLISLYTSLG KDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLV+EW SSCECYDFRVPNVLLIGYS+RGLIERAEKMLQNIISDGRI
Subjt:  GFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRI

Query:  PPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKNNDETTADA
        PPPNSWGIIAAGYLEKQN E+AFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLK VPSMDGKLSNAFDELLETLKNNDETTADA
Subjt:  PPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKNNDETTADA

Query:  LKESQPCLAQLD
        LK+SQPCLAQ+D
Subjt:  LKESQPCLAQLD

XP_022989754.1 pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like [Cucurbita maxima] (e value: 9.6e-303; % identity: 100)
Query:  VALKTVETRFFFLKLEVSGGPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKD
        VALKTVETRFFFLKLEVSGGPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKD
Subjt:  VALKTVETRFFFLKLEVSGGPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKD

Query:  FELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGF
        FELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGF
Subjt:  FELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGF

Query:  ASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRK
        ASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRK
Subjt:  ASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRK

Query:  CEDKVNQDALGFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKM
        CEDKVNQDALGFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKM
Subjt:  CEDKVNQDALGFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKM

Query:  LQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETL
        LQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETL
Subjt:  LQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETL

Query:  KNNDETTADALKESQPCLAQLD
        KNNDETTADALKESQPCLAQLD
Subjt:  KNNDETTADALKESQPCLAQLD

XP_023545982.1 pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like [Cucurbita pepo subsp. pepo] (e value: 2.1e-294; % identity: 97.14)
Query:  VALKTVETRF---FFLKLEVSGGPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRM
        VALKTVETRF   FFLKLEVSG PALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRM
Subjt:  VALKTVETRF---FFLKLEVSGGPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRM

Query:  IKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKE
        IKDFE+RRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEE+GKLYGALLNCYVREGLVDKALSHMQKMKE
Subjt:  IKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKE

Query:  MGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSY
        MGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVL+EMESQTHISMDWTTYSMVANFFIKAGMHE+AMSY
Subjt:  MGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSY

Query:  LRKCEDKVNQDALGFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERA
        LRKCEDKVNQDALGFNHLISLYTSLG KDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEW SSCECYDFRVPNVLLIGYS+RGLIERA
Subjt:  LRKCEDKVNQDALGFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERA

Query:  EKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELL
        EKMLQNIISDGRIPPPNSWGIIAAGYLEKQN E+AFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELL
Subjt:  EKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELL

Query:  ETLKNNDETTADALKESQPCLAQLD
        ETLKNNDETTADALK+SQPCLAQ+D
Subjt:  ETLKNNDETTADALKESQPCLAQLD
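
The middle row of each alignment block can be used to tally matches for that row. The sketch below assumes the usual BLAST pairwise convention (a letter in the middle row marks an identical residue, `+` marks a conservative substitution, a space marks a mismatch or gap); the page does not state its convention, and the helper name `midline_stats` is hypothetical. It covers a single row only, whereas the % identity reported for each hit refers to the whole alignment.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned columns) for one alignment row."""
    if len(query) != len(midline):
        raise ValueError("query and midline rows must have the same length")
    # ASSUMPTION: letters in the midline are identities, '+' are similar residues.
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Last row of the Cucurbita pepo alignment shown above.
query_row = "ETLKNNDETTADALKESQPCLAQLD"
middle_row = "ETLKNNDETTADALK+SQPCLAQ+D"
identities, positives, columns = midline_stats(query_row, middle_row)
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
```

For this row the tally is 23 identical and 25 positive residues out of 25 aligned columns.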

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L7Y2 Uncharacterized protein (e value: 3.0e-233; % identity: 81.63)
Query:  LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEV
        +AAA AMFKIL   SSG TRT R ETDAFCFVALRLYS RR+C+RRNL+ARISPLG PE +VVP+L+QWI+EGR IKDFELRRIVRDLR CRRY QALEV
Subjt:  LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEV

Query:  SEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDK
        SEWM SKGLFS TTRDFA+QLDLIG+V+GLDSAEKYF SVSNQ+E+GKLYGALLNCYVREGL+DK+L+HMQKMKEMG ASSPLCYNDIMCLYLNTGQ DK
Subjt:  SEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDK

Query:  VPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDALGFNHLISLYTSL
        VPNVLSEMKENGVLPDN+SYRICISSYGARSD+I M  VLKEME QTHISMDWTTYSMVA FFIKAGMH+KAM+YLRKCEDKV++DALGFNHLIS YT+L
Subjt:  VPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDALGFNHLISLYTSL

Query:  GCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAG
        G K+EVMRLWAL KK KKQ+NRDYITMLG LVKLE LEEAE LV EW SSC+CYDFRVPNV+LIGYS++GLIE+AEKML+NII +G IP PNSWGIIA+G
Subjt:  GCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAG

Query:  YLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKNNDETTADALK
        YLEKQNLEKAF+CMKEA+AV+ QNK WRPKP+VLSSILRWLSEN RYEE+KEF+SSLKTVPSMD KL+NA DELLE + N+D  + D L+
Subjt:  YLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKNNDETTADALK

A0A3Q7HHL7 Uncharacterized protein (e value: 6.8e-230; % identity: 55.43)
Query:  YALFFMLVGGSFVHGDGGAWLDAHATFYGADQNPTSLGGACGYDNTFHAGFGINTAAVSGALFRGGEACGACFLVICNYNVDPKWCLRRRAVAITATNFC
        + L F       V  +   WL+  ATFYG +Q+P++ GGACGYDN +HAGFG+NTAA+S ALFR G+ACGAC+ V CN  +D +WC+   AV +TATNFC
Subjt:  YALFFMLVGGSFVHGDGGAWLDAHATFYGADQNPTSLGGACGYDNTFHAGFGINTAAVSGALFRGGEACGACFLVICNYNVDPKWCLRRRAVAITATNFC

Query:  PSNNNGGWCDPPRAHFDMSSPAFLTIARQGNEGIVPVLYKRVSCRRKGGVRFTLRGQSNFNMVMISNVGGSGDVKAAWVKGSRTRTWMPMHRNWGANWQA
        P NN+GGWCD PR HFDMS PAFL IARQGNEG+VPVLY RV+C+R GGVRFTL+GQSNFNMVMISNVGGSGD+K+  ++GSRT+TW+PM+RNWG NWQ+
Subjt:  PSNNNGGWCDPPRAHFDMSSPAFLTIARQGNEGIVPVLYKRVSCRRKGGVRFTLRGQSNFNMVMISNVGGSGDVKAAWVKGSRTRTWMPMHRNWGANWQA

Query:  NVDLRNQRMSFKVTLIDGRTLEFVNVVALKTVETRFFFLKLEVSGGPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARI
          DLR+Q +SF++TL+D      V+  + K + +   F  L                  RS S    + A     A+C     +YS+     R NLF+RI
Subjt:  NVDLRNQRMSFKVTLIDGRTLEFVNVVALKTVETRFFFLKLEVSGGPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARI

Query:  SPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGA
        SP+   +L ++P+LD+W+ EGR +  FEL+RIVRDLR+ +R+ QAL+VSEWM  +GL  F + D AV LDLIG V G ++AE YF++++++++  K  GA
Subjt:  SPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGA

Query:  LLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMD
        L NCY+RE LV+K+L+H QKMKE+G+A   L YN++MCLY +TGQ+++V  VLSEMKENGV P+N+SYRICI+  G R+D  GM K+L+EMESQ  IS D
Subjt:  LLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMD

Query:  WTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDALGFNHLISLYTSLGCKDEVMRLWALQK-KCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSC
        W TYSM+AN +IKA   EKA+ +L+K ED++++D +G+NHLIS Y +LG K+E++RLW + K  CKKQ+NRDYITMLG LVKL  LE AE ++KEW SSC
Subjt:  WTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDALGFNHLISLYTSLGCKDEVMRLWALQK-KCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSC

Query:  ECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELK
          YDFRVPN+LLIGY +  L+++AE ML +II  G+ P PNSW IIAAGYL   N+EKAF+CMK+A+AV+ QN  WRPKP ++SSI +WL +     E++
Subjt:  ECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELK

Query:  EFLSSLKTV
         FLSSLKTV
Subjt:  EFLSSLKTV

A0A6J1CNQ2 pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like (e value: 1.1e-238; % identity: 85.74)
Query:  LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEV
        +AAALAMFKIL S SS F RT RTETD+FC + LRLYS RR CNRRNLFARISPLG PELSVV ILDQWI+EGR IKDFELRRIVRDLR+CRRYGQALEV
Subjt:  LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEV

Query:  SEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDK
        SEWM SKGLF  TTRDFAVQLDLIGRV+GLDSAEKYFSSVSNQEE+GKLYGALLNCYVREGLVDK+L+HMQ+MK+MGFAS+PL YNDIMCLYLNTG VDK
Subjt:  SEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDK

Query:  VPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDALGFNHLISLYTSL
        VPNVLSEMKENGVLPDN+SYRICI+SYGARSDLI M KVLKEMESQ+HISMDWTTYSMVANFFIKA MHE+A++YLRKCEDKV++DALGFNHLISLYTSL
Subjt:  VPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDALGFNHLISLYTSL

Query:  GCKDEVMRLWALQK-KCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAA
        G  DEVMRLWALQK KCKKQVNRDYITMLG LVKLE LEEAE L+KEW SSC+CYDFRVPNVLLIGYS++GLIERAEKML++I  +GRIPPPNSW IIAA
Subjt:  GCKDEVMRLWALQK-KCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAA

Query:  GYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKNNDE
        GYLEKQNLEKAFKCM EA+AV+EQN GWRPKPSV+SSILRWLSENGRY ELKEFLSSLKTVPSMDGKL NA DEL+ETL+N+ E
Subjt:  GYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKNNDE

A0A6J1FVG8 pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like (e value: 6.5e-289; % identity: 96.68)
Query:  FFLKLEVSGGPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDL
        FFLKLEVSG PALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFE+RRIVRDL
Subjt:  FFLKLEVSGGPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDL

Query:  RNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDI
        RNCRRYGQAL+VSEWMRSKGLFSFTTRDFAVQLDLIGRV+G+DSAEKYFSSVSNQEE+GKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDI
Subjt:  RNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDI

Query:  MCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDAL
        MCLYLNTGQVDKVPNVLSEMK+NGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHE+AMSYLRKCEDKVNQDAL
Subjt:  MCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDAL

Query:  GFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRI
        GFNHLISLYTSLG KDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLV+EW SSCECYDFRVPNVLLIGYS+RGLIERAEKMLQNIISDGRI
Subjt:  GFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRI

Query:  PPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKNNDETTADA
        PPPNSWGIIAAGYLEKQN E+AFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLK VPSMDGKLSNAFDELLETLKNNDETTADA
Subjt:  PPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKNNDETTADA

Query:  LKESQPCLAQLD
        LK+SQPCLAQ+D
Subjt:  LKESQPCLAQLD

A0A6J1JQ72 pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like (e value: 4.6e-303; % identity: 100)
Query:  VALKTVETRFFFLKLEVSGGPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKD
        VALKTVETRFFFLKLEVSGGPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKD
Subjt:  VALKTVETRFFFLKLEVSGGPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKD

Query:  FELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGF
        FELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGF
Subjt:  FELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGF

Query:  ASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRK
        ASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRK
Subjt:  ASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRK

Query:  CEDKVNQDALGFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKM
        CEDKVNQDALGFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKM
Subjt:  CEDKVNQDALGFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKM

Query:  LQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETL
        LQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETL
Subjt:  LQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETL

Query:  KNNDETTADALKESQPCLAQLD
        KNNDETTADALKESQPCLAQLD
Subjt:  KNNDETTADALKESQPCLAQLD

SwissProt top hits (e value, % identity, alignment)
O22714 Pentatricopeptide repeat-containing protein At1g60770 (e value: 4.2e-67; % identity: 31.73)
Query:  VALRLYSARRTCNRRN--------LFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDL
        +A+R  S  R   +R+        L+ R+   G  E+ V   L+Q+++  + +  +E+   ++ LRN   Y  AL++SE M  +G+ + T  D A+ LDL
Subjt:  VALRLYSARRTCNRRN--------LFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDL

Query:  IGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRIC
        + + + + + E YF  +    +    YG+LLNCY +E L +KA   + KMKE+    S + YN +M LY  TG+ +KVP ++ E+K   V+PD+Y+Y + 
Subjt:  IGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRIC

Query:  ISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQ-DALGFNHLISLYTSLGCKDEVMRLW-ALQKKCKKQVN
        + +  A +D+ G+ +V++EM     ++ DWTTYS +A+ ++ AG+ +KA   L++ E K  Q D   +  LI+LY  LG   EV R+W +L+    K  N
Subjt:  ISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQ-DALGFNHLISLYTSLGCKDEVMRLW-ALQKKCKKQVN

Query:  RDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQ
          Y+ M+  LVKL  L  AE L KEW ++C  YD R+ NVL+  Y++ GLI++A ++ +     G      +W I    Y++  ++ +A +CM +AV++ 
Subjt:  RDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQ

Query:  EQNKG-WRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLET
        + + G W P P  + +++ +  +       +  L  LK     D   +  F+ L+ T
Subjt:  EQNKG-WRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLET

Q84JR3 Pentatricopeptide repeat-containing protein At4g21705, mitochondrial (e value: 3.0e-118; % identity: 44.81)
Query:  VALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLD
        +A R Y   R   +  L+++ISPLG P+ SV P L  W+Q G+ +   EL RIV DLR  +R+  ALEVS+WM   G+  F+  + AV LDLIGRV G  
Subjt:  VALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLD

Query:  SAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARS
        +AE+YF ++  Q +  K YGALLNCYVR+  V+K+L H +KMKEMGF +S L YN+IMCLY N GQ +KVP VL EMKE  V PDNYSYRICI+++GA  
Subjt:  SAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARS

Query:  DLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKV-NQDALGFNHLISLYTSLGCKDEVMRLWALQKK-CKKQVNRDYITMLG
        DL  +   L++ME +  I+MDW TY++ A F+I  G  ++A+  L+  E+++  +D  G+NHLI+LY  LG K EV+RLW L+K  CK+++N+DY+T+L 
Subjt:  DLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKV-NQDALGFNHLISLYTSLGCKDEVMRLWALQKK-CKKQVNRDYITMLG

Query:  CLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRP
         LVK++ L EAE+++ EW SS  CYDFRVPN ++ GY  + + E+AE ML+++   G+   P SW ++A  Y EK  LE AFKCMK A+ V+  ++ WRP
Subjt:  CLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRP

Query:  KPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNA------------FDELLETLKNN----DETTADALKESQPC
          ++++S+L W+ + G  +E++ F++SL+    ++ ++ +A             D LL+ +K++    DE T   L    PC
Subjt:  KPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNA------------FDELLETLKNN----DETTADALKESQPC

Q8LPS6 Pentatricopeptide repeat-containing protein At1g02150 (e value: 3.0e-73; % identity: 34.05)
Query:  YSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKG-LFSFTTRDFAVQLDLIGRVQGLDSAEK
        Y  R       ++ +IS +  PEL    +L+QW + GR +  +EL R+V++LR  +R  QALEV +WM ++G  F  +  D A+QLDLIG+V+G+  AE+
Subjt:  YSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKG-LFSFTTRDFAVQLDLIGRVQGLDSAEK

Query:  YFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIG
        +F  +    +  ++YG+LLN YVR    +KA + +  M++ G+A  PL +N +M LY+N  + DKV  ++ EMK+  +  D YSY I +SS G+   +  
Subjt:  YFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIG

Query:  MLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKV-NQDALGFNHLISLYTSLGCKDEVMRLWALQKKCKKQV-NRDYITMLGCLVK
        M  V ++M+S   I  +WTT+S +A  +IK G  EKA   LRK E ++  ++ + +++L+SLY SLG K E+ R+W + K     + N  Y  ++  LV+
Subjt:  MLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKV-NQDALGFNHLISLYTSLGCKDEVMRLWALQKKCKKQV-NRDYITMLGCLVK

Query:  LEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSV
        +  +E AEK+ +EW+     YD R+PN+L+  Y K   +E AE +  +++  G  P  ++W I+A G+  K+ + +A  C++ A +  E +  WRPK  +
Subjt:  LEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSV

Query:  LSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLET-LKNNDETTADALKESQPCLAQL
        LS   +   E       +  L  L+    ++ K   A  ++ E    NN E  A    E+   L QL
Subjt:  LSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLET-LKNNDETTADALKESQPCLAQL

Q9LDJ3 Expansin-A12 (e value: 1.1e-86; % identity: 67.79)
Query:  WLDAHATFYGADQNPTSLGGACGYDNTFHAGFGINTAAVSGALFRGGEACGACFLVICNYNVDPKWCLRRRAVAITATNFCPSNNNGGWCDPPRAHFDMS
        W+ AHAT+YG + +P SLGGACGYDN +HAGFG +TAA+SG LFR GE+CG C+ V C++  DPKWCLR  AV +TATNFCP+NNN GWC+ PR HFDMS
Subjt:  WLDAHATFYGADQNPTSLGGACGYDNTFHAGFGINTAAVSGALFRGGEACGACFLVICNYNVDPKWCLRRRAVAITATNFCPSNNNGGWCDPPRAHFDMS

Query:  SPAFLTIARQGNEGIVPVLYKRVSCRRKGGVRFTLRGQSNFNMVMISNVGGSGDVKAAWVKGSRTRTWMPMHRNWGANWQANVDLRNQRMSFKVTLIDGR
        SPAF  IAR+GNEGIVPV Y+RV C+R+GGVRFT+RGQ NFNMVMISNVGG G V++  V+GS+ +TW+ M RNWGANWQ++ DLR QR+SFKVTL D +
Subjt:  SPAFLTIARQGNEGIVPVLYKRVSCRRKGGVRFTLRGQSNFNMVMISNVGGSGDVKAAWVKGSRTRTWMPMHRNWGANWQANVDLRNQRMSFKVTLIDGR

Query:  TLEFVNVV
        T  F+NVV
Subjt:  TLEFVNVV

Q9SKU6 Pentatricopeptide repeat-containing protein At2g20710, mitochondrial (e value: 1.8e-78; % identity: 36.07)
Query:  RISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLY
        R++  G P  S++ +LD W+ +G ++K  EL  I++ LR   R+  AL++S+WM    +   +  D A++LDLI +V GL  AEK+F ++  +     LY
Subjt:  RISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLY

Query:  GALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHIS
        GALLNCY  + ++ KA    Q+MKE+GF    L YN ++ LY+ TG+   V  +L EM++  V PD ++    + +Y   SD+ GM K L   E+   + 
Subjt:  GALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHIS

Query:  MDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDAL--GFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWV
        +DW TY+  AN +IKAG+ EKA+  LRK E  VN       +  L+S Y + G K+EV RLW+L K+     N  YI+++  L+K++ +EE EK+++EW 
Subjt:  MDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDAL--GFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWV

Query:  SSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYE
        +    +D R+P++L+ GY K+G++E+AE+++  ++   R+   ++W  +A GY     +EKA +  K A+ V +   GWRP   VL S + +L      E
Subjt:  SSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYE

Query:  ELKEFLSSLKTVPSMDGKLSNAFDELL
         L++ L  L    S  G +S  +D+LL
Subjt:  ELKEFLSSLKTVPSMDGKLSNAFDELL

Arabidopsis top hits (e value, % identity, alignment)
AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.1e-74; % identity: 34.05)
Query:  YSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKG-LFSFTTRDFAVQLDLIGRVQGLDSAEK
        Y  R       ++ +IS +  PEL    +L+QW + GR +  +EL R+V++LR  +R  QALEV +WM ++G  F  +  D A+QLDLIG+V+G+  AE+
Subjt:  YSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKG-LFSFTTRDFAVQLDLIGRVQGLDSAEK

Query:  YFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIG
        +F  +    +  ++YG+LLN YVR    +KA + +  M++ G+A  PL +N +M LY+N  + DKV  ++ EMK+  +  D YSY I +SS G+   +  
Subjt:  YFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIG

Query:  MLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKV-NQDALGFNHLISLYTSLGCKDEVMRLWALQKKCKKQV-NRDYITMLGCLVK
        M  V ++M+S   I  +WTT+S +A  +IK G  EKA   LRK E ++  ++ + +++L+SLY SLG K E+ R+W + K     + N  Y  ++  LV+
Subjt:  MLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKV-NQDALGFNHLISLYTSLGCKDEVMRLWALQKKCKKQV-NRDYITMLGCLVK

Query:  LEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSV
        +  +E AEK+ +EW+     YD R+PN+L+  Y K   +E AE +  +++  G  P  ++W I+A G+  K+ + +A  C++ A +  E +  WRPK  +
Subjt:  LEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSV

Query:  LSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLET-LKNNDETTADALKESQPCLAQL
        LS   +   E       +  L  L+    ++ K   A  ++ E    NN E  A    E+   L QL
Subjt:  LSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLET-LKNNDETTADALKESQPCLAQL

AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 3.0e-68; % identity: 31.73)
Query:  VALRLYSARRTCNRRN--------LFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDL
        +A+R  S  R   +R+        L+ R+   G  E+ V   L+Q+++  + +  +E+   ++ LRN   Y  AL++SE M  +G+ + T  D A+ LDL
Subjt:  VALRLYSARRTCNRRN--------LFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDL

Query:  IGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRIC
        + + + + + E YF  +    +    YG+LLNCY +E L +KA   + KMKE+    S + YN +M LY  TG+ +KVP ++ E+K   V+PD+Y+Y + 
Subjt:  IGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRIC

Query:  ISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQ-DALGFNHLISLYTSLGCKDEVMRLW-ALQKKCKKQVN
        + +  A +D+ G+ +V++EM     ++ DWTTYS +A+ ++ AG+ +KA   L++ E K  Q D   +  LI+LY  LG   EV R+W +L+    K  N
Subjt:  ISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQ-DALGFNHLISLYTSLGCKDEVMRLW-ALQKKCKKQVN

Query:  RDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQ
          Y+ M+  LVKL  L  AE L KEW ++C  YD R+ NVL+  Y++ GLI++A ++ +     G      +W I    Y++  ++ +A +CM +AV++ 
Subjt:  RDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQ

Query:  EQNKG-WRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLET
        + + G W P P  + +++ +  +       +  L  LK     D   +  F+ L+ T
Subjt:  EQNKG-WRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLET

AT2G20710.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.3e-79; % identity: 36.07)
Query:  RISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLY
        R++  G P  S++ +LD W+ +G ++K  EL  I++ LR   R+  AL++S+WM    +   +  D A++LDLI +V GL  AEK+F ++  +     LY
Subjt:  RISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLY

Query:  GALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHIS
        GALLNCY  + ++ KA    Q+MKE+GF    L YN ++ LY+ TG+   V  +L EM++  V PD ++    + +Y   SD+ GM K L   E+   + 
Subjt:  GALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHIS

Query:  MDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDAL--GFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWV
        +DW TY+  AN +IKAG+ EKA+  LRK E  VN       +  L+S Y + G K+EV RLW+L K+     N  YI+++  L+K++ +EE EK+++EW 
Subjt:  MDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDAL--GFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWV

Query:  SSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYE
        +    +D R+P++L+ GY K+G++E+AE+++  ++   R+   ++W  +A GY     +EKA +  K A+ V +   GWRP   VL S + +L      E
Subjt:  SSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYE

Query:  ELKEFLSSLKTVPSMDGKLSNAFDELL
         L++ L  L    S  G +S  +D+LL
Subjt:  ELKEFLSSLKTVPSMDGKLSNAFDELL

AT3G15370.1 expansin 12 (e value: 7.5e-88; % identity: 67.79)
Query:  WLDAHATFYGADQNPTSLGGACGYDNTFHAGFGINTAAVSGALFRGGEACGACFLVICNYNVDPKWCLRRRAVAITATNFCPSNNNGGWCDPPRAHFDMS
        W+ AHAT+YG + +P SLGGACGYDN +HAGFG +TAA+SG LFR GE+CG C+ V C++  DPKWCLR  AV +TATNFCP+NNN GWC+ PR HFDMS
Subjt:  WLDAHATFYGADQNPTSLGGACGYDNTFHAGFGINTAAVSGALFRGGEACGACFLVICNYNVDPKWCLRRRAVAITATNFCPSNNNGGWCDPPRAHFDMS

Query:  SPAFLTIARQGNEGIVPVLYKRVSCRRKGGVRFTLRGQSNFNMVMISNVGGSGDVKAAWVKGSRTRTWMPMHRNWGANWQANVDLRNQRMSFKVTLIDGR
        SPAF  IAR+GNEGIVPV Y+RV C+R+GGVRFT+RGQ NFNMVMISNVGG G V++  V+GS+ +TW+ M RNWGANWQ++ DLR QR+SFKVTL D +
Subjt:  SPAFLTIARQGNEGIVPVLYKRVSCRRKGGVRFTLRGQSNFNMVMISNVGGSGDVKAAWVKGSRTRTWMPMHRNWGANWQANVDLRNQRMSFKVTLIDGR

Query:  TLEFVNVV
        T  F+NVV
Subjt:  TLEFVNVV

AT4G21705.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 2.2e-119 | %identity: 44.81
Query:  VALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLD
        +A R Y   R   +  L+++ISPLG P+ SV P L  W+Q G+ +   EL RIV DLR  +R+  ALEVS+WM   G+  F+  + AV LDLIGRV G  
Subjt:  VALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLD

Query:  SAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARS
        +AE+YF ++  Q +  K YGALLNCYVR+  V+K+L H +KMKEMGF +S L YN+IMCLY N GQ +KVP VL EMKE  V PDNYSYRICI+++GA  
Subjt:  SAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARS

Query:  DLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKV-NQDALGFNHLISLYTSLGCKDEVMRLWALQKK-CKKQVNRDYITMLG
        DL  +   L++ME +  I+MDW TY++ A F+I  G  ++A+  L+  E+++  +D  G+NHLI+LY  LG K EV+RLW L+K  CK+++N+DY+T+L 
Subjt:  DLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKV-NQDALGFNHLISLYTSLGCKDEVMRLWALQKK-CKKQVNRDYITMLG

Query:  CLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRP
         LVK++ L EAE+++ EW SS  CYDFRVPN ++ GY  + + E+AE ML+++   G+   P SW ++A  Y EK  LE AFKCMK A+ V+  ++ WRP
Subjt:  CLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRP

Query:  KPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNA------------FDELLETLKNN----DETTADALKESQPC
          ++++S+L W+ + G  +E++ F++SL+    ++ ++ +A             D LL+ +K++    DE T   L    PC
Subjt:  KPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNA------------FDELLETLKNN----DETTADALKESQPC


Sequences
CDS sequence
ATGGTAGCCATGATTTATGGGAATGGTGGGTTTGTTTGGGTTTTGTATGCGTTGTTTTTCATGCTGGTTGGTGGGTCCTTCGTCCATGGGGACGGTGGTGCTTGGCTTGA
TGCTCATGCAACATTCTATGGAGCTGATCAAAACCCTACTAGCCTTGGAGGAGCATGTGGTTACGACAATACATTTCATGCCGGATTTGGAATAAACACGGCGGCGGTGA
GCGGCGCACTTTTCAGAGGAGGAGAGGCTTGCGGCGCTTGCTTCCTAGTAATTTGCAACTACAACGTGGACCCCAAGTGGTGCCTCCGCCGCCGCGCCGTCGCCATCACC
GCCACAAACTTCTGTCCCTCCAATAACAACGGGGGCTGGTGCGACCCCCCTCGCGCGCACTTCGACATGTCGTCACCTGCCTTTCTTACCATTGCTCGCCAAGGCAACGA
AGGGATCGTCCCTGTCCTTTACAAGAGGGTAAGTTGTAGAAGGAAGGGAGGAGTTCGATTCACATTGAGAGGACAATCAAACTTCAATATGGTGATGATATCGAACGTCG
GTGGCAGCGGCGACGTCAAGGCTGCATGGGTTAAGGGGTCGAGGACGAGGACGTGGATGCCCATGCATCGTAATTGGGGTGCAAACTGGCAGGCCAATGTCGACCTTCGA
AACCAAAGAATGTCGTTTAAGGTTACTCTAATAGATGGGAGGACATTGGAGTTTGTCAATGTGGTGGCGCTCAAAACTGTCGAAACTCGGTTTTTTTTCCTGAAATTGGA
GGTTTCCGGAGGTCCGGCCTTGGCGGCGGCATTGGCAATGTTCAAAATCTTGAGGAGCTTTTCTTCAGGTTTCACGAGAACGGCAAGAACGGAGACAGATGCATTCTGTT
TTGTAGCGTTGAGATTATACAGCGCGAGACGAACCTGCAACCGAAGAAACCTCTTCGCCAGGATCAGTCCTCTCGGTTCTCCTGAGCTTAGTGTAGTTCCGATTCTTGAT
CAGTGGATTCAGGAAGGCAGGATGATCAAGGACTTTGAGCTGCGGAGAATCGTTCGCGACCTTCGTAATTGCCGGCGGTATGGCCAAGCCCTTGAGGTGTCTGAATGGAT
GCGTAGCAAGGGACTTTTTTCCTTTACAACTAGAGACTTTGCTGTACAGCTTGATCTAATCGGCCGAGTTCAGGGGCTCGATTCCGCAGAGAAGTATTTCAGCAGTGTTT
CTAACCAAGAGGAAATGGGTAAACTCTATGGTGCTCTTCTAAATTGTTATGTCAGGGAAGGCCTTGTAGATAAGGCCCTTTCCCATATGCAGAAGATGAAAGAGATGGGT
TTTGCTTCCTCTCCCCTCTGCTACAATGATATAATGTGTCTATATTTGAACACTGGCCAGGTCGATAAAGTTCCGAATGTACTTTCTGAAATGAAGGAGAATGGTGTTCT
TCCTGACAATTATAGCTATAGAATTTGCATCAGTTCTTATGGAGCTAGGTCTGATCTAATCGGTATGCTGAAGGTTTTGAAAGAAATGGAGAGTCAAACTCACATATCTA
TGGACTGGACTACTTATTCAATGGTTGCCAATTTTTTCATAAAGGCTGGTATGCACGAGAAAGCAATGAGTTACCTTCGGAAATGCGAGGACAAGGTCAATCAAGACGCT
CTCGGCTTCAATCATCTCATTTCACTTTACACCAGTCTGGGATGTAAAGACGAAGTAATGAGACTGTGGGCTCTCCAAAAGAAGTGCAAGAAGCAAGTCAATAGGGATTA
TATAACCATGTTGGGTTGTTTGGTTAAGCTTGAGTTTCTTGAGGAAGCTGAGAAATTGGTCAAGGAATGGGTGTCATCTTGCGAGTGTTATGATTTTCGGGTTCCGAATG
TTCTTCTCATCGGATACTCGAAAAGGGGGCTAATTGAAAGAGCTGAAAAGATGCTCCAAAACATCATCAGTGACGGGAGGATCCCACCACCAAATAGTTGGGGCATTATT
GCAGCAGGGTACTTGGAAAAGCAGAACCTGGAGAAAGCTTTCAAGTGCATGAAGGAAGCTGTGGCTGTACAAGAGCAAAACAAAGGGTGGAGGCCCAAACCTAGCGTCTT
ATCAAGCATACTGCGATGGCTATCTGAAAATGGAAGATATGAGGAGCTGAAAGAGTTTCTGAGCTCATTGAAGACTGTTCCTTCCATGGACGGAAAACTAAGTAATGCCT
TCGATGAGCTTCTGGAAACCTTAAAAAACAATGATGAAACAACGGCCGATGCTCTTAAGGAATCACAACCTTGTTTAGCTCAGCTAGATTAA
mRNA sequence
CTGTTTGTTTCTCGGGAAAATTTATAGGATGGTAGCCATGATTTATGGGAATGGTGGGTTTGTTTGGGTTTTGTATGCGTTGTTTTTCATGCTGGTTGGTGGGTCCTTCG
TCCATGGGGACGGTGGTGCTTGGCTTGATGCTCATGCAACATTCTATGGAGCTGATCAAAACCCTACTAGCCTTGGAGGAGCATGTGGTTACGACAATACATTTCATGCC
GGATTTGGAATAAACACGGCGGCGGTGAGCGGCGCACTTTTCAGAGGAGGAGAGGCTTGCGGCGCTTGCTTCCTAGTAATTTGCAACTACAACGTGGACCCCAAGTGGTG
CCTCCGCCGCCGCGCCGTCGCCATCACCGCCACAAACTTCTGTCCCTCCAATAACAACGGGGGCTGGTGCGACCCCCCTCGCGCGCACTTCGACATGTCGTCACCTGCCT
TTCTTACCATTGCTCGCCAAGGCAACGAAGGGATCGTCCCTGTCCTTTACAAGAGGGTAAGTTGTAGAAGGAAGGGAGGAGTTCGATTCACATTGAGAGGACAATCAAAC
TTCAATATGGTGATGATATCGAACGTCGGTGGCAGCGGCGACGTCAAGGCTGCATGGGTTAAGGGGTCGAGGACGAGGACGTGGATGCCCATGCATCGTAATTGGGGTGC
AAACTGGCAGGCCAATGTCGACCTTCGAAACCAAAGAATGTCGTTTAAGGTTACTCTAATAGATGGGAGGACATTGGAGTTTGTCAATGTGGTGGCGCTCAAAACTGTCG
AAACTCGGTTTTTTTTCCTGAAATTGGAGGTTTCCGGAGGTCCGGCCTTGGCGGCGGCATTGGCAATGTTCAAAATCTTGAGGAGCTTTTCTTCAGGTTTCACGAGAACG
GCAAGAACGGAGACAGATGCATTCTGTTTTGTAGCGTTGAGATTATACAGCGCGAGACGAACCTGCAACCGAAGAAACCTCTTCGCCAGGATCAGTCCTCTCGGTTCTCC
TGAGCTTAGTGTAGTTCCGATTCTTGATCAGTGGATTCAGGAAGGCAGGATGATCAAGGACTTTGAGCTGCGGAGAATCGTTCGCGACCTTCGTAATTGCCGGCGGTATG
GCCAAGCCCTTGAGGTGTCTGAATGGATGCGTAGCAAGGGACTTTTTTCCTTTACAACTAGAGACTTTGCTGTACAGCTTGATCTAATCGGCCGAGTTCAGGGGCTCGAT
TCCGCAGAGAAGTATTTCAGCAGTGTTTCTAACCAAGAGGAAATGGGTAAACTCTATGGTGCTCTTCTAAATTGTTATGTCAGGGAAGGCCTTGTAGATAAGGCCCTTTC
CCATATGCAGAAGATGAAAGAGATGGGTTTTGCTTCCTCTCCCCTCTGCTACAATGATATAATGTGTCTATATTTGAACACTGGCCAGGTCGATAAAGTTCCGAATGTAC
TTTCTGAAATGAAGGAGAATGGTGTTCTTCCTGACAATTATAGCTATAGAATTTGCATCAGTTCTTATGGAGCTAGGTCTGATCTAATCGGTATGCTGAAGGTTTTGAAA
GAAATGGAGAGTCAAACTCACATATCTATGGACTGGACTACTTATTCAATGGTTGCCAATTTTTTCATAAAGGCTGGTATGCACGAGAAAGCAATGAGTTACCTTCGGAA
ATGCGAGGACAAGGTCAATCAAGACGCTCTCGGCTTCAATCATCTCATTTCACTTTACACCAGTCTGGGATGTAAAGACGAAGTAATGAGACTGTGGGCTCTCCAAAAGA
AGTGCAAGAAGCAAGTCAATAGGGATTATATAACCATGTTGGGTTGTTTGGTTAAGCTTGAGTTTCTTGAGGAAGCTGAGAAATTGGTCAAGGAATGGGTGTCATCTTGC
GAGTGTTATGATTTTCGGGTTCCGAATGTTCTTCTCATCGGATACTCGAAAAGGGGGCTAATTGAAAGAGCTGAAAAGATGCTCCAAAACATCATCAGTGACGGGAGGAT
CCCACCACCAAATAGTTGGGGCATTATTGCAGCAGGGTACTTGGAAAAGCAGAACCTGGAGAAAGCTTTCAAGTGCATGAAGGAAGCTGTGGCTGTACAAGAGCAAAACA
AAGGGTGGAGGCCCAAACCTAGCGTCTTATCAAGCATACTGCGATGGCTATCTGAAAATGGAAGATATGAGGAGCTGAAAGAGTTTCTGAGCTCATTGAAGACTGTTCCT
TCCATGGACGGAAAACTAAGTAATGCCTTCGATGAGCTTCTGGAAACCTTAAAAAACAATGATGAAACAACGGCCGATGCTCTTAAGGAATCACAACCTTGTTTAGCTCA
GCTAGATTAATGGTATGGAGGTAGTATCTTATAGGTAGTATGTGTATATAATTTAAATTAGAACTCATAAAAGACCAGTTGTCAAGTGGGTTAATTCTTATTGAGTTGGT
CGGATTTAGATAATAGATCTGTATTTTAATGCCACATCATATTTTGTTACATAAAATATA
Protein sequence
MVAMIYGNGGFVWVLYALFFMLVGGSFVHGDGGAWLDAHATFYGADQNPTSLGGACGYDNTFHAGFGINTAAVSGALFRGGEACGACFLVICNYNVDPKWCLRRRAVAIT
ATNFCPSNNNGGWCDPPRAHFDMSSPAFLTIARQGNEGIVPVLYKRVSCRRKGGVRFTLRGQSNFNMVMISNVGGSGDVKAAWVKGSRTRTWMPMHRNWGANWQANVDLR
NQRMSFKVTLIDGRTLEFVNVVALKTVETRFFFLKLEVSGGPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILD
QWIQEGRMIKDFELRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEMGKLYGALLNCYVREGLVDKALSHMQKMKEMG
FASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEKAMSYLRKCEDKVNQDA
LGFNHLISLYTSLGCKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWVSSCECYDFRVPNVLLIGYSKRGLIERAEKMLQNIISDGRIPPPNSWGII
AAGYLEKQNLEKAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKNNDETTADALKESQPCLAQLD