; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G009740 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G009740
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionpentatricopeptide repeat-containing protein At4g21705, mitochondrial-like
Genome locationCmo_Chr04:4949678..4953724
RNA-Seq ExpressionCmoCh04G009740
SyntenyCmoCh04G009740
Gene Ontology termsGO:0009664 - plant-type cell wall organization (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0005739 - mitochondrion (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR002963 - Expansin
IPR007112 - Expansin/pollen allergen, DPBB domain
IPR007117 - Expansin, cellulose-binding-like domain
IPR007118 - Expansin/Lol pI
IPR009009 - RlpA-like protein, double-psi beta-barrel domain
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR036749 - Expansin, cellulose-binding-like domain superfamily
IPR036908 - RlpA-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600732.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia]1.1e-29098.04Show/hide
Query:  MSFKVSGAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRN
        +  +VSGAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRN
Subjt:  MSFKVSGAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRN

Query:  CRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMC
        CRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYF SVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMC
Subjt:  CRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMC

Query:  LYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGF
        LYLNTGQVDKVPNVLSEMK NGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHE+AMSYLRKCEDKVNQDALGF
Subjt:  LYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGF

Query:  NHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPP
        NHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLV+EWESSCECYDFRVPNVLLIGYSQ+GLIERAEKMLQNIISDGRIPP
Subjt:  NHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPP

Query:  PNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDETTADALK
        PNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLK VPSMDGKLSNAFDELLETLKNNDETTADALK
Subjt:  PNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDETTADALK

Query:  KSQPCLAQVD
        KSQPCLAQVD
Subjt:  KSQPCLAQVD

KAG7031372.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma]6.6e-28599.19Show/hide
Query:  MFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRS
        MFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRS
Subjt:  MFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRS

Query:  KGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLS
        KGLFSFTTRDFAVQLDLIGRVRGIDSAEKYF SVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLS
Subjt:  KGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLS

Query:  EMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSLGRKDEV
        EMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHE+AMSYLRKCEDKVNQDALGFNHLISLYTSLGRKDEV
Subjt:  EMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSLGRKDEV

Query:  MRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQN
        MRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQ+GLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQN
Subjt:  MRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQN

Query:  PERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDETTADALKKSQPCLAQ
        PERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLK VPSMDGKLSNAFDELLETLKNNDETTADALKKSQPCLAQ
Subjt:  PERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDETTADALKKSQPCLAQ

XP_022942045.1 pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like [Cucurbita moschata]6.0e-29499.22Show/hide
Query:  MSFKVSGAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRN
        +  +VSGAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRN
Subjt:  MSFKVSGAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRN

Query:  CRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMC
        CRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMC
Subjt:  CRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMC

Query:  LYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGF
        LYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGF
Subjt:  LYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGF

Query:  NHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPP
        NHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPP
Subjt:  NHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPP

Query:  PNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDETTADALK
        PNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDETTADALK
Subjt:  PNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDETTADALK

Query:  KSQPCLAQVD
        KSQPCLAQVD
Subjt:  KSQPCLAQVD

XP_022989754.1 pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like [Cucurbita maxima]6.0e-28694.97Show/hide
Query:  VDLRNQIMSFKVSGAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRR
        V+ R   +  +VSG PALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFE+RR
Subjt:  VDLRNQIMSFKVSGAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRR

Query:  IVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPL
        IVRDLRNCRRYGQAL+VSEWMRSKGLFSFTTRDFAVQLDLIGRV+G+DSAEKYFSSVSNQEE+GKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPL
Subjt:  IVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPL

Query:  CYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKV
        CYNDIMCLYLNTGQVDKVPNVLSEMK+NGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHE+AMSYLRKCEDKV
Subjt:  CYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKV

Query:  NQDALGFNHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNII
        NQDALGFNHLISLYTSLG KDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLV+EW SSCECYDFRVPNVLLIGYS+RGLIERAEKMLQNII
Subjt:  NQDALGFNHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNII

Query:  SDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDE
        SDGRIPPPNSWGIIAAGYLEKQN E+AFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLK VPSMDGKLSNAFDELLETLKNNDE
Subjt:  SDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDE

Query:  TTADALKKSQPCLAQVD
        TTADALK+SQPCLAQ+D
Subjt:  TTADALKKSQPCLAQVD

XP_023545982.1 pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like [Cucurbita pepo subsp. pepo]3.6e-29197.84Show/hide
Query:  MSFKVSGAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRN
        +  +VSGAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRN
Subjt:  MSFKVSGAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRN

Query:  CRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMC
        CRRYGQAL+VSEWMRSKGLFSFTTRDFAVQLDLIGRV+G+DSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMC
Subjt:  CRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMC

Query:  LYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGF
        LYLNTGQVDKVPNVLSEMK+NGVLPDNYSYRICISSYGARSDLIGMLKVL+EMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGF
Subjt:  LYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGF

Query:  NHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPP
        NHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLV+EWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPP
Subjt:  NHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPP

Query:  PNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDETTADALK
        PNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLK VPSMDGKLSNAFDELLETLKNNDETTADALK
Subjt:  PNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDETTADALK

Query:  KSQPCLAQVD
        KSQPCLAQVD
Subjt:  KSQPCLAQVD

TrEMBL top hitse value%identityAlignment
A0A0A0L7Y2 Uncharacterized protein5.4e-23280.82Show/hide
Query:  LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQV
        +AAA AMFKIL   SSG TRT R ETDAFCFVALRLYS RR+C+RRNL+ARISPLG PE +VVP+L+QWI+EGR IKDFE+RRIVRDLR CRRY QAL+V
Subjt:  LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQV

Query:  SEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDK
        SEWM SKGLFS TTRDFA+QLDLIG+VRG+DSAEKYF SVSNQ+EIGKLYGALLNCYVREGL+DK+L+HMQKMKEMG ASSPLCYNDIMCLYLNTGQ DK
Subjt:  SEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDK

Query:  VPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSL
        VPNVLSEMK+NGVLPDN+SYRICISSYGARSD+I M  VLKEME QTHISMDWTTYSMVA FFIKAGMH++AM+YLRKCEDKV++DALGFNHLIS YT+L
Subjt:  VPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSL

Query:  GRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAG
        G K+EVMRLWAL KK KKQ+NRDYITMLG LVKLE LEEAE LV EWESSC+CYDFRVPNV+LIGYSQ+GLIE+AEKML+NII +G IP PNSWGIIA+G
Subjt:  GRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAG

Query:  YLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDETTADALK
        YLEKQN E+AF+CMKEA+AV+ QNK WRPKP+VLSSILRWLSEN RYEE+KEF+SSLK VPSMD KL+NA DELLE + N+D  + D L+
Subjt:  YLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDETTADALK

A0A3Q7HHL7 Uncharacterized protein1.4e-22756.97Show/hide
Query:  WLDAHATFYGADQNPTSLGGACGYDNTFHAGFGINTAAVSGALFRGGEACGACFLVICNYNVDPKWCLRRRAVAITATNFCPSNNNGGWCDPPRAHFDMS
        WL+  ATFYG +Q+P++ GGACGYDN +HAGFG+NTAA+S ALFR G+ACGAC+ V CN  +D +WC+   AV +TATNFCP NN+GGWCD PR HFDMS
Subjt:  WLDAHATFYGADQNPTSLGGACGYDNTFHAGFGINTAAVSGALFRGGEACGACFLVICNYNVDPKWCLRRRAVAITATNFCPSNNNGGWCDPPRAHFDMS

Query:  SPAFLTIARQGNEGIVPVLYKRVSCRRKGGVRFTLRGQSNFNMVMISNVGGSGDIKAAWVKGSRTRTWMLMHRNWGANWQANVDLRNQIMSFKVS-----
         PAFL IARQGNEG+VPVLY RV+C+R GGVRFTL+GQSNFNMVMISNVGGSGDIK+  ++GSRT+TW+ M+RNWG NWQ+  DLR+Q +SF+++     
Subjt:  SPAFLTIARQGNEGIVPVLYKRVSCRRKGGVRFTLRGQSNFNMVMISNVGGSGDIKAAWVKGSRTRTWMLMHRNWGANWQANVDLRNQIMSFKVS-----

Query:  --GAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRY
              +++++    + RS S    + A     A+C     +YS+     R NLF+RISP+   +L ++P+LD+W+ EGR +  FE++RIVRDLR+ +R+
Subjt:  --GAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRY

Query:  GQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLN
         QALQVSEWM  +GL  F + D AV LDLIG V G ++AE YF++++++++  K  GAL NCY+RE LV+K+L+H QKMKE+G+A   L YN++MCLY +
Subjt:  GQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLN

Query:  TGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLI
        TGQ+++V  VLSEMK+NGV P+N+SYRICI+  G R+D  GM K+L+EMESQ  IS DW TYSM+AN +IKA   E+A+ +L+K ED++++D +G+NHLI
Subjt:  TGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLI

Query:  SLYTSLGRKDEVMRLWALQK-KCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNS
        S Y +LG K+E++RLW + K  CKKQ+NRDYITMLG LVKL  LE AE +++EWESSC  YDFRVPN+LLIGY Q  L+++AE ML +II  G+ P PNS
Subjt:  SLYTSLGRKDEVMRLWALQK-KCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNS

Query:  WGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAV
        W IIAAGYL   N E+AF+CMK+A+AV+ QN  WRPKP ++SSI +WL +     E++ FLSSLK V
Subjt:  WGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAV

A0A6J1CNQ2 pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like1.1e-23784.92Show/hide
Query:  LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQV
        +AAALAMFKIL S SS F RT RTETD+FC + LRLYS RR CNRRNLFARISPLG PELSVV ILDQWI+EGR IKDFE+RRIVRDLR+CRRYGQAL+V
Subjt:  LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQV

Query:  SEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDK
        SEWM SKGLF  TTRDFAVQLDLIGRVRG+DSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDK+L+HMQ+MK+MGFAS+PL YNDIMCLYLNTG VDK
Subjt:  SEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDK

Query:  VPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSL
        VPNVLSEMK+NGVLPDN+SYRICI+SYGARSDLI M KVLKEMESQ+HISMDWTTYSMVANFFIKA MHE+A++YLRKCEDKV++DALGFNHLISLYTSL
Subjt:  VPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSL

Query:  GRKDEVMRLWALQK-KCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAA
        G  DEVMRLWALQK KCKKQVNRDYITMLG LVKLE LEEAE L++EWESSC+CYDFRVPNVLLIGYSQ+GLIERAEKML++I  +GRIPPPNSW IIAA
Subjt:  GRKDEVMRLWALQK-KCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAA

Query:  GYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDE
        GYLEKQN E+AFKCM EA+AV+EQN GWRPKPSV+SSILRWLSENGRY ELKEFLSSLK VPSMDGKL NA DEL+ETL+N+ E
Subjt:  GYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDE

A0A6J1FVG8 pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like2.9e-29499.22Show/hide
Query:  MSFKVSGAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRN
        +  +VSGAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRN
Subjt:  MSFKVSGAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRN

Query:  CRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMC
        CRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMC
Subjt:  CRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMC

Query:  LYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGF
        LYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGF
Subjt:  LYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGF

Query:  NHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPP
        NHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPP
Subjt:  NHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPP

Query:  PNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDETTADALK
        PNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDETTADALK
Subjt:  PNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDETTADALK

Query:  KSQPCLAQVD
        KSQPCLAQVD
Subjt:  KSQPCLAQVD

A0A6J1JQ72 pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like2.9e-28694.97Show/hide
Query:  VDLRNQIMSFKVSGAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRR
        V+ R   +  +VSG PALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFE+RR
Subjt:  VDLRNQIMSFKVSGAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRR

Query:  IVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPL
        IVRDLRNCRRYGQAL+VSEWMRSKGLFSFTTRDFAVQLDLIGRV+G+DSAEKYFSSVSNQEE+GKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPL
Subjt:  IVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPL

Query:  CYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKV
        CYNDIMCLYLNTGQVDKVPNVLSEMK+NGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHE+AMSYLRKCEDKV
Subjt:  CYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKV

Query:  NQDALGFNHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNII
        NQDALGFNHLISLYTSLG KDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLV+EW SSCECYDFRVPNVLLIGYS+RGLIERAEKMLQNII
Subjt:  NQDALGFNHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNII

Query:  SDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDE
        SDGRIPPPNSWGIIAAGYLEKQN E+AFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLK VPSMDGKLSNAFDELLETLKNNDE
Subjt:  SDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDE

Query:  TTADALKKSQPCLAQVD
        TTADALK+SQPCLAQ+D
Subjt:  TTADALKKSQPCLAQVD

SwissProt top hitse value%identityAlignment
O22714 Pentatricopeptide repeat-containing protein At1g607708.1e-6832.17Show/hide
Query:  VALRLYSARRTCNRRN--------LFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDL
        +A+R  S  R   +R+        L+ R+   G  E+ V   L+Q+++  + +  +E+   ++ LRN   Y  AL++SE M  +G+ + T  D A+ LDL
Subjt:  VALRLYSARRTCNRRN--------LFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDL

Query:  IGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRIC
        + + R I + E YF  +    +    YG+LLNCY +E L +KA   + KMKE+    S + YN +M LY  TG+ +KVP ++ E+K   V+PD+Y+Y + 
Subjt:  IGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRIC

Query:  ISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQ-DALGFNHLISLYTSLGRKDEVMRLW-ALQKKCKKQVN
        + +  A +D+ G+ +V++EM     ++ DWTTYS +A+ ++ AG+ ++A   L++ E K  Q D   +  LI+LY  LG+  EV R+W +L+    K  N
Subjt:  ISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQ-DALGFNHLISLYTSLGRKDEVMRLW-ALQKKCKKQVN

Query:  RDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQ
          Y+ M+  LVKL  L  AE L +EW+++C  YD R+ NVL+  Y+Q GLI++A ++ +     G      +W I    Y++  +  RA +CM +AV++ 
Subjt:  RDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQ

Query:  EQNKG-WRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLET
        + + G W P P  + +++ +  +       +  L  LK     D   +  F+ L+ T
Subjt:  EQNKG-WRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLET

Q84JR3 Pentatricopeptide repeat-containing protein At4g21705, mitochondrial1.2e-11643.98Show/hide
Query:  VALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGID
        +A R Y   R   +  L+++ISPLG P+ SV P L  W+Q G+ +   E+ RIV DLR  +R+  AL+VS+WM   G+  F+  + AV LDLIGRV G  
Subjt:  VALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGID

Query:  SAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARS
        +AE+YF ++  Q +  K YGALLNCYVR+  V+K+L H +KMKEMGF +S L YN+IMCLY N GQ +KVP VL EMK+  V PDNYSYRICI+++GA  
Subjt:  SAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARS

Query:  DLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKV-NQDALGFNHLISLYTSLGRKDEVMRLWALQKK-CKKQVNRDYITMLG
        DL  +   L++ME +  I+MDW TY++ A F+I  G  ++A+  L+  E+++  +D  G+NHLI+LY  LG+K EV+RLW L+K  CK+++N+DY+T+L 
Subjt:  DLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKV-NQDALGFNHLISLYTSLGRKDEVMRLWALQKK-CKKQVNRDYITMLG

Query:  CLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRP
         LVK++ L EAE+++ EW+SS  CYDFRVPN ++ GY  + + E+AE ML+++   G+   P SW ++A  Y EK   E AFKCMK A+ V+  ++ WRP
Subjt:  CLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRP

Query:  KPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNA------------FDELLETLKNN----DETTADALKKSQPC
          ++++S+L W+ + G  +E++ F++SL+    ++ ++ +A             D LL+ +K++    DE T   L    PC
Subjt:  KPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNA------------FDELLETLKNN----DETTADALKKSQPC

Q8LPS6 Pentatricopeptide repeat-containing protein At1g021502.1e-7133.92Show/hide
Query:  YSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKG-LFSFTTRDFAVQLDLIGRVRGIDSAEK
        Y  R       ++ +IS +  PEL    +L+QW + GR +  +E+ R+V++LR  +R  QAL+V +WM ++G  F  +  D A+QLDLIG+VRGI  AE+
Subjt:  YSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKG-LFSFTTRDFAVQLDLIGRVRGIDSAEK

Query:  YFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIG
        +F  +    +  ++YG+LLN YVR    +KA + +  M++ G+A  PL +N +M LY+N  + DKV  ++ EMK   +  D YSY I +SS G+   +  
Subjt:  YFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIG

Query:  MLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKV-NQDALGFNHLISLYTSLGRKDEVMRLWALQKKCKKQV-NRDYITMLGCLVK
        M  V ++M+S   I  +WTT+S +A  +IK G  E+A   LRK E ++  ++ + +++L+SLY SLG K E+ R+W + K     + N  Y  ++  LV+
Subjt:  MLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKV-NQDALGFNHLISLYTSLGRKDEVMRLWALQKKCKKQV-NRDYITMLGCLVK

Query:  LEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSV
        +  +E AEK+ EEW      YD R+PN+L+  Y +   +E AE +  +++  G  P  ++W I+A G+  K+    A  C++ A +  E +  WRPK  +
Subjt:  LEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSV

Query:  LSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLET-LKNNDETTA
        LS   +   E       +  L  L+    ++ K   A  ++ E    NN E  A
Subjt:  LSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLET-LKNNDETTA

Q9LDJ3 Expansin-A122.2e-8167.18Show/hide
Query:  WLDAHATFYGADQNPTSLGGACGYDNTFHAGFGINTAAVSGALFRGGEACGACFLVICNYNVDPKWCLRRRAVAITATNFCPSNNNGGWCDPPRAHFDMS
        W+ AHAT+YG + +P SLGGACGYDN +HAGFG +TAA+SG LFR GE+CG C+ V C++  DPKWCLR  AV +TATNFCP+NNN GWC+ PR HFDMS
Subjt:  WLDAHATFYGADQNPTSLGGACGYDNTFHAGFGINTAAVSGALFRGGEACGACFLVICNYNVDPKWCLRRRAVAITATNFCPSNNNGGWCDPPRAHFDMS

Query:  SPAFLTIARQGNEGIVPVLYKRVSCRRKGGVRFTLRGQSNFNMVMISNVGGSGDIKAAWVKGSRTRTWMLMHRNWGANWQANVDLRNQIMSFKVS
        SPAF  IAR+GNEGIVPV Y+RV C+R+GGVRFT+RGQ NFNMVMISNVGG G +++  V+GS+ +TW+ M RNWGANWQ++ DLR Q +SFKV+
Subjt:  SPAFLTIARQGNEGIVPVLYKRVSCRRKGGVRFTLRGQSNFNMVMISNVGGSGDIKAAWVKGSRTRTWMLMHRNWGANWQANVDLRNQIMSFKVS

Q9SKU6 Pentatricopeptide repeat-containing protein At2g20710, mitochondrial1.3e-7835.83Show/hide
Query:  RISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLY
        R++  G P  S++ +LD W+ +G ++K  E+  I++ LR   R+  ALQ+S+WM    +   +  D A++LDLI +V G+  AEK+F ++  +     LY
Subjt:  RISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLY

Query:  GALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHIS
        GALLNCY  + ++ KA    Q+MKE+GF    L YN ++ LY+ TG+   V  +L EM+D  V PD ++    + +Y   SD+ GM K L   E+   + 
Subjt:  GALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHIS

Query:  MDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDAL--GFNHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWE
        +DW TY+  AN +IKAG+ E+A+  LRK E  VN       +  L+S Y + G+K+EV RLW+L K+     N  YI+++  L+K++ +EE EK++EEWE
Subjt:  MDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDAL--GFNHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWE

Query:  SSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYE
        +    +D R+P++L+ GY ++G++E+AE+++  ++   R+   ++W  +A GY      E+A +  K A+ V +   GWRP   VL S + +L      E
Subjt:  SSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYE

Query:  ELKEFLSSLKAVPSMDGKLSNAFDELL
         L++ L  L    S  G +S  +D+LL
Subjt:  ELKEFLSSLKAVPSMDGKLSNAFDELL

Arabidopsis top hitse value%identityAlignment
AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-7233.92Show/hide
Query:  YSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKG-LFSFTTRDFAVQLDLIGRVRGIDSAEK
        Y  R       ++ +IS +  PEL    +L+QW + GR +  +E+ R+V++LR  +R  QAL+V +WM ++G  F  +  D A+QLDLIG+VRGI  AE+
Subjt:  YSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKG-LFSFTTRDFAVQLDLIGRVRGIDSAEK

Query:  YFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIG
        +F  +    +  ++YG+LLN YVR    +KA + +  M++ G+A  PL +N +M LY+N  + DKV  ++ EMK   +  D YSY I +SS G+   +  
Subjt:  YFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIG

Query:  MLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKV-NQDALGFNHLISLYTSLGRKDEVMRLWALQKKCKKQV-NRDYITMLGCLVK
        M  V ++M+S   I  +WTT+S +A  +IK G  E+A   LRK E ++  ++ + +++L+SLY SLG K E+ R+W + K     + N  Y  ++  LV+
Subjt:  MLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKV-NQDALGFNHLISLYTSLGRKDEVMRLWALQKKCKKQV-NRDYITMLGCLVK

Query:  LEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSV
        +  +E AEK+ EEW      YD R+PN+L+  Y +   +E AE +  +++  G  P  ++W I+A G+  K+    A  C++ A +  E +  WRPK  +
Subjt:  LEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSV

Query:  LSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLET-LKNNDETTA
        LS   +   E       +  L  L+    ++ K   A  ++ E    NN E  A
Subjt:  LSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLET-LKNNDETTA

AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.8e-6932.17Show/hide
Query:  VALRLYSARRTCNRRN--------LFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDL
        +A+R  S  R   +R+        L+ R+   G  E+ V   L+Q+++  + +  +E+   ++ LRN   Y  AL++SE M  +G+ + T  D A+ LDL
Subjt:  VALRLYSARRTCNRRN--------LFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDL

Query:  IGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRIC
        + + R I + E YF  +    +    YG+LLNCY +E L +KA   + KMKE+    S + YN +M LY  TG+ +KVP ++ E+K   V+PD+Y+Y + 
Subjt:  IGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRIC

Query:  ISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQ-DALGFNHLISLYTSLGRKDEVMRLW-ALQKKCKKQVN
        + +  A +D+ G+ +V++EM     ++ DWTTYS +A+ ++ AG+ ++A   L++ E K  Q D   +  LI+LY  LG+  EV R+W +L+    K  N
Subjt:  ISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQ-DALGFNHLISLYTSLGRKDEVMRLW-ALQKKCKKQVN

Query:  RDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQ
          Y+ M+  LVKL  L  AE L +EW+++C  YD R+ NVL+  Y+Q GLI++A ++ +     G      +W I    Y++  +  RA +CM +AV++ 
Subjt:  RDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQ

Query:  EQNKG-WRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLET
        + + G W P P  + +++ +  +       +  L  LK     D   +  F+ L+ T
Subjt:  EQNKG-WRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLET

AT2G20710.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.5e-8035.83Show/hide
Query:  RISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLY
        R++  G P  S++ +LD W+ +G ++K  E+  I++ LR   R+  ALQ+S+WM    +   +  D A++LDLI +V G+  AEK+F ++  +     LY
Subjt:  RISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLY

Query:  GALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHIS
        GALLNCY  + ++ KA    Q+MKE+GF    L YN ++ LY+ TG+   V  +L EM+D  V PD ++    + +Y   SD+ GM K L   E+   + 
Subjt:  GALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHIS

Query:  MDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDAL--GFNHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWE
        +DW TY+  AN +IKAG+ E+A+  LRK E  VN       +  L+S Y + G+K+EV RLW+L K+     N  YI+++  L+K++ +EE EK++EEWE
Subjt:  MDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDAL--GFNHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWE

Query:  SSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYE
        +    +D R+P++L+ GY ++G++E+AE+++  ++   R+   ++W  +A GY      E+A +  K A+ V +   GWRP   VL S + +L      E
Subjt:  SSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYE

Query:  ELKEFLSSLKAVPSMDGKLSNAFDELL
         L++ L  L    S  G +S  +D+LL
Subjt:  ELKEFLSSLKAVPSMDGKLSNAFDELL

AT3G15370.1 expansin 121.6e-8267.18Show/hide
Query:  WLDAHATFYGADQNPTSLGGACGYDNTFHAGFGINTAAVSGALFRGGEACGACFLVICNYNVDPKWCLRRRAVAITATNFCPSNNNGGWCDPPRAHFDMS
        W+ AHAT+YG + +P SLGGACGYDN +HAGFG +TAA+SG LFR GE+CG C+ V C++  DPKWCLR  AV +TATNFCP+NNN GWC+ PR HFDMS
Subjt:  WLDAHATFYGADQNPTSLGGACGYDNTFHAGFGINTAAVSGALFRGGEACGACFLVICNYNVDPKWCLRRRAVAITATNFCPSNNNGGWCDPPRAHFDMS

Query:  SPAFLTIARQGNEGIVPVLYKRVSCRRKGGVRFTLRGQSNFNMVMISNVGGSGDIKAAWVKGSRTRTWMLMHRNWGANWQANVDLRNQIMSFKVS
        SPAF  IAR+GNEGIVPV Y+RV C+R+GGVRFT+RGQ NFNMVMISNVGG G +++  V+GS+ +TW+ M RNWGANWQ++ DLR Q +SFKV+
Subjt:  SPAFLTIARQGNEGIVPVLYKRVSCRRKGGVRFTLRGQSNFNMVMISNVGGSGDIKAAWVKGSRTRTWMLMHRNWGANWQANVDLRNQIMSFKVS

AT4G21705.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.8e-11843.98Show/hide
Query:  VALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGID
        +A R Y   R   +  L+++ISPLG P+ SV P L  W+Q G+ +   E+ RIV DLR  +R+  AL+VS+WM   G+  F+  + AV LDLIGRV G  
Subjt:  VALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGID

Query:  SAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARS
        +AE+YF ++  Q +  K YGALLNCYVR+  V+K+L H +KMKEMGF +S L YN+IMCLY N GQ +KVP VL EMK+  V PDNYSYRICI+++GA  
Subjt:  SAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARS

Query:  DLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKV-NQDALGFNHLISLYTSLGRKDEVMRLWALQKK-CKKQVNRDYITMLG
        DL  +   L++ME +  I+MDW TY++ A F+I  G  ++A+  L+  E+++  +D  G+NHLI+LY  LG+K EV+RLW L+K  CK+++N+DY+T+L 
Subjt:  DLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKV-NQDALGFNHLISLYTSLGRKDEVMRLWALQKK-CKKQVNRDYITMLG

Query:  CLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRP
         LVK++ L EAE+++ EW+SS  CYDFRVPN ++ GY  + + E+AE ML+++   G+   P SW ++A  Y EK   E AFKCMK A+ V+  ++ WRP
Subjt:  CLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRP

Query:  KPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNA------------FDELLETLKNN----DETTADALKKSQPC
          ++++S+L W+ + G  +E++ F++SL+    ++ ++ +A             D LL+ +K++    DE T   L    PC
Subjt:  KPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNA------------FDELLETLKNN----DETTADALKKSQPC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGTAGCCATGATTTATGGGAATGGTGGGTTTGTTTGGGTTTTGCATGCGTTGTTTTTCATGGTGGTCGGTGGGTCCTTCGTCCATGGGGACGGTGGTGCTTGGCT
TGATGCTCATGCAACTTTCTATGGAGCTGATCAAAACCCTACTAGCCTCGGAGGAGCATGTGGTTACGACAATACATTTCATGCCGGATTTGGAATAAACACGGCGGCGG
TGAGCGGCGCACTTTTCAGAGGAGGAGAGGCTTGCGGCGCTTGCTTCCTAGTAATTTGCAACTACAACGTGGACCCCAAGTGGTGCCTCCGCCGCCGCGCCGTCGCAATC
ACCGCCACGAACTTCTGCCCCTCCAATAACAACGGGGGCTGGTGCGACCCCCCTCGCGCGCACTTCGACATGTCGTCACCTGCCTTTCTTACCATTGCTCGGCAAGGCAA
CGAAGGGATCGTCCCTGTCCTTTACAAGAGGGTAAGTTGTAGAAGGAAGGGAGGAGTTCGATTCACATTGAGAGGACAATCAAACTTCAATATGGTAATGATATCGAACG
TCGGTGGCAGCGGCGACATAAAGGCTGCATGGGTTAAGGGGTCGAGGACGAGGACGTGGATGCTCATGCATCGTAATTGGGGCGCAAACTGGCAAGCCAACGTCGACCTT
CGAAACCAAATAATGTCGTTTAAGGTTTCCGGAGCTCCGGCCTTGGCGGCGGCATTGGCAATGTTCAAAATCTTGAGGAGCTTTTCTTCAGGTTTCACGAGAACGGCAAG
AACGGAGACAGATGCATTCTGTTTTGTAGCGTTGAGATTATACAGCGCGAGACGAACCTGCAACCGAAGAAACCTCTTCGCCAGGATCAGTCCTCTCGGTTCTCCTGAGC
TTAGTGTAGTTCCGATTCTTGATCAGTGGATTCAGGAAGGCAGGATGATCAAGGACTTTGAGATGCGGAGAATCGTTCGCGACCTTCGTAATTGCCGGCGGTATGGCCAA
GCCCTTCAGGTGTCTGAATGGATGCGTAGCAAGGGACTTTTCTCCTTTACAACTAGAGACTTCGCTGTACAGCTTGATCTGATCGGCCGAGTTCGGGGGATCGATTCTGC
AGAGAAGTATTTCAGCAGTGTTTCTAACCAAGAGGAAATTGGTAAACTCTATGGTGCTCTTCTAAATTGTTATGTCAGGGAAGGGCTTGTAGATAAGGCCCTTTCCCATA
TGCAGAAGATGAAAGAGATGGGTTTTGCTTCCTCTCCCCTCTGCTACAATGATATAATGTGTCTATATTTGAACACTGGCCAGGTCGATAAAGTTCCGAATGTACTTTCT
GAAATGAAGGATAATGGTGTTCTTCCTGACAATTATAGCTATAGAATTTGCATCAGCTCTTATGGAGCTAGGTCTGATCTAATCGGTATGCTGAAGGTTTTGAAAGAAAT
GGAGAGTCAAACTCACATATCTATGGACTGGACTACTTATTCAATGGTTGCCAATTTTTTCATAAAGGCTGGTATGCACGAGCAAGCAATGAGTTACCTTCGGAAATGCG
AGGACAAGGTCAACCAAGATGCTCTCGGCTTCAATCACCTCATTTCACTTTACACCAGTCTGGGACGTAAAGACGAAGTAATGAGACTGTGGGCTCTCCAAAAGAAGTGC
AAGAAGCAAGTCAATAGGGATTATATAACCATGTTGGGTTGTTTGGTTAAGCTTGAGTTTCTTGAGGAAGCTGAGAAATTGGTTGAGGAATGGGAGTCATCTTGCGAGTG
TTATGATTTTCGAGTTCCGAATGTTCTTCTCATTGGATACTCGCAAAGGGGGCTAATTGAAAGAGCAGAAAAGATGCTTCAAAACATCATCAGTGATGGGAGGATTCCAC
CACCCAATAGTTGGGGCATTATTGCAGCAGGGTACTTGGAGAAGCAGAACCCGGAGAGAGCTTTCAAGTGCATGAAGGAAGCTGTAGCTGTACAAGAGCAAAACAAAGGG
TGGAGGCCCAAACCTAGCGTCTTATCAAGCATACTGCGATGGCTATCTGAAAATGGAAGATATGAGGAGCTGAAAGAGTTTCTGAGCTCATTGAAGGCTGTTCCTTCCAT
GGACGGAAAACTAAGTAATGCCTTCGATGAGCTTCTGGAAACCTTGAAAAACAATGATGAAACAACGGCCGATGCTCTTAAGAAATCACAACCTTGTTTAGCTCAGGTAG
ATTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGGTAGCCATGATTTATGGGAATGGTGGGTTTGTTTGGGTTTTGCATGCGTTGTTTTTCATGGTGGTCGGTGGGTCCTTCGTCCATGGGGACGGTGGTGCTTGGCT
TGATGCTCATGCAACTTTCTATGGAGCTGATCAAAACCCTACTAGCCTCGGAGGAGCATGTGGTTACGACAATACATTTCATGCCGGATTTGGAATAAACACGGCGGCGG
TGAGCGGCGCACTTTTCAGAGGAGGAGAGGCTTGCGGCGCTTGCTTCCTAGTAATTTGCAACTACAACGTGGACCCCAAGTGGTGCCTCCGCCGCCGCGCCGTCGCAATC
ACCGCCACGAACTTCTGCCCCTCCAATAACAACGGGGGCTGGTGCGACCCCCCTCGCGCGCACTTCGACATGTCGTCACCTGCCTTTCTTACCATTGCTCGGCAAGGCAA
CGAAGGGATCGTCCCTGTCCTTTACAAGAGGGTAAGTTGTAGAAGGAAGGGAGGAGTTCGATTCACATTGAGAGGACAATCAAACTTCAATATGGTAATGATATCGAACG
TCGGTGGCAGCGGCGACATAAAGGCTGCATGGGTTAAGGGGTCGAGGACGAGGACGTGGATGCTCATGCATCGTAATTGGGGCGCAAACTGGCAAGCCAACGTCGACCTT
CGAAACCAAATAATGTCGTTTAAGGTTTCCGGAGCTCCGGCCTTGGCGGCGGCATTGGCAATGTTCAAAATCTTGAGGAGCTTTTCTTCAGGTTTCACGAGAACGGCAAG
AACGGAGACAGATGCATTCTGTTTTGTAGCGTTGAGATTATACAGCGCGAGACGAACCTGCAACCGAAGAAACCTCTTCGCCAGGATCAGTCCTCTCGGTTCTCCTGAGC
TTAGTGTAGTTCCGATTCTTGATCAGTGGATTCAGGAAGGCAGGATGATCAAGGACTTTGAGATGCGGAGAATCGTTCGCGACCTTCGTAATTGCCGGCGGTATGGCCAA
GCCCTTCAGGTGTCTGAATGGATGCGTAGCAAGGGACTTTTCTCCTTTACAACTAGAGACTTCGCTGTACAGCTTGATCTGATCGGCCGAGTTCGGGGGATCGATTCTGC
AGAGAAGTATTTCAGCAGTGTTTCTAACCAAGAGGAAATTGGTAAACTCTATGGTGCTCTTCTAAATTGTTATGTCAGGGAAGGGCTTGTAGATAAGGCCCTTTCCCATA
TGCAGAAGATGAAAGAGATGGGTTTTGCTTCCTCTCCCCTCTGCTACAATGATATAATGTGTCTATATTTGAACACTGGCCAGGTCGATAAAGTTCCGAATGTACTTTCT
GAAATGAAGGATAATGGTGTTCTTCCTGACAATTATAGCTATAGAATTTGCATCAGCTCTTATGGAGCTAGGTCTGATCTAATCGGTATGCTGAAGGTTTTGAAAGAAAT
GGAGAGTCAAACTCACATATCTATGGACTGGACTACTTATTCAATGGTTGCCAATTTTTTCATAAAGGCTGGTATGCACGAGCAAGCAATGAGTTACCTTCGGAAATGCG
AGGACAAGGTCAACCAAGATGCTCTCGGCTTCAATCACCTCATTTCACTTTACACCAGTCTGGGACGTAAAGACGAAGTAATGAGACTGTGGGCTCTCCAAAAGAAGTGC
AAGAAGCAAGTCAATAGGGATTATATAACCATGTTGGGTTGTTTGGTTAAGCTTGAGTTTCTTGAGGAAGCTGAGAAATTGGTTGAGGAATGGGAGTCATCTTGCGAGTG
TTATGATTTTCGAGTTCCGAATGTTCTTCTCATTGGATACTCGCAAAGGGGGCTAATTGAAAGAGCAGAAAAGATGCTTCAAAACATCATCAGTGATGGGAGGATTCCAC
CACCCAATAGTTGGGGCATTATTGCAGCAGGGTACTTGGAGAAGCAGAACCCGGAGAGAGCTTTCAAGTGCATGAAGGAAGCTGTAGCTGTACAAGAGCAAAACAAAGGG
TGGAGGCCCAAACCTAGCGTCTTATCAAGCATACTGCGATGGCTATCTGAAAATGGAAGATATGAGGAGCTGAAAGAGTTTCTGAGCTCATTGAAGGCTGTTCCTTCCAT
GGACGGAAAACTAAGTAATGCCTTCGATGAGCTTCTGGAAACCTTGAAAAACAATGATGAAACAACGGCCGATGCTCTTAAGAAATCACAACCTTGTTTAGCTCAGGTAG
ATTAA
Protein sequenceShow/hide protein sequence
MMVAMIYGNGGFVWVLHALFFMVVGGSFVHGDGGAWLDAHATFYGADQNPTSLGGACGYDNTFHAGFGINTAAVSGALFRGGEACGACFLVICNYNVDPKWCLRRRAVAI
TATNFCPSNNNGGWCDPPRAHFDMSSPAFLTIARQGNEGIVPVLYKRVSCRRKGGVRFTLRGQSNFNMVMISNVGGSGDIKAAWVKGSRTRTWMLMHRNWGANWQANVDL
RNQIMSFKVSGAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQ
ALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLS
EMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSLGRKDEVMRLWALQKKC
KKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKG
WRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKNNDETTADALKKSQPCLAQVD