; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi04G023200 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi04G023200
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr04:30394260..30408133
RNA-Seq ExpressionLsi04G023200
SyntenyLsi04G023200
Gene Ontology termsGO:0006364 - rRNA processing (biological process)
GO:0006508 - proteolysis (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR002036 - Endoribonuclease YbeY
IPR036412 - HAD-like superfamily
IPR033389 - AUX/IAA domain
IPR032867 - DYW domain
IPR023214 - HAD superfamily
IPR023091 - Metalloprotease catalytic domain superfamily, predicted
IPR020549 - Endoribonuclease YbeY, conserved site
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR006379 - HAD-superfamily hydrolase, subfamily IIB
IPR002885 - Pentatricopeptide repeat
IPR000270 - PB1 domain
IPR000150 - Cof family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038628.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0087.31Show/hide
Query:  SIAEMLRLNVPKALKLAFNNLKDSEYKTRDSSIGDVGLFDSVELSVLLCNDEFIRKLNKEWRDEDHATDVLSMSQHVPELKLPILMLGDIVISVETAARQ
        SIAEMLRLNVPKALK A NNLKDSEYKTR SSI DVGLFDSVELSVLLCND+FIRKLNKEWRDEDHATDVLSMSQHVPELKLPILMLGDIVISVETAARQ
Subjt:  SIAEMLRLNVPKALKLAFNNLKDSEYKTRDSSIGDVGLFDSVELSVLLCNDEFIRKLNKEWRDEDHATDVLSMSQHVPELKLPILMLGDIVISVETAARQ

Query:  AEERGHTLLDEVRILLECQNYVKEKKSSRIFYYKKFNEAKVHGLLHLVGFDHEVSEEAEVEMEKEEELLLKSLEWKGKGLIQSAIDAETNSNSAAVVSD-
        AEERGHTLLDE+RILL                        VHGLLHL+GFDHE+SEEAE EMEKEEELLLKSLEW+GKGLIQSAIDAETNSNSAAVVSD 
Subjt:  AEERGHTLLDEVRILLECQNYVKEKKSSRIFYYKKFNEAKVHGLLHLVGFDHEVSEEAEVEMEKEEELLLKSLEWKGKGLIQSAIDAETNSNSAAVVSD-

Query:  ----------------------------------------------GTLLNSKSQITPTTAKALKEVLSRGVKVVIATGKARPAVISILEKAGLAGKDGA
                                                      GTLLNSKSQITPTT +ALKEVLSRGVKVVIATGKARPAVISILEKAGLAGKDGA
Subjt:  ----------------------------------------------GTLLNSKSQITPTTAKALKEVLSRGVKVVIATGKARPAVISILEKAGLAGKDGA

Query:  VSEYSPGVFLQGLLVYGRQGKEIYRRNLDQDVCREACLYSLEKKVPLIAFGEGRCLTLFDHPLADALHTIYHEPKAEVMSSVENLLAAAEIQKLIFLDTA
        VSE+SPGVFLQGLLVYGRQGKEIYRRNLDQDVCR ACLYSLEKKVPLIAFGEGRCLTLFDHPLADALHT+YHEPKAEVMSSVENLLAAAEIQK++FLDTA
Subjt:  VSEYSPGVFLQGLLVYGRQGKEIYRRNLDQDVCREACLYSLEKKVPLIAFGEGRCLTLFDHPLADALHTIYHEPKAEVMSSVENLLAAAEIQKLIFLDTA

Query:  EGVSHTLRPYWSEASGEHASVVQAVPDMLEIVPRGTSKGSGVKMLLDHLGVSAEEVMAIGDGENDIEMLELASLGVALSNGSEKTKAVANLIGPGNDEDG
        EGVS  LRPYWSEA+GEHASVVQAVPDMLEIVPRGTSKGSGVKMLLDHLGVSAEEVMAIGDGENDIEMLELASLG+ALSNGSEKTKAVANLIGP NDEDG
Subjt:  EGVSHTLRPYWSEASGEHASVVQAVPDMLEIVPRGTSKGSGVKMLLDHLGVSAEEVMAIGDGENDIEMLELASLGVALSNGSEKTKAVANLIGPGNDEDG

Query:  VAE---------------------SPIFLSTLAVLHDYNHRKIIHLLLRCAAQLSMRQLYEIQAQVFASPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIF
        VAE                     SPI + TLAVLHDYNHRK IHLLLRCAAQLSMRQL+EIQAQ+ ASPIPSIDPNLIAVKFIGVSSSHGN+RHSVLIF
Subjt:  VAE---------------------SPIFLSTLAVLHDYNHRKIIHLLLRCAAQLSMRQLYEIQAQVFASPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIF

Query:  NHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIA
        NH+LSSPNIFAYNALLKAF+QHNAWH+TISYFNNQLVLP+APNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIA
Subjt:  NHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIA

Query:  QKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQM
        QKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYAR GNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQM
Subjt:  QKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQM

Query:  QHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFA
        QHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRR+K++VGLFLGNALADMYAKCGCILEAKGVF+EMQERDVISWSIIIMGLAMYGYA+EAFN FA
Subjt:  QHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFA

Query:  EMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGER
        EMIEDGL+PNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPK+EHYGC++DLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDA RGER
Subjt:  EMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGER

Query:  VVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIH
        VV  ILELDSNHSGSLVYLANVYASMGRLDDAA CRLRMRDNKSMKTPGCSWIEI+NSVYEFFMGD +HPQS +IYSMIREL WKMKVAGYKPKTDLVIH
Subjt:  VVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIH

Query:  NIDEEEKEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW
        NIDEEEKEDALSTHSEKLA+AFGLI+TSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFK+GKCSCNDYW
Subjt:  NIDEEEKEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW

KAG6591726.1 Endoribonuclease YBEY, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0079.23Show/hide
Query:  PELLDLISRDREWDLTRNGVGSSVSENNKLELRLGLPG-----EGSSCLNSNHENGEFPLSSAKPFPSCLNIRNNKRALYEKAEGMAGDEFQPHKLIFYE
        PELLDLIS DREW+L R GVGSSVSENNKLELRLGLPG     EGSSCLN +HE+ E  LSSAK FPSCLNIR+NKRAL         DEF+PHKL+F E
Subjt:  PELLDLISRDREWDLTRNGVGSSVSENNKLELRLGLPG-----EGSSCLNSNHENGEFPLSSAKPFPSCLNIRNNKRALYEKAEGMAGDEFQPHKLIFYE

Query:  KTAEKVSPHTPSLSASLPSSAFQREAQKLSQPSQSSYLKHLLMPQKLDLVSEEPSKPCSLKTADL-----KSCLNSGSVPAESSEPKHHDKRASVAAAVG
        KTAEKV+P    L ASLPSSAFQREAQKLSQPSQS YL+HLLM Q+LDLVSEEPSK CSLKTA+      KSCLN+GSVPAESSEPKHHDKRASVA+AVG
Subjt:  KTAEKVSPHTPSLSASLPSSAFQREAQKLSQPSQSSYLKHLLMPQKLDLVSEEPSKPCSLKTADL-----KSCLNSGSVPAESSEPKHHDKRASVAAAVG

Query:  WPPIRSFRKNFAAPRSSKPNSLELSKETVQDENGSKLSDCSKGQMFVKVCMDGVPIGRKLNLQAYNSYDQLSAGIDELFHCLLAAQRNYLAAEDGRKTEE
        WPPIRSFRKNFA PRSSKPNSL LSKE +QDENGSKLS CS GQMFVKVCMDG+PIGRKLNLQAYN YDQLSAGIDELFHCLLAAQRNYLAAEDGRK EE
Subjt:  WPPIRSFRKNFAAPRSSKPNSLELSKETVQDENGSKLSDCSKGQMFVKVCMDGVPIGRKLNLQAYNSYDQLSAGIDELFHCLLAAQRNYLAAEDGRKTEE

Query:  TTSVSESKHKNGLYTLVYYDNEGDRMLVGDVPWNMFVSTVKRLRVLKSSVVATEMKYESSSNAKRAQTGNKGNSANFPAPETKLQKKKGLWVGRLGRNQI
        TTS+S+SKH+NGLYTLVYYDNEGDRMLVGDVPWNMFVSTVKRLRVL+++VVATEMK   +          + +S+N                        
Subjt:  TTSVSESKHKNGLYTLVYYDNEGDRMLVGDVPWNMFVSTVKRLRVLKSSVVATEMKYESSSNAKRAQTGNKGNSANFPAPETKLQKKKGLWVGRLGRNQI

Query:  LPTILTNKASNKRSRRHNSSTLYLQFYMLLRASPLLRASHLPVPSGIHMALAAHLPFYPNCKSSTSWRSSPHLVSVCQENGYDRISAKITFSQFVRSFYV
           IL NKA N +S RH+S+TLYL  YMLLRASPLLRASH P  SG+HMALAAHLP  PN +S  SWRSS HL  VCQ NGYDRIS K+TFSQFVRSFYV
Subjt:  LPTILTNKASNKRSRRHNSSTLYLQFYMLLRASPLLRASHLPVPSGIHMALAAHLPFYPNCKSSTSWRSSPHLVSVCQENGYDRISAKITFSQFVRSFYV

Query:  LSGTYNKVQRPCGRISCGQREYRNVRRRAPKSKIKELELNVSICIEEELPDDPEILSIAEMLRLNVPKALKLAFNNLKDSEYKTRDSSIGDVGLFDSVEL
        LSGTYNKVQRPCGRI CGQREYR VRRR PK+K KELELNVSICIEE+LPDDPE+LSIAEMLRLNVPKALKLAFNNLKDS+YKTRDSSI DVGLF+SVE+
Subjt:  LSGTYNKVQRPCGRISCGQREYRNVRRRAPKSKIKELELNVSICIEEELPDDPEILSIAEMLRLNVPKALKLAFNNLKDSEYKTRDSSIGDVGLFDSVEL

Query:  SVLLCNDEFIRKLNKEWRDEDHATDVLSMSQHVPELKLPILMLGDIVISVETAARQAEERGHTLLDEVRILLECQNYVKEKKSSRIFYYKKFNEAKVHGL
        SVLLCNDEFI KLNKEWRDEDHATDVLSMSQHVPELKLPILMLGDIVISVETAARQAEERGHTLLDE+RILL                        VHGL
Subjt:  SVLLCNDEFIRKLNKEWRDEDHATDVLSMSQHVPELKLPILMLGDIVISVETAARQAEERGHTLLDEVRILLECQNYVKEKKSSRIFYYKKFNEAKVHGL

Query:  LHLVGFDHEVSEEAEVEMEKEEELLLKSLEWKGKGLIQSAIDAETNSNSAAVVS-----------------------DGTLLNSKSQITPTTAKALKEVL
        LHL+GFDHE+SEEAEVEMEKEEELLLKSLEWKGKGLIQSAIDAETNSNS A VS                       DGTLLNSKSQITPTT KALKE L
Subjt:  LHLVGFDHEVSEEAEVEMEKEEELLLKSLEWKGKGLIQSAIDAETNSNSAAVVS-----------------------DGTLLNSKSQITPTTAKALKEVL

Query:  SRGVKVVIATGKARPAVISILEKAGLAGKDGAVSEYSPGVFLQGLLVYGRQGKEIYRRNLDQDVCREACLYSLEKKVPLIAFGEGRCLTLFDHPLADALH
        SRGVKVVIATGKARPAVISILEKAGLAGKDGAVSE+SPGVFLQGLLVYGRQGKEIYRRNLDQD+CREACLYSLEK VPLIAFGEGRCLTLFDHPL DALH
Subjt:  SRGVKVVIATGKARPAVISILEKAGLAGKDGAVSEYSPGVFLQGLLVYGRQGKEIYRRNLDQDVCREACLYSLEKKVPLIAFGEGRCLTLFDHPLADALH

Query:  TIYHEPKAEVMSSVENLLAAAEIQKLIFLDTAEGVSHTLRPYWSEASGEHASVVQAVPDMLEIVPRGTSKGSGVKMLLDHLGVSAEEVMAIGDGENDIEM
        TIYHEPKAEVMSSV+NLLAAAEIQKL+FLDTAEGVS  LRPYWSEASGEHASVVQAVPDMLEIVP GTSKGSGVK+LLDHLGVS+EEVMAIGDGENDIEM
Subjt:  TIYHEPKAEVMSSVENLLAAAEIQKLIFLDTAEGVSHTLRPYWSEASGEHASVVQAVPDMLEIVPRGTSKGSGVKMLLDHLGVSAEEVMAIGDGENDIEM

Query:  LELASLGVALSNGSEKTKAVANLIGPGNDEDGVAE
        LELASLG+A+SNG+EKTKAVANLIGP NDEDGVAE
Subjt:  LELASLGVALSNGSEKTKAVANLIGPGNDEDGVAE

KAG7031735.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0093.92Show/hide
Query:  PIFLSTLAVLHDYNHRKIIHLLLRCAAQLSMRQLYEIQAQVFASPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAW
        PI L TLAVLH+YNHR+IIHLLLRCAAQLSMRQL+EIQAQ+ ASPIPSIDPN++AVKFIGVSSSHGN+RHSVL+FNHYL+SPNIFAYNALLKAFAQHNAW
Subjt:  PIFLSTLAVLHDYNHRKIIHLLLRCAAQLSMRQLYEIQAQVFASPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAW

Query:  HSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCFS
        H TI+YFNNQ+V+PDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCN IAQKLFDEM VRDVVSWNTLISGYCFS
Subjt:  HSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCFS

Query:  GMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGA
        GMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEAR LFENMP+RNVVSWNAMIAGYAQNEKYADAIELFRQMQHEG  APNDVTLVSVLSACAHLGA
Subjt:  GMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGA

Query:  LDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIE-DGLKPNDISFMGLLTACTHA
        LDLGKWIHKFIRRSKM+VGLFLGNALADMYAKCGCILEAK VF+EMQERDV SWSIIIMGLAMYGYADEAF CFA+MIE DGLKPNDISFMGLLTACTHA
Subjt:  LDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIE-DGLKPNDISFMGLLTACTHA

Query:  GLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYA
        GLVDKGLEYFDMMA+VYGITPKVEHYGC+VDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGE  VRHILELDSNHSGSLVYLANVYA
Subjt:  GLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYA

Query:  SMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGL
        SMGRLDDAA CRL+MRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMK+AGYKPKTDLVIHNIDEEEKEDALSTHSEKLA+AFGL
Subjt:  SMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGL

Query:  INTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW
        INTSEGTTIRIVKNLRVC+DCH A+KIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW
Subjt:  INTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW

TYK31227.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0089.12Show/hide
Query:  SIAEMLRLNVPKALKLAFNNLKDSEYKTRDSSIGDVGLFDSVELSVLLCNDEFIRKLNKEWRDEDHATDVLSMSQHVPELKLPILMLGDIVISVETAARQ
        SIAEMLRLNVPKALK A NNLKDSEYKTR SSI DVGLFDSVELSVLLCND+FIRKLNKEWRDEDHATDVLSMSQHVPELKLPILMLGDIVISVETAARQ
Subjt:  SIAEMLRLNVPKALKLAFNNLKDSEYKTRDSSIGDVGLFDSVELSVLLCNDEFIRKLNKEWRDEDHATDVLSMSQHVPELKLPILMLGDIVISVETAARQ

Query:  AEERGHTLLDEVRILLECQNYVKEKKSSRIFYYKKFNEAKVHGLLHLVGFDHEVSEEAEVEMEKEEELLLKSLEWKGKGLIQSAIDAETNSNSAAVVS--
        AEERGHTLLDE+RILL                        VHGLLHL+GFDHE+SEEAE EMEKEEELLLKSLEW+GKGLIQSAIDAETNSNSAAVVS  
Subjt:  AEERGHTLLDEVRILLECQNYVKEKKSSRIFYYKKFNEAKVHGLLHLVGFDHEVSEEAEVEMEKEEELLLKSLEWKGKGLIQSAIDAETNSNSAAVVS--

Query:  ---------------------DGTLLNSKSQITPTTAKALKEVLSRGVKVVIATGKARPAVISILEKAGLAGKDGAVSEYSPGVFLQGLLVYGRQGKEIY
                             DGTLLNSKSQITPTT +ALKEVLSRGVKVVIATGKARPAVISILEKAGLAGKDGAVSE+SPGVFLQGLLVYGRQGKEIY
Subjt:  ---------------------DGTLLNSKSQITPTTAKALKEVLSRGVKVVIATGKARPAVISILEKAGLAGKDGAVSEYSPGVFLQGLLVYGRQGKEIY

Query:  RRNLDQDVCREACLYSLEKKVPLIAFGEGRCLTLFDHPLADALHTIYHEPKAEVMSSVENLLAAAEIQKLIFLDTAEGVSHTLRPYWSEASGEHASVVQA
        RRNLDQDVCR ACLYSLEKKVPLIAFGEGRCLTLFDHPLADALHT+YHEPKAEVMSSVENLLAAAEIQK++FLDTAEGVS  LRPYWSEA+GEHASVVQA
Subjt:  RRNLDQDVCREACLYSLEKKVPLIAFGEGRCLTLFDHPLADALHTIYHEPKAEVMSSVENLLAAAEIQKLIFLDTAEGVSHTLRPYWSEASGEHASVVQA

Query:  VPDMLEIVPRGTSKGSGVKMLLDHLGVSAEEVMAIGDGENDIEMLELASLGVALSNGSEKTKAVANLIGPGNDEDGVAE---------------------
        VPDMLEIVPRGTSKGSGVKMLLDHLGVSAEEVMAIGDGENDIEMLELASLG+ALSNGSEKTKAVANLIGP NDEDGVAE                     
Subjt:  VPDMLEIVPRGTSKGSGVKMLLDHLGVSAEEVMAIGDGENDIEMLELASLGVALSNGSEKTKAVANLIGPGNDEDGVAE---------------------

Query:  SPIFLSTLAVLHDYNHRKIIHLLLRCAAQLSMRQLYEIQAQVFASPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNA
        SPI + TLAVLHDYNHRK IHLLLRCAAQLSMRQL+EIQAQ+ ASPIPSIDPNLIAVKFIGVSSSHGN+RHSVLIFNH+LSSPNIFAYNALLKAF+QHNA
Subjt:  SPIFLSTLAVLHDYNHRKIIHLLLRCAAQLSMRQLYEIQAQVFASPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNA

Query:  WHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCF
        WH+TISYFNNQLVLP+APNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCF
Subjt:  WHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCF

Query:  SGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLG
        SGMVDKARMVFDGMMEKNLVSWSTMISGYAR GNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLG
Subjt:  SGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLG

Query:  ALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHA
        ALDLGKWIHKFIRR+K++VGLFLGNALADMYAKCGCILEAKGVF+EMQERDVISWSIIIMGLAMYGYA+EAFN FAEMIEDGL+PNDISFMGLLTACTHA
Subjt:  ALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHA

Query:  GLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYA
        GLVDKGLEYFDMMAQVYGITPK+EHYGC++DLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDA RGERVV  ILELDSNHSGSLVYLANVYA
Subjt:  GLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYA

Query:  SMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGL
        SMGRLDDAA CRLRMRDNKSMKTPGCSWIEI+NSVYEFFMGD +HPQS +IYSMIREL WKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLA+AFGL
Subjt:  SMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGL

Query:  INTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW
        I+TSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFK+GKCSCNDYW
Subjt:  INTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW

XP_011652887.1 pentatricopeptide repeat-containing protein At3g62890 [Cucumis sativus]0.0e+0093.92Show/hide
Query:  SPIFLSTLAVLHDYNHRKIIHLLLRCAAQLSMRQLYEIQAQVFASPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNA
        SP+ + TLAVLHDYNHRK IHLLLRCA QLSMRQL+EIQAQ+ ASPIPSIDPN+IAVKFIGVSSSHGN+RHSVLIFNH+LS PNIFAYNALLKAF+QHNA
Subjt:  SPIFLSTLAVLHDYNHRKIIHLLLRCAAQLSMRQLYEIQAQVFASPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNA

Query:  WHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCF
        WH+TISYFNNQLVLP+APNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCF
Subjt:  WHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCF

Query:  SGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLG
        SGMVDKARMVFDGMMEKNLVSWSTMISGYAR+GNLEEARQLFENMP+RNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLG
Subjt:  SGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLG

Query:  ALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHA
        ALDLGKWIH+FIRR+K++VGLFLGNALADMYAKCGC+LEAKGVF+EM ERDVISWSIIIMGLAMYGYA+EAFN FAEMIEDGL+PNDISFMGLLTACTHA
Subjt:  ALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHA

Query:  GLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYA
        GLVDKGLEYFDMM QVYGITPK+EHYGC+VDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVV  ILELDSNHSGSLVYLANVYA
Subjt:  GLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYA

Query:  SMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGL
        SMGRLDDAA+CRLRMRDNKSMKTPGCSWIEI+NSVYEFFMGD SHPQS +IYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLA+AFGL
Subjt:  SMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGL

Query:  INTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW
        INTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFK+GKCSCNDYW
Subjt:  INTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW

TrEMBL top hitse value%identityAlignment
A0A0A0LE51 DYW_deaminase domain-containing protein0.0e+0093.92Show/hide
Query:  SPIFLSTLAVLHDYNHRKIIHLLLRCAAQLSMRQLYEIQAQVFASPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNA
        SP+ + TLAVLHDYNHRK IHLLLRCA QLSMRQL+EIQAQ+ ASPIPSIDPN+IAVKFIGVSSSHGN+RHSVLIFNH+LS PNIFAYNALLKAF+QHNA
Subjt:  SPIFLSTLAVLHDYNHRKIIHLLLRCAAQLSMRQLYEIQAQVFASPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNA

Query:  WHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCF
        WH+TISYFNNQLVLP+APNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCF
Subjt:  WHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCF

Query:  SGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLG
        SGMVDKARMVFDGMMEKNLVSWSTMISGYAR+GNLEEARQLFENMP+RNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLG
Subjt:  SGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLG

Query:  ALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHA
        ALDLGKWIH+FIRR+K++VGLFLGNALADMYAKCGC+LEAKGVF+EM ERDVISWSIIIMGLAMYGYA+EAFN FAEMIEDGL+PNDISFMGLLTACTHA
Subjt:  ALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHA

Query:  GLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYA
        GLVDKGLEYFDMM QVYGITPK+EHYGC+VDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVV  ILELDSNHSGSLVYLANVYA
Subjt:  GLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYA

Query:  SMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGL
        SMGRLDDAA+CRLRMRDNKSMKTPGCSWIEI+NSVYEFFMGD SHPQS +IYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLA+AFGL
Subjt:  SMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGL

Query:  INTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW
        INTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFK+GKCSCNDYW
Subjt:  INTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW

A0A1S4E5V0 pentatricopeptide repeat-containing protein At3g62890-like0.0e+0094.9Show/hide
Query:  MRQLYEIQAQVFASPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAPNPDEYTFTSVLKA
        MRQL+EIQAQ+ ASPIPSIDPNLIAVKFIGVSSSHGN+RHSVLIFNH+LSSPNIFAYNALLKAF+QHNAWH+TISYFNNQLVLP+APNPDEYTFTSVLKA
Subjt:  MRQLYEIQAQVFASPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAPNPDEYTFTSVLKA

Query:  CAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYAR
        CAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYAR
Subjt:  CAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYAR

Query:  MGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADMY
         GNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRR+K++VGLFLGNALADMY
Subjt:  MGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADMY

Query:  AKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVD
        AKCGCILEAKGVF+EMQERDVISWSIIIMGLAMYGYA+EAFN FAEMIEDGL+PNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPK+EHYGC++D
Subjt:  AKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVD

Query:  LLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEI
        LLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDA RGERVV  ILELDSNHSGSLVYLANVYASMGRLDDAA CRLRMRDNKSMKTPGCSWIEI
Subjt:  LLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEI

Query:  DNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKI
        +NSVYEFFMGD +HPQS +IYSMIREL WKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLA+AFGLI+TSEGTTIRIVKNLRVCNDCHDAIKIISKI
Subjt:  DNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKI

Query:  VEREIVVRDRSRFHHFKNGKCSCNDYW
        VEREIVVRDRSRFHHFK+GKCSCNDYW
Subjt:  VEREIVVRDRSRFHHFKNGKCSCNDYW

A0A5A7T6Z7 Pentatricopeptide repeat-containing protein0.0e+0087.31Show/hide
Query:  SIAEMLRLNVPKALKLAFNNLKDSEYKTRDSSIGDVGLFDSVELSVLLCNDEFIRKLNKEWRDEDHATDVLSMSQHVPELKLPILMLGDIVISVETAARQ
        SIAEMLRLNVPKALK A NNLKDSEYKTR SSI DVGLFDSVELSVLLCND+FIRKLNKEWRDEDHATDVLSMSQHVPELKLPILMLGDIVISVETAARQ
Subjt:  SIAEMLRLNVPKALKLAFNNLKDSEYKTRDSSIGDVGLFDSVELSVLLCNDEFIRKLNKEWRDEDHATDVLSMSQHVPELKLPILMLGDIVISVETAARQ

Query:  AEERGHTLLDEVRILLECQNYVKEKKSSRIFYYKKFNEAKVHGLLHLVGFDHEVSEEAEVEMEKEEELLLKSLEWKGKGLIQSAIDAETNSNSAAVVSD-
        AEERGHTLLDE+RILL                        VHGLLHL+GFDHE+SEEAE EMEKEEELLLKSLEW+GKGLIQSAIDAETNSNSAAVVSD 
Subjt:  AEERGHTLLDEVRILLECQNYVKEKKSSRIFYYKKFNEAKVHGLLHLVGFDHEVSEEAEVEMEKEEELLLKSLEWKGKGLIQSAIDAETNSNSAAVVSD-

Query:  ----------------------------------------------GTLLNSKSQITPTTAKALKEVLSRGVKVVIATGKARPAVISILEKAGLAGKDGA
                                                      GTLLNSKSQITPTT +ALKEVLSRGVKVVIATGKARPAVISILEKAGLAGKDGA
Subjt:  ----------------------------------------------GTLLNSKSQITPTTAKALKEVLSRGVKVVIATGKARPAVISILEKAGLAGKDGA

Query:  VSEYSPGVFLQGLLVYGRQGKEIYRRNLDQDVCREACLYSLEKKVPLIAFGEGRCLTLFDHPLADALHTIYHEPKAEVMSSVENLLAAAEIQKLIFLDTA
        VSE+SPGVFLQGLLVYGRQGKEIYRRNLDQDVCR ACLYSLEKKVPLIAFGEGRCLTLFDHPLADALHT+YHEPKAEVMSSVENLLAAAEIQK++FLDTA
Subjt:  VSEYSPGVFLQGLLVYGRQGKEIYRRNLDQDVCREACLYSLEKKVPLIAFGEGRCLTLFDHPLADALHTIYHEPKAEVMSSVENLLAAAEIQKLIFLDTA

Query:  EGVSHTLRPYWSEASGEHASVVQAVPDMLEIVPRGTSKGSGVKMLLDHLGVSAEEVMAIGDGENDIEMLELASLGVALSNGSEKTKAVANLIGPGNDEDG
        EGVS  LRPYWSEA+GEHASVVQAVPDMLEIVPRGTSKGSGVKMLLDHLGVSAEEVMAIGDGENDIEMLELASLG+ALSNGSEKTKAVANLIGP NDEDG
Subjt:  EGVSHTLRPYWSEASGEHASVVQAVPDMLEIVPRGTSKGSGVKMLLDHLGVSAEEVMAIGDGENDIEMLELASLGVALSNGSEKTKAVANLIGPGNDEDG

Query:  VAE---------------------SPIFLSTLAVLHDYNHRKIIHLLLRCAAQLSMRQLYEIQAQVFASPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIF
        VAE                     SPI + TLAVLHDYNHRK IHLLLRCAAQLSMRQL+EIQAQ+ ASPIPSIDPNLIAVKFIGVSSSHGN+RHSVLIF
Subjt:  VAE---------------------SPIFLSTLAVLHDYNHRKIIHLLLRCAAQLSMRQLYEIQAQVFASPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIF

Query:  NHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIA
        NH+LSSPNIFAYNALLKAF+QHNAWH+TISYFNNQLVLP+APNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIA
Subjt:  NHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIA

Query:  QKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQM
        QKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYAR GNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQM
Subjt:  QKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQM

Query:  QHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFA
        QHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRR+K++VGLFLGNALADMYAKCGCILEAKGVF+EMQERDVISWSIIIMGLAMYGYA+EAFN FA
Subjt:  QHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFA

Query:  EMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGER
        EMIEDGL+PNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPK+EHYGC++DLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDA RGER
Subjt:  EMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGER

Query:  VVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIH
        VV  ILELDSNHSGSLVYLANVYASMGRLDDAA CRLRMRDNKSMKTPGCSWIEI+NSVYEFFMGD +HPQS +IYSMIREL WKMKVAGYKPKTDLVIH
Subjt:  VVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIH

Query:  NIDEEEKEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW
        NIDEEEKEDALSTHSEKLA+AFGLI+TSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFK+GKCSCNDYW
Subjt:  NIDEEEKEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW

A0A5D3E550 Pentatricopeptide repeat-containing protein0.0e+0089.12Show/hide
Query:  SIAEMLRLNVPKALKLAFNNLKDSEYKTRDSSIGDVGLFDSVELSVLLCNDEFIRKLNKEWRDEDHATDVLSMSQHVPELKLPILMLGDIVISVETAARQ
        SIAEMLRLNVPKALK A NNLKDSEYKTR SSI DVGLFDSVELSVLLCND+FIRKLNKEWRDEDHATDVLSMSQHVPELKLPILMLGDIVISVETAARQ
Subjt:  SIAEMLRLNVPKALKLAFNNLKDSEYKTRDSSIGDVGLFDSVELSVLLCNDEFIRKLNKEWRDEDHATDVLSMSQHVPELKLPILMLGDIVISVETAARQ

Query:  AEERGHTLLDEVRILLECQNYVKEKKSSRIFYYKKFNEAKVHGLLHLVGFDHEVSEEAEVEMEKEEELLLKSLEWKGKGLIQSAIDAETNSNSAAVVS--
        AEERGHTLLDE+RILL                        VHGLLHL+GFDHE+SEEAE EMEKEEELLLKSLEW+GKGLIQSAIDAETNSNSAAVVS  
Subjt:  AEERGHTLLDEVRILLECQNYVKEKKSSRIFYYKKFNEAKVHGLLHLVGFDHEVSEEAEVEMEKEEELLLKSLEWKGKGLIQSAIDAETNSNSAAVVS--

Query:  ---------------------DGTLLNSKSQITPTTAKALKEVLSRGVKVVIATGKARPAVISILEKAGLAGKDGAVSEYSPGVFLQGLLVYGRQGKEIY
                             DGTLLNSKSQITPTT +ALKEVLSRGVKVVIATGKARPAVISILEKAGLAGKDGAVSE+SPGVFLQGLLVYGRQGKEIY
Subjt:  ---------------------DGTLLNSKSQITPTTAKALKEVLSRGVKVVIATGKARPAVISILEKAGLAGKDGAVSEYSPGVFLQGLLVYGRQGKEIY

Query:  RRNLDQDVCREACLYSLEKKVPLIAFGEGRCLTLFDHPLADALHTIYHEPKAEVMSSVENLLAAAEIQKLIFLDTAEGVSHTLRPYWSEASGEHASVVQA
        RRNLDQDVCR ACLYSLEKKVPLIAFGEGRCLTLFDHPLADALHT+YHEPKAEVMSSVENLLAAAEIQK++FLDTAEGVS  LRPYWSEA+GEHASVVQA
Subjt:  RRNLDQDVCREACLYSLEKKVPLIAFGEGRCLTLFDHPLADALHTIYHEPKAEVMSSVENLLAAAEIQKLIFLDTAEGVSHTLRPYWSEASGEHASVVQA

Query:  VPDMLEIVPRGTSKGSGVKMLLDHLGVSAEEVMAIGDGENDIEMLELASLGVALSNGSEKTKAVANLIGPGNDEDGVAE---------------------
        VPDMLEIVPRGTSKGSGVKMLLDHLGVSAEEVMAIGDGENDIEMLELASLG+ALSNGSEKTKAVANLIGP NDEDGVAE                     
Subjt:  VPDMLEIVPRGTSKGSGVKMLLDHLGVSAEEVMAIGDGENDIEMLELASLGVALSNGSEKTKAVANLIGPGNDEDGVAE---------------------

Query:  SPIFLSTLAVLHDYNHRKIIHLLLRCAAQLSMRQLYEIQAQVFASPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNA
        SPI + TLAVLHDYNHRK IHLLLRCAAQLSMRQL+EIQAQ+ ASPIPSIDPNLIAVKFIGVSSSHGN+RHSVLIFNH+LSSPNIFAYNALLKAF+QHNA
Subjt:  SPIFLSTLAVLHDYNHRKIIHLLLRCAAQLSMRQLYEIQAQVFASPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNA

Query:  WHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCF
        WH+TISYFNNQLVLP+APNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCF
Subjt:  WHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCF

Query:  SGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLG
        SGMVDKARMVFDGMMEKNLVSWSTMISGYAR GNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLG
Subjt:  SGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLG

Query:  ALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHA
        ALDLGKWIHKFIRR+K++VGLFLGNALADMYAKCGCILEAKGVF+EMQERDVISWSIIIMGLAMYGYA+EAFN FAEMIEDGL+PNDISFMGLLTACTHA
Subjt:  ALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHA

Query:  GLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYA
        GLVDKGLEYFDMMAQVYGITPK+EHYGC++DLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDA RGERVV  ILELDSNHSGSLVYLANVYA
Subjt:  GLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYA

Query:  SMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGL
        SMGRLDDAA CRLRMRDNKSMKTPGCSWIEI+NSVYEFFMGD +HPQS +IYSMIREL WKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLA+AFGL
Subjt:  SMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGL

Query:  INTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW
        I+TSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFK+GKCSCNDYW
Subjt:  INTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW

A0A6J1FIW1 pentatricopeptide repeat-containing protein At3g62890-like0.0e+0093.79Show/hide
Query:  MRQLYEIQAQVFASPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAPNPDEYTFTSVLKA
        MRQL+EIQAQ+ ASPIPSIDPN++AVKFIGVSSSHGN+RHSVL+FNHYL+SPNIFAYNALLKAFAQHNAWH TI+YFNNQ+V+PDAPNPDEYTFTSVLKA
Subjt:  MRQLYEIQAQVFASPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAPNPDEYTFTSVLKA

Query:  CAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYAR
        CAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCN IAQKLFDEM VRDVVSWNTLISGYCFSGMVDKARMVFDGMM KNLVSWSTMISGYAR
Subjt:  CAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYAR

Query:  MGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADMY
        MGNLEEAR LFENMP+RNVVSWNAMIAGYAQNEKYADAIELFRQMQHEG  APNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKM+VGLFLGNALADMY
Subjt:  MGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADMY

Query:  AKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIE-DGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIV
        AKCGCILEAK VF+EMQERDV SWSIIIMGLAMYGYADEAF CFA+MIE DGLKPNDISFMGLLTACTHAGLVDKGL+YFDMMA+VYGITPKVEHYGC+V
Subjt:  AKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIE-DGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIV

Query:  DLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIE
        DLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGE  VRHILELDS+HSGSLVYLAN+YASMGRLDDAA CRL+MRDNKSMKTPGCSWIE
Subjt:  DLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIE

Query:  IDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISK
        IDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMK+AGYKPKTDLVIHNIDEEEKEDALSTHSEKLA+AFGLINTSEGTTIRIVKNLRVC+DCHDA+KIISK
Subjt:  IDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISK

Query:  IVEREIVVRDRSRFHHFKNGKCSCNDYW
        IVEREIVVRDRSRFHHFKNGKCSCNDYW
Subjt:  IVEREIVVRDRSRFHHFKNGKCSCNDYW

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic8.1e-14037.55Show/hide
Query:  IHLLLRCAAQLSMRQLYEIQAQVFASPIPSIDPNLIAVKF-IGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAP
        I L+ RC   +S+RQL +    +  +   S DP   +  F +   SS  ++ ++  +F+  +  PN FA+N L++A+A       +I  F + +V     
Subjt:  IHLLLRCAAQLSMRQLYEIQAQVFASPIPSIDPNLIAVKF-IGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAP

Query:  NPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCFSGMVDK------------
         P++YTF  ++KA A ++ +  GQ +H    K    S++FV NSL+  YF  G    A K+F  +  +DVVSWN++I+G+   G  DK            
Subjt:  NPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCFSGMVDK------------

Query:  ----------------------------------------------------------ARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIR
                                                                  A+ +FD M EK+ V+W+TM+ GYA   + E AR++  +MP +
Subjt:  ----------------------------------------------------------ARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIR

Query:  NVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQ
        ++V+WNA+I+ Y QN K  +A+ +F ++Q +  +  N +TLVS LSACA +GAL+LG+WIH +I++  + +   + +AL  MY+KCG + +++ VFN ++
Subjt:  NVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQ

Query:  ERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINS
        +RDV  WS +I GLAM+G  +EA + F +M E  +KPN ++F  +  AC+H GLVD+    F  M   YGI P+ +HY CIVD+L R+G L++A   I +
Subjt:  ERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINS

Query:  MPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQS
        MP+ P+  VWGALLG C+I+ +    E     +LEL+  + G+ V L+N+YA +G+ ++ +  R  MR     K PGCS IEID  ++EF  GD +HP S
Subjt:  MPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQS

Query:  AKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEE-KEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHF
         K+Y  + E+  K+K  GY+P+   V+  I+EEE KE +L+ HSEKLAI +GLI+T     IR++KNLRVC DCH   K+IS++ +REI+VRDR RFHHF
Subjt:  AKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEE-KEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHF

Query:  KNGKCSCNDYW
        +NG+CSCND+W
Subjt:  KNGKCSCNDYW

Q8L5Z4 Endoribonuclease YBEY, chloroplastic7.2e-19761.6Show/hide
Query:  MLLRASPLLRASHLPVPSGIHMALAAHLPFYPNCKSSTSWRSSPHLVSVCQENGYDRISAKITFSQFVRSFYVLS---GTYNKVQRPCGRISCGQREYRN
        ML R  P LR + +       M  A  L   PN      + SSP    V   N  D  S++ + S F RSF+ L    G + K+ +P G++   +REYR 
Subjt:  MLLRASPLLRASHLPVPSGIHMALAAHLPFYPNCKSSTSWRSSPHLVSVCQENGYDRISAKITFSQFVRSFYVLS---GTYNKVQRPCGRISCGQREYRN

Query:  VRRRAPKSKIKELELNVSICIEEELPDDPEILSIAEMLRLNVPKALKLAFNNLKDSEYKTRDSSIGDVGLFDSVELSVLLCNDEFIRKLNKEWRDEDHAT
        +RRRAPK K +ELEL+VSICIEE+LPDD EI +IAEMLRLNVP A+ LAFN LKDS+YKTR++ I D+G +++VELSV+LCND+FI KLNKEWR EDHAT
Subjt:  VRRRAPKSKIKELELNVSICIEEELPDDPEILSIAEMLRLNVPKALKLAFNNLKDSEYKTRDSSIGDVGLFDSVELSVLLCNDEFIRKLNKEWRDEDHAT

Query:  DVLSMSQHVPELKLPILMLGDIVISVETAARQAEERGHTLLDEVRILLECQNYVKEKKSSRIFYYKKFNEAKVHGLLHLVGFDHEVSEEAEVEMEKEEEL
        DVLSMSQHVPELKLP+LM+GD+VISVETAARQA ERGHTLLDE+RIL+                        +HGLLHL+GFDHE+S+EAE EME+EEEL
Subjt:  DVLSMSQHVPELKLPILMLGDIVISVETAARQAEERGHTLLDEVRILLECQNYVKEKKSSRIFYYKKFNEAKVHGLLHLVGFDHEVSEEAEVEMEKEEEL

Query:  LLKSLEWKGKGLIQSAIDAETNSNSAAVVS-----------------------DGTLLNSKSQITPTTAKALKEVLSRGVKVVIATGKARPAVISILEKA
        LLK+L WKGKGLIQSA D +  +      S                       DGTLLNSKSQI+   AKALKE L RG+KVVIATGK+RP  I IL+ A
Subjt:  LLKSLEWKGKGLIQSAIDAETNSNSAAVVS-----------------------DGTLLNSKSQITPTTAKALKEVLSRGVKVVIATGKARPAVISILEKA

Query:  GLAGKDGAVSEYSPGVFLQGLLVYGRQGKEIYRRNLDQDVCREACLYSLEKKVPLIAFGEGRCLTLFDHPLADALHTIYHEPKAEVMSSVENLLAAAEIQ
         L G DG +SE SPGVF+QGLLVYGRQGKE+YR NLD+DVCRE CLYSLE ++PLIAF + RCLTLFDHPL D+LHTIY+EPKAE++SSV+ L+A A+IQ
Subjt:  GLAGKDGAVSEYSPGVFLQGLLVYGRQGKEIYRRNLDQDVCREACLYSLEKKVPLIAFGEGRCLTLFDHPLADALHTIYHEPKAEVMSSVENLLAAAEIQ

Query:  KLIFLDTAEGVSHTLRPYWSEASGEHASVVQAVPDMLEIVPRGTSKGSGVKMLLDHLGVSAEEVMAIGDGENDIEMLELASLGVALSNGSEKTKAVANLI
        K+IF+DT EGVS  +RPYWSEA+G+ A+VVQA  DMLEIVP GTSKG+GVKMLL+HLGVS +E+MAIGDGENDIEML+LASLGVALSNG+EKTKAVA++I
Subjt:  KLIFLDTAEGVSHTLRPYWSEASGEHASVVQAVPDMLEIVPRGTSKGSGVKMLLDHLGVSAEEVMAIGDGENDIEMLELASLGVALSNGSEKTKAVANLI

Query:  GPGNDEDGVAES
        G  ND+DGVA++
Subjt:  GPGNDEDGVAES

Q9FI80 Pentatricopeptide repeat-containing protein At5g489101.2e-14643.19Show/hide
Query:  HGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISY-FNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLV
        H ++ ++  IFN  +   N F++N +++ F++ +   + I+     +++  +   P+ +TF SVLKACA   ++ EG+++H    KYG   + FV ++LV
Subjt:  HGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISY-FNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLV

Query:  DLYFKVGCNCIAQKLFDEMVV-RDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWNAMIAGYAQN
         +Y   G    A+ LF + ++ +D+V             M D+ +   DG     +V W+ MI GY R+G+ + AR LF+ M  R+VVSWN MI+GY+ N
Subjt:  DLYFKVGCNCIAQKLFDEMVV-RDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWNAMIAGYAQN

Query:  EKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLA
          + DA+E+FR+M+ +G + PN VTLVSVL A + LG+L+LG+W+H +   S + +   LG+AL DMY+KCG I +A  VF  +   +VI+WS +I G A
Subjt:  EKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLA

Query:  MYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLG
        ++G A +A +CF +M + G++P+D++++ LLTAC+H GLV++G  YF  M  V G+ P++EHYGC+VDLL R+G LD+AE  I +MP++P+ ++W ALLG
Subjt:  MYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLG

Query:  GCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMK
         CR+  + E G+RV   ++++  + SG+ V L+N+YAS G   + +  RLRM++    K PGCS I+ID  ++EF + D SHP++ +I SM+ E+  K++
Subjt:  GCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMK

Query:  VAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW
        +AGY+P T  V+ N++EE+KE+ L  HSEK+A AFGLI+TS G  IRIVKNLR+C DCH +IK+ISK+ +R+I VRDR RFHHF++G CSC DYW
Subjt:  VAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic7.5e-14639.47Show/hide
Query:  VKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESN
        ++F  +S     + +++ +F   +  PN+  +N + +  A  +   S +  +   + L   PN   YTF  VLK+CA      EGQ++H  V K GC+ +
Subjt:  VKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESN

Query:  LFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQ----------------
        L+V  SL+ +Y + G    A K+FD+   RDVVS+  LI GY   G ++ A+ +FD +  K++VSW+ MISGYA  GN +EA +                
Subjt:  LFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQ----------------

Query:  ------------------------------------------------------LFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPND
                                                              LFE +P ++V+SWN +I GY     Y +A+ LF++M   G   PND
Subjt:  ------------------------------------------------------LFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPND

Query:  VTLVSVLSACAHLGALDLGKWIHKFIRRSKMDV--GLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLK
        VT++S+L ACAHLGA+D+G+WIH +I +    V     L  +L DMYAKCG I  A  VFN +  + + SW+ +I G AM+G AD +F+ F+ M + G++
Subjt:  VTLVSVLSACAHLGALDLGKWIHKFIRRSKMDV--GLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLK

Query:  PNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILEL
        P+DI+F+GLL+AC+H+G++D G   F  M Q Y +TPK+EHYGC++DLL  +G   +AE +IN M M+P+ ++W +LL  C+++ + E GE    +++++
Subjt:  PNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILEL

Query:  DSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKE
        +  + GS V L+N+YAS GR ++ A  R  + D    K PGCS IEID+ V+EF +GD  HP++ +IY M+ E++  ++ AG+ P T  V+  ++EE KE
Subjt:  DSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKE

Query:  DALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW
         AL  HSEKLAIAFGLI+T  GT + IVKNLRVC +CH+A K+ISKI +REI+ RDR+RFHHF++G CSCNDYW
Subjt:  DALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW

Q9SR82 Putative pentatricopeptide repeat-containing protein At3g088204.7e-14041.88Show/hide
Query:  RHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFK
        ++S L+F+H    PNIF YN+L+  F  ++ +H T+  F +  +         +TF  VLKAC   +    G  +H  V K G   ++    SL+ +Y  
Subjt:  RHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFK

Query:  VGCNCIAQKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGM---------------------------------------MEKNLVSWSTMISGYA
         G    A KLFDE+  R VV+W  L SGY  SG   +A  +F  M                                       M+KN    +T+++ YA
Subjt:  VGCNCIAQKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGM---------------------------------------MEKNLVSWSTMISGYA

Query:  RMGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADM
        + G +E+AR +F++M  +++V+W+ MI GYA N    + IELF QM  E  L P+  ++V  LS+CA LGALDLG+W    I R +    LF+ NAL DM
Subjt:  RMGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADM

Query:  YAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIV
        YAKCG +     VF EM+E+D++  +  I GLA  G+   +F  F +  + G+ P+  +F+GLL  C HAGL+  GL +F+ ++ VY +   VEHYGC+V
Subjt:  YAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIV

Query:  DLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIE
        DL  RAG LD A  LI  MPM+PN IVWGALL GCR+ KD +  E V++ ++ L+  ++G+ V L+N+Y+  GR D+AA  R  M      K PG SWIE
Subjt:  DLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIE

Query:  IDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISK
        ++  V+EF   D SHP S KIY+ + +L  +M++ G+ P T+ V  +++EEEKE  L  HSEKLA+A GLI+T  G  IR+VKNLRVC DCH+ +K+ISK
Subjt:  IDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISK

Query:  IVEREIVVRDRSRFHHFKNGKCSCNDYW
        I  REIVVRD +RFH F NG CSCNDYW
Subjt:  IVEREIVVRDRSRFHHFKNGKCSCNDYW

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.4e-14739.47Show/hide
Query:  VKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESN
        ++F  +S     + +++ +F   +  PN+  +N + +  A  +   S +  +   + L   PN   YTF  VLK+CA      EGQ++H  V K GC+ +
Subjt:  VKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESN

Query:  LFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQ----------------
        L+V  SL+ +Y + G    A K+FD+   RDVVS+  LI GY   G ++ A+ +FD +  K++VSW+ MISGYA  GN +EA +                
Subjt:  LFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQ----------------

Query:  ------------------------------------------------------LFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPND
                                                              LFE +P ++V+SWN +I GY     Y +A+ LF++M   G   PND
Subjt:  ------------------------------------------------------LFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPND

Query:  VTLVSVLSACAHLGALDLGKWIHKFIRRSKMDV--GLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLK
        VT++S+L ACAHLGA+D+G+WIH +I +    V     L  +L DMYAKCG I  A  VFN +  + + SW+ +I G AM+G AD +F+ F+ M + G++
Subjt:  VTLVSVLSACAHLGALDLGKWIHKFIRRSKMDV--GLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLK

Query:  PNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILEL
        P+DI+F+GLL+AC+H+G++D G   F  M Q Y +TPK+EHYGC++DLL  +G   +AE +IN M M+P+ ++W +LL  C+++ + E GE    +++++
Subjt:  PNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILEL

Query:  DSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKE
        +  + GS V L+N+YAS GR ++ A  R  + D    K PGCS IEID+ V+EF +GD  HP++ +IY M+ E++  ++ AG+ P T  V+  ++EE KE
Subjt:  DSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKE

Query:  DALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW
         AL  HSEKLAIAFGLI+T  GT + IVKNLRVC +CH+A K+ISKI +REI+ RDR+RFHHF++G CSCNDYW
Subjt:  DALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW

AT2G25870.1 haloacid dehalogenase-like hydrolase family protein5.1e-19861.6Show/hide
Query:  MLLRASPLLRASHLPVPSGIHMALAAHLPFYPNCKSSTSWRSSPHLVSVCQENGYDRISAKITFSQFVRSFYVLS---GTYNKVQRPCGRISCGQREYRN
        ML R  P LR + +       M  A  L   PN      + SSP    V   N  D  S++ + S F RSF+ L    G + K+ +P G++   +REYR 
Subjt:  MLLRASPLLRASHLPVPSGIHMALAAHLPFYPNCKSSTSWRSSPHLVSVCQENGYDRISAKITFSQFVRSFYVLS---GTYNKVQRPCGRISCGQREYRN

Query:  VRRRAPKSKIKELELNVSICIEEELPDDPEILSIAEMLRLNVPKALKLAFNNLKDSEYKTRDSSIGDVGLFDSVELSVLLCNDEFIRKLNKEWRDEDHAT
        +RRRAPK K +ELEL+VSICIEE+LPDD EI +IAEMLRLNVP A+ LAFN LKDS+YKTR++ I D+G +++VELSV+LCND+FI KLNKEWR EDHAT
Subjt:  VRRRAPKSKIKELELNVSICIEEELPDDPEILSIAEMLRLNVPKALKLAFNNLKDSEYKTRDSSIGDVGLFDSVELSVLLCNDEFIRKLNKEWRDEDHAT

Query:  DVLSMSQHVPELKLPILMLGDIVISVETAARQAEERGHTLLDEVRILLECQNYVKEKKSSRIFYYKKFNEAKVHGLLHLVGFDHEVSEEAEVEMEKEEEL
        DVLSMSQHVPELKLP+LM+GD+VISVETAARQA ERGHTLLDE+RIL+                        +HGLLHL+GFDHE+S+EAE EME+EEEL
Subjt:  DVLSMSQHVPELKLPILMLGDIVISVETAARQAEERGHTLLDEVRILLECQNYVKEKKSSRIFYYKKFNEAKVHGLLHLVGFDHEVSEEAEVEMEKEEEL

Query:  LLKSLEWKGKGLIQSAIDAETNSNSAAVVS-----------------------DGTLLNSKSQITPTTAKALKEVLSRGVKVVIATGKARPAVISILEKA
        LLK+L WKGKGLIQSA D +  +      S                       DGTLLNSKSQI+   AKALKE L RG+KVVIATGK+RP  I IL+ A
Subjt:  LLKSLEWKGKGLIQSAIDAETNSNSAAVVS-----------------------DGTLLNSKSQITPTTAKALKEVLSRGVKVVIATGKARPAVISILEKA

Query:  GLAGKDGAVSEYSPGVFLQGLLVYGRQGKEIYRRNLDQDVCREACLYSLEKKVPLIAFGEGRCLTLFDHPLADALHTIYHEPKAEVMSSVENLLAAAEIQ
         L G DG +SE SPGVF+QGLLVYGRQGKE+YR NLD+DVCRE CLYSLE ++PLIAF + RCLTLFDHPL D+LHTIY+EPKAE++SSV+ L+A A+IQ
Subjt:  GLAGKDGAVSEYSPGVFLQGLLVYGRQGKEIYRRNLDQDVCREACLYSLEKKVPLIAFGEGRCLTLFDHPLADALHTIYHEPKAEVMSSVENLLAAAEIQ

Query:  KLIFLDTAEGVSHTLRPYWSEASGEHASVVQAVPDMLEIVPRGTSKGSGVKMLLDHLGVSAEEVMAIGDGENDIEMLELASLGVALSNGSEKTKAVANLI
        K+IF+DT EGVS  +RPYWSEA+G+ A+VVQA  DMLEIVP GTSKG+GVKMLL+HLGVS +E+MAIGDGENDIEML+LASLGVALSNG+EKTKAVA++I
Subjt:  KLIFLDTAEGVSHTLRPYWSEASGEHASVVQAVPDMLEIVPRGTSKGSGVKMLLDHLGVSAEEVMAIGDGENDIEMLELASLGVALSNGSEKTKAVANLI

Query:  GPGNDEDGVAES
        G  ND+DGVA++
Subjt:  GPGNDEDGVAES

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.7e-14137.55Show/hide
Query:  IHLLLRCAAQLSMRQLYEIQAQVFASPIPSIDPNLIAVKF-IGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAP
        I L+ RC   +S+RQL +    +  +   S DP   +  F +   SS  ++ ++  +F+  +  PN FA+N L++A+A       +I  F + +V     
Subjt:  IHLLLRCAAQLSMRQLYEIQAQVFASPIPSIDPNLIAVKF-IGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAP

Query:  NPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCFSGMVDK------------
         P++YTF  ++KA A ++ +  GQ +H    K    S++FV NSL+  YF  G    A K+F  +  +DVVSWN++I+G+   G  DK            
Subjt:  NPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCFSGMVDK------------

Query:  ----------------------------------------------------------ARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIR
                                                                  A+ +FD M EK+ V+W+TM+ GYA   + E AR++  +MP +
Subjt:  ----------------------------------------------------------ARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIR

Query:  NVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQ
        ++V+WNA+I+ Y QN K  +A+ +F ++Q +  +  N +TLVS LSACA +GAL+LG+WIH +I++  + +   + +AL  MY+KCG + +++ VFN ++
Subjt:  NVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQ

Query:  ERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINS
        +RDV  WS +I GLAM+G  +EA + F +M E  +KPN ++F  +  AC+H GLVD+    F  M   YGI P+ +HY CIVD+L R+G L++A   I +
Subjt:  ERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINS

Query:  MPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQS
        MP+ P+  VWGALLG C+I+ +    E     +LEL+  + G+ V L+N+YA +G+ ++ +  R  MR     K PGCS IEID  ++EF  GD +HP S
Subjt:  MPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQS

Query:  AKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEE-KEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHF
         K+Y  + E+  K+K  GY+P+   V+  I+EEE KE +L+ HSEKLAI +GLI+T     IR++KNLRVC DCH   K+IS++ +REI+VRDR RFHHF
Subjt:  AKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEE-KEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHF

Query:  KNGKCSCNDYW
        +NG+CSCND+W
Subjt:  KNGKCSCNDYW

AT3G08820.1 Pentatricopeptide repeat (PPR) superfamily protein3.4e-14141.88Show/hide
Query:  RHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFK
        ++S L+F+H    PNIF YN+L+  F  ++ +H T+  F +  +         +TF  VLKAC   +    G  +H  V K G   ++    SL+ +Y  
Subjt:  RHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLVDLYFK

Query:  VGCNCIAQKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGM---------------------------------------MEKNLVSWSTMISGYA
         G    A KLFDE+  R VV+W  L SGY  SG   +A  +F  M                                       M+KN    +T+++ YA
Subjt:  VGCNCIAQKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGM---------------------------------------MEKNLVSWSTMISGYA

Query:  RMGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADM
        + G +E+AR +F++M  +++V+W+ MI GYA N    + IELF QM  E  L P+  ++V  LS+CA LGALDLG+W    I R +    LF+ NAL DM
Subjt:  RMGNLEEARQLFENMPIRNVVSWNAMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADM

Query:  YAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIV
        YAKCG +     VF EM+E+D++  +  I GLA  G+   +F  F +  + G+ P+  +F+GLL  C HAGL+  GL +F+ ++ VY +   VEHYGC+V
Subjt:  YAKCGCILEAKGVFNEMQERDVISWSIIIMGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIV

Query:  DLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIE
        DL  RAG LD A  LI  MPM+PN IVWGALL GCR+ KD +  E V++ ++ L+  ++G+ V L+N+Y+  GR D+AA  R  M      K PG SWIE
Subjt:  DLLSRAGRLDQAESLINSMPMQPNVIVWGALLGGCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIE

Query:  IDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISK
        ++  V+EF   D SHP S KIY+ + +L  +M++ G+ P T+ V  +++EEEKE  L  HSEKLA+A GLI+T  G  IR+VKNLRVC DCH+ +K+ISK
Subjt:  IDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISK

Query:  IVEREIVVRDRSRFHHFKNGKCSCNDYW
        I  REIVVRD +RFH F NG CSCNDYW
Subjt:  IVEREIVVRDRSRFHHFKNGKCSCNDYW

AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein8.3e-14843.19Show/hide
Query:  HGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISY-FNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLV
        H ++ ++  IFN  +   N F++N +++ F++ +   + I+     +++  +   P+ +TF SVLKACA   ++ EG+++H    KYG   + FV ++LV
Subjt:  HGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISY-FNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVTKYGCESNLFVRNSLV

Query:  DLYFKVGCNCIAQKLFDEMVV-RDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWNAMIAGYAQN
         +Y   G    A+ LF + ++ +D+V             M D+ +   DG     +V W+ MI GY R+G+ + AR LF+ M  R+VVSWN MI+GY+ N
Subjt:  DLYFKVGCNCIAQKLFDEMVV-RDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWNAMIAGYAQN

Query:  EKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLA
          + DA+E+FR+M+ +G + PN VTLVSVL A + LG+L+LG+W+H +   S + +   LG+AL DMY+KCG I +A  VF  +   +VI+WS +I G A
Subjt:  EKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIIIMGLA

Query:  MYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLG
        ++G A +A +CF +M + G++P+D++++ LLTAC+H GLV++G  YF  M  V G+ P++EHYGC+VDLL R+G LD+AE  I +MP++P+ ++W ALLG
Subjt:  MYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLG

Query:  GCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMK
         CR+  + E G+RV   ++++  + SG+ V L+N+YAS G   + +  RLRM++    K PGCS I+ID  ++EF + D SHP++ +I SM+ E+  K++
Subjt:  GCRIYKDAERGERVVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMK

Query:  VAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW
        +AGY+P T  V+ N++EE+KE+ L  HSEK+A AFGLI+TS G  IRIVKNLR+C DCH +IK+ISK+ +R+I VRDR RFHHF++G CSC DYW
Subjt:  VAGYKPKTDLVIHNIDEEEKEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGGATGTCCTGAGCTTCTTGACCTGATCTCTAGGGATAGGGAATGGGACCTGACTAGAAATGGTGTTGGAAGTTCTGTTAGTGAGAATAATAAACTTGAA
CTGAGACTTGGATTGCCAGGAGAAGGAAGTAGCTGCTTGAATAGTAACCATGAAAATGGTGAATTCCCTCTCTCTTCTGCTAAGCCATTTCCTTCTTGTTTGAAC
ATTAGAAACAACAAAAGGGCGCTTTATGAGAAAGCTGAAGGAATGGCAGGAGATGAATTCCAACCCCACAAGCTGATTTTCTATGAGAAAACAGCTGAGAAGGTG
TCTCCTCACACTCCCTCTCTTAGTGCTTCCTTACCATCTTCAGCTTTTCAGAGGGAAGCTCAGAAGCTATCTCAACCGTCACAATCTTCATATTTGAAACATTTA
TTGATGCCTCAGAAATTGGACTTGGTGTCAGAGGAGCCATCAAAGCCATGCAGTTTGAAAACAGCTGATTTGAAATCATGTCTCAACAGTGGTTCAGTTCCTGCA
GAATCATCTGAACCTAAACACCATGACAAAAGGGCTTCAGTTGCAGCAGCCGTTGGTTGGCCCCCCATCCGTTCTTTCAGGAAGAACTTTGCTGCTCCCCGCTCT
TCAAAGCCTAACTCGCTGGAGTTGTCAAAAGAAACTGTGCAGGATGAGAATGGCTCAAAATTATCTGACTGTTCCAAGGGACAAATGTTTGTGAAAGTTTGTATG
GATGGGGTTCCCATTGGAAGAAAACTGAATCTCCAAGCCTATAATAGTTATGATCAACTGTCTGCTGGCATAGACGAACTCTTCCACTGTCTTCTTGCAGCACAG
AGAAACTATCTAGCAGCTGAAGATGGGAGAAAGACGGAGGAAACAACTTCAGTATCTGAATCCAAACACAAAAATGGGCTGTACACTCTAGTTTACTATGATAAT
GAAGGGGACAGGATGCTTGTTGGTGATGTCCCGTGGAACATGTTTGTTTCCACTGTGAAGAGGCTACGTGTATTGAAGAGCTCCGTTGTTGCCACTGAAATGAAG
TATGAATCTTCATCCAATGCCAAAAGAGCACAAACAGGAAACAAGGGTAATAGTGCCAATTTTCCAGCACCAGAGACAAAACTCCAAAAAAAAAAGGGCTTGTGG
GTTGGGAGGCTTGGGCGCAATCAAATCCTTCCAACCATCCTCACAAACAAAGCAAGCAATAAAAGAAGCCGGCGGCACAACAGTTCCACTCTGTATCTTCAATTC
TACATGCTTCTCAGAGCCTCCCCTCTTCTTCGCGCCTCCCACCTTCCAGTCCCCTCTGGGATTCACATGGCGCTAGCTGCCCATCTCCCTTTCTATCCCAACTGT
AAAAGTTCCACTTCATGGCGATCTTCTCCGCACTTGGTTTCTGTTTGTCAAGAGAATGGTTACGACCGCATTTCAGCCAAAATAACTTTTTCACAATTTGTTCGA
AGTTTCTACGTTTTATCGGGAACATACAATAAGGTACAGAGACCCTGTGGAAGGATTTCTTGTGGGCAAAGAGAGTATCGAAACGTGAGAAGACGAGCCCCAAAG
AGTAAGATTAAGGAATTAGAGCTCAACGTCAGTATATGTATTGAAGAAGAATTGCCGGACGACCCAGAAATTTTGAGTATTGCAGAAATGCTTCGTTTGAATGTT
CCTAAGGCTCTGAAGTTAGCTTTCAACAATTTAAAGGATTCAGAATATAAAACTCGAGATAGTTCAATAGGTGATGTTGGTTTATTTGATAGTGTCGAGTTGTCC
GTGCTACTTTGTAATGACGAGTTTATTCGTAAACTTAACAAAGAATGGAGGGATGAGGATCATGCCACTGATGTTCTGTCCATGTCACAGCATGTCCCTGAACTG
AAGCTTCCAATTCTTATGTTAGGTGATATTGTCATTTCTGTTGAGACTGCCGCAAGGCAGGCGGAGGAAAGGGGGCACACACTCCTTGATGAAGTTCGGATACTC
CTGGAATGTCAAAATTATGTGAAAGAAAAGAAAAGCAGTCGAATTTTTTACTATAAAAAATTCAACGAAGCAAAGGTGCATGGATTGTTGCATCTTGTGGGATTT
GATCATGAGGTTAGCGAAGAGGCTGAAGTAGAGATGGAGAAGGAGGAGGAACTACTTTTGAAGAGTTTAGAATGGAAAGGGAAAGGGCTAATTCAGAGTGCAATT
GATGCTGAAACAAATTCAAACTCAGCTGCAGTAGTTTCAGATGGTACATTGCTCAACAGTAAAAGTCAAATTACTCCAACAACTGCCAAGGCTCTGAAAGAGGTC
TTGTCAAGGGGTGTGAAAGTTGTTATAGCCACTGGAAAGGCCCGCCCAGCAGTAATAAGCATTTTGGAGAAGGCAGGTTTAGCTGGAAAAGATGGTGCAGTGTCT
GAATATTCTCCTGGAGTTTTCTTACAGGGACTACTCGTTTATGGTAGACAAGGAAAAGAAATTTATAGGCGGAACTTAGACCAAGATGTCTGCAGAGAGGCATGC
CTTTATTCTTTAGAGAAGAAAGTTCCTCTGATTGCATTTGGTGAAGGCCGTTGCTTAACTCTATTTGATCATCCACTTGCTGATGCATTGCACACTATATATCAT
GAGCCCAAGGCAGAGGTCATGTCCTCTGTTGAGAATCTTTTGGCTGCAGCTGAAATACAGAAACTTATTTTCTTGGACACTGCCGAAGGAGTTTCACACACCTTA
CGACCATACTGGTCAGAAGCATCAGGGGAACACGCCTCGGTTGTTCAAGCTGTACCAGATATGCTCGAGATTGTCCCCCGTGGAACCTCAAAAGGTAGTGGAGTA
AAAATGCTGCTCGATCATTTGGGAGTTTCTGCTGAGGAGGTAATGGCTATTGGAGATGGGGAAAATGACATTGAGATGCTTGAATTAGCATCTCTTGGGGTTGCT
TTGAGTAATGGATCAGAGAAAACAAAAGCTGTTGCTAACTTAATTGGTCCCGGCAATGATGAAGATGGTGTTGCAGAGTCGCCAATTTTTCTAAGCACCCTGGCT
GTTCTTCACGACTACAACCACCGCAAAATCATCCACCTTCTTCTACGATGCGCCGCCCAGCTTTCAATGCGTCAGCTTTACGAAATCCAAGCCCAAGTCTTTGCC
TCCCCAATCCCCTCCATCGATCCCAATCTCATTGCCGTCAAGTTCATCGGCGTTTCCTCTTCCCATGGCAACGTCCGCCATTCTGTTCTCATCTTCAATCACTAT
CTATCTTCTCCAAACATCTTTGCCTACAATGCTCTTCTCAAAGCCTTTGCTCAACACAATGCTTGGCACAGTACCATTTCTTACTTCAATAACCAGTTGGTTTTG
CCTGATGCTCCAAACCCAGATGAATATACCTTCACTTCTGTACTCAAGGCCTGCGCTGGCCTTGCCCAGGTGCTCGAAGGCCAAAAAGTCCACTGTTTCGTCACC
AAATATGGCTGCGAATCAAACCTGTTTGTTAGGAATTCACTGGTCGATTTGTATTTTAAGGTCGGTTGCAATTGTATTGCCCAGAAGCTGTTTGATGAAATGGTT
GTGAGAGATGTTGTTTCATGGAACACTTTGATTTCAGGATATTGTTTTAGTGGGATGGTAGACAAAGCTCGGATGGTATTTGATGGGATGATGGAGAAAAACTTG
GTGTCCTGGTCGACAATGATCAGCGGATATGCAAGGATGGGAAATTTAGAAGAAGCACGGCAACTGTTTGAGAATATGCCAATAAGGAATGTGGTTTCTTGGAAC
GCAATGATTGCTGGATATGCACAGAATGAGAAGTATGCAGATGCTATTGAGTTGTTCAGGCAAATGCAGCATGAAGGCGGCCTAGCTCCGAACGATGTTACTCTT
GTTAGCGTGCTTTCAGCTTGTGCACATCTTGGTGCACTTGATCTAGGGAAGTGGATTCATAAGTTTATAAGACGAAGCAAGATGGACGTAGGTCTGTTTTTAGGG
AATGCATTGGCAGACATGTATGCAAAGTGTGGATGCATATTGGAGGCCAAGGGGGTGTTTAACGAAATGCAGGAGAGAGATGTAATCTCATGGAGTATAATAATC
ATGGGGTTGGCCATGTATGGATATGCAGATGAAGCATTCAACTGCTTTGCTGAAATGATTGAAGATGGTTTAAAGCCAAATGATATTTCCTTTATGGGTTTATTA
ACAGCCTGCACTCATGCTGGATTGGTTGATAAGGGGCTGGAGTACTTTGACATGATGGCTCAAGTCTATGGTATTACTCCCAAGGTTGAGCATTATGGGTGTATT
GTTGATCTTCTTAGCCGTGCAGGGCGTCTCGATCAAGCTGAGTCATTGATTAACTCCATGCCTATGCAACCTAATGTAATAGTTTGGGGTGCACTGCTCGGTGGT
TGTCGGATATACAAAGATGCAGAACGAGGAGAACGTGTTGTTCGGCACATACTTGAACTAGACTCCAACCATTCTGGAAGTCTTGTTTATCTAGCTAATGTTTAT
GCTTCAATGGGCAGGCTGGATGATGCAGCTACTTGTAGGTTGCGAATGCGAGACAACAAGTCGATGAAGACACCGGGGTGCAGTTGGATTGAGATCGACAACTCG
GTATATGAATTTTTCATGGGGGACTTGTCTCATCCTCAGTCTGCAAAAATATACTCAATGATTAGAGAATTGAAGTGGAAGATGAAAGTGGCAGGATATAAACCA
AAGACGGATCTTGTGATTCATAATATAGATGAAGAAGAGAAGGAGGATGCACTTTCTACTCATAGTGAGAAGCTTGCCATTGCATTTGGGCTTATCAATACCAGT
GAAGGAACTACAATCAGAATAGTTAAGAACTTGAGAGTTTGCAACGACTGTCATGATGCAATAAAAATAATCTCAAAAATTGTTGAGCGAGAGATTGTGGTGCGA
GATAGAAGTCGTTTTCACCATTTCAAAAATGGAAAATGTTCTTGTAATGATTACTGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGGATGTCCTGAGCTTCTTGACCTGATCTCTAGGGATAGGGAATGGGACCTGACTAGAAATGGTGTTGGAAGTTCTGTTAGTGAGAATAATAAACTTGAA
CTGAGACTTGGATTGCCAGGAGAAGGAAGTAGCTGCTTGAATAGTAACCATGAAAATGGTGAATTCCCTCTCTCTTCTGCTAAGCCATTTCCTTCTTGTTTGAAC
ATTAGAAACAACAAAAGGGCGCTTTATGAGAAAGCTGAAGGAATGGCAGGAGATGAATTCCAACCCCACAAGCTGATTTTCTATGAGAAAACAGCTGAGAAGGTG
TCTCCTCACACTCCCTCTCTTAGTGCTTCCTTACCATCTTCAGCTTTTCAGAGGGAAGCTCAGAAGCTATCTCAACCGTCACAATCTTCATATTTGAAACATTTA
TTGATGCCTCAGAAATTGGACTTGGTGTCAGAGGAGCCATCAAAGCCATGCAGTTTGAAAACAGCTGATTTGAAATCATGTCTCAACAGTGGTTCAGTTCCTGCA
GAATCATCTGAACCTAAACACCATGACAAAAGGGCTTCAGTTGCAGCAGCCGTTGGTTGGCCCCCCATCCGTTCTTTCAGGAAGAACTTTGCTGCTCCCCGCTCT
TCAAAGCCTAACTCGCTGGAGTTGTCAAAAGAAACTGTGCAGGATGAGAATGGCTCAAAATTATCTGACTGTTCCAAGGGACAAATGTTTGTGAAAGTTTGTATG
GATGGGGTTCCCATTGGAAGAAAACTGAATCTCCAAGCCTATAATAGTTATGATCAACTGTCTGCTGGCATAGACGAACTCTTCCACTGTCTTCTTGCAGCACAG
AGAAACTATCTAGCAGCTGAAGATGGGAGAAAGACGGAGGAAACAACTTCAGTATCTGAATCCAAACACAAAAATGGGCTGTACACTCTAGTTTACTATGATAAT
GAAGGGGACAGGATGCTTGTTGGTGATGTCCCGTGGAACATGTTTGTTTCCACTGTGAAGAGGCTACGTGTATTGAAGAGCTCCGTTGTTGCCACTGAAATGAAG
TATGAATCTTCATCCAATGCCAAAAGAGCACAAACAGGAAACAAGGGTAATAGTGCCAATTTTCCAGCACCAGAGACAAAACTCCAAAAAAAAAAGGGCTTGTGG
GTTGGGAGGCTTGGGCGCAATCAAATCCTTCCAACCATCCTCACAAACAAAGCAAGCAATAAAAGAAGCCGGCGGCACAACAGTTCCACTCTGTATCTTCAATTC
TACATGCTTCTCAGAGCCTCCCCTCTTCTTCGCGCCTCCCACCTTCCAGTCCCCTCTGGGATTCACATGGCGCTAGCTGCCCATCTCCCTTTCTATCCCAACTGT
AAAAGTTCCACTTCATGGCGATCTTCTCCGCACTTGGTTTCTGTTTGTCAAGAGAATGGTTACGACCGCATTTCAGCCAAAATAACTTTTTCACAATTTGTTCGA
AGTTTCTACGTTTTATCGGGAACATACAATAAGGTACAGAGACCCTGTGGAAGGATTTCTTGTGGGCAAAGAGAGTATCGAAACGTGAGAAGACGAGCCCCAAAG
AGTAAGATTAAGGAATTAGAGCTCAACGTCAGTATATGTATTGAAGAAGAATTGCCGGACGACCCAGAAATTTTGAGTATTGCAGAAATGCTTCGTTTGAATGTT
CCTAAGGCTCTGAAGTTAGCTTTCAACAATTTAAAGGATTCAGAATATAAAACTCGAGATAGTTCAATAGGTGATGTTGGTTTATTTGATAGTGTCGAGTTGTCC
GTGCTACTTTGTAATGACGAGTTTATTCGTAAACTTAACAAAGAATGGAGGGATGAGGATCATGCCACTGATGTTCTGTCCATGTCACAGCATGTCCCTGAACTG
AAGCTTCCAATTCTTATGTTAGGTGATATTGTCATTTCTGTTGAGACTGCCGCAAGGCAGGCGGAGGAAAGGGGGCACACACTCCTTGATGAAGTTCGGATACTC
CTGGAATGTCAAAATTATGTGAAAGAAAAGAAAAGCAGTCGAATTTTTTACTATAAAAAATTCAACGAAGCAAAGGTGCATGGATTGTTGCATCTTGTGGGATTT
GATCATGAGGTTAGCGAAGAGGCTGAAGTAGAGATGGAGAAGGAGGAGGAACTACTTTTGAAGAGTTTAGAATGGAAAGGGAAAGGGCTAATTCAGAGTGCAATT
GATGCTGAAACAAATTCAAACTCAGCTGCAGTAGTTTCAGATGGTACATTGCTCAACAGTAAAAGTCAAATTACTCCAACAACTGCCAAGGCTCTGAAAGAGGTC
TTGTCAAGGGGTGTGAAAGTTGTTATAGCCACTGGAAAGGCCCGCCCAGCAGTAATAAGCATTTTGGAGAAGGCAGGTTTAGCTGGAAAAGATGGTGCAGTGTCT
GAATATTCTCCTGGAGTTTTCTTACAGGGACTACTCGTTTATGGTAGACAAGGAAAAGAAATTTATAGGCGGAACTTAGACCAAGATGTCTGCAGAGAGGCATGC
CTTTATTCTTTAGAGAAGAAAGTTCCTCTGATTGCATTTGGTGAAGGCCGTTGCTTAACTCTATTTGATCATCCACTTGCTGATGCATTGCACACTATATATCAT
GAGCCCAAGGCAGAGGTCATGTCCTCTGTTGAGAATCTTTTGGCTGCAGCTGAAATACAGAAACTTATTTTCTTGGACACTGCCGAAGGAGTTTCACACACCTTA
CGACCATACTGGTCAGAAGCATCAGGGGAACACGCCTCGGTTGTTCAAGCTGTACCAGATATGCTCGAGATTGTCCCCCGTGGAACCTCAAAAGGTAGTGGAGTA
AAAATGCTGCTCGATCATTTGGGAGTTTCTGCTGAGGAGGTAATGGCTATTGGAGATGGGGAAAATGACATTGAGATGCTTGAATTAGCATCTCTTGGGGTTGCT
TTGAGTAATGGATCAGAGAAAACAAAAGCTGTTGCTAACTTAATTGGTCCCGGCAATGATGAAGATGGTGTTGCAGAGTCGCCAATTTTTCTAAGCACCCTGGCT
GTTCTTCACGACTACAACCACCGCAAAATCATCCACCTTCTTCTACGATGCGCCGCCCAGCTTTCAATGCGTCAGCTTTACGAAATCCAAGCCCAAGTCTTTGCC
TCCCCAATCCCCTCCATCGATCCCAATCTCATTGCCGTCAAGTTCATCGGCGTTTCCTCTTCCCATGGCAACGTCCGCCATTCTGTTCTCATCTTCAATCACTAT
CTATCTTCTCCAAACATCTTTGCCTACAATGCTCTTCTCAAAGCCTTTGCTCAACACAATGCTTGGCACAGTACCATTTCTTACTTCAATAACCAGTTGGTTTTG
CCTGATGCTCCAAACCCAGATGAATATACCTTCACTTCTGTACTCAAGGCCTGCGCTGGCCTTGCCCAGGTGCTCGAAGGCCAAAAAGTCCACTGTTTCGTCACC
AAATATGGCTGCGAATCAAACCTGTTTGTTAGGAATTCACTGGTCGATTTGTATTTTAAGGTCGGTTGCAATTGTATTGCCCAGAAGCTGTTTGATGAAATGGTT
GTGAGAGATGTTGTTTCATGGAACACTTTGATTTCAGGATATTGTTTTAGTGGGATGGTAGACAAAGCTCGGATGGTATTTGATGGGATGATGGAGAAAAACTTG
GTGTCCTGGTCGACAATGATCAGCGGATATGCAAGGATGGGAAATTTAGAAGAAGCACGGCAACTGTTTGAGAATATGCCAATAAGGAATGTGGTTTCTTGGAAC
GCAATGATTGCTGGATATGCACAGAATGAGAAGTATGCAGATGCTATTGAGTTGTTCAGGCAAATGCAGCATGAAGGCGGCCTAGCTCCGAACGATGTTACTCTT
GTTAGCGTGCTTTCAGCTTGTGCACATCTTGGTGCACTTGATCTAGGGAAGTGGATTCATAAGTTTATAAGACGAAGCAAGATGGACGTAGGTCTGTTTTTAGGG
AATGCATTGGCAGACATGTATGCAAAGTGTGGATGCATATTGGAGGCCAAGGGGGTGTTTAACGAAATGCAGGAGAGAGATGTAATCTCATGGAGTATAATAATC
ATGGGGTTGGCCATGTATGGATATGCAGATGAAGCATTCAACTGCTTTGCTGAAATGATTGAAGATGGTTTAAAGCCAAATGATATTTCCTTTATGGGTTTATTA
ACAGCCTGCACTCATGCTGGATTGGTTGATAAGGGGCTGGAGTACTTTGACATGATGGCTCAAGTCTATGGTATTACTCCCAAGGTTGAGCATTATGGGTGTATT
GTTGATCTTCTTAGCCGTGCAGGGCGTCTCGATCAAGCTGAGTCATTGATTAACTCCATGCCTATGCAACCTAATGTAATAGTTTGGGGTGCACTGCTCGGTGGT
TGTCGGATATACAAAGATGCAGAACGAGGAGAACGTGTTGTTCGGCACATACTTGAACTAGACTCCAACCATTCTGGAAGTCTTGTTTATCTAGCTAATGTTTAT
GCTTCAATGGGCAGGCTGGATGATGCAGCTACTTGTAGGTTGCGAATGCGAGACAACAAGTCGATGAAGACACCGGGGTGCAGTTGGATTGAGATCGACAACTCG
GTATATGAATTTTTCATGGGGGACTTGTCTCATCCTCAGTCTGCAAAAATATACTCAATGATTAGAGAATTGAAGTGGAAGATGAAAGTGGCAGGATATAAACCA
AAGACGGATCTTGTGATTCATAATATAGATGAAGAAGAGAAGGAGGATGCACTTTCTACTCATAGTGAGAAGCTTGCCATTGCATTTGGGCTTATCAATACCAGT
GAAGGAACTACAATCAGAATAGTTAAGAACTTGAGAGTTTGCAACGACTGTCATGATGCAATAAAAATAATCTCAAAAATTGTTGAGCGAGAGATTGTGGTGCGA
GATAGAAGTCGTTTTCACCATTTCAAAAATGGAAAATGTTCTTGTAATGATTACTGGTAA
Protein sequenceShow/hide protein sequence
MVGCPELLDLISRDREWDLTRNGVGSSVSENNKLELRLGLPGEGSSCLNSNHENGEFPLSSAKPFPSCLNIRNNKRALYEKAEGMAGDEFQPHKLIFYEKTAEKV
SPHTPSLSASLPSSAFQREAQKLSQPSQSSYLKHLLMPQKLDLVSEEPSKPCSLKTADLKSCLNSGSVPAESSEPKHHDKRASVAAAVGWPPIRSFRKNFAAPRS
SKPNSLELSKETVQDENGSKLSDCSKGQMFVKVCMDGVPIGRKLNLQAYNSYDQLSAGIDELFHCLLAAQRNYLAAEDGRKTEETTSVSESKHKNGLYTLVYYDN
EGDRMLVGDVPWNMFVSTVKRLRVLKSSVVATEMKYESSSNAKRAQTGNKGNSANFPAPETKLQKKKGLWVGRLGRNQILPTILTNKASNKRSRRHNSSTLYLQF
YMLLRASPLLRASHLPVPSGIHMALAAHLPFYPNCKSSTSWRSSPHLVSVCQENGYDRISAKITFSQFVRSFYVLSGTYNKVQRPCGRISCGQREYRNVRRRAPK
SKIKELELNVSICIEEELPDDPEILSIAEMLRLNVPKALKLAFNNLKDSEYKTRDSSIGDVGLFDSVELSVLLCNDEFIRKLNKEWRDEDHATDVLSMSQHVPEL
KLPILMLGDIVISVETAARQAEERGHTLLDEVRILLECQNYVKEKKSSRIFYYKKFNEAKVHGLLHLVGFDHEVSEEAEVEMEKEEELLLKSLEWKGKGLIQSAI
DAETNSNSAAVVSDGTLLNSKSQITPTTAKALKEVLSRGVKVVIATGKARPAVISILEKAGLAGKDGAVSEYSPGVFLQGLLVYGRQGKEIYRRNLDQDVCREAC
LYSLEKKVPLIAFGEGRCLTLFDHPLADALHTIYHEPKAEVMSSVENLLAAAEIQKLIFLDTAEGVSHTLRPYWSEASGEHASVVQAVPDMLEIVPRGTSKGSGV
KMLLDHLGVSAEEVMAIGDGENDIEMLELASLGVALSNGSEKTKAVANLIGPGNDEDGVAESPIFLSTLAVLHDYNHRKIIHLLLRCAAQLSMRQLYEIQAQVFA
SPIPSIDPNLIAVKFIGVSSSHGNVRHSVLIFNHYLSSPNIFAYNALLKAFAQHNAWHSTISYFNNQLVLPDAPNPDEYTFTSVLKACAGLAQVLEGQKVHCFVT
KYGCESNLFVRNSLVDLYFKVGCNCIAQKLFDEMVVRDVVSWNTLISGYCFSGMVDKARMVFDGMMEKNLVSWSTMISGYARMGNLEEARQLFENMPIRNVVSWN
AMIAGYAQNEKYADAIELFRQMQHEGGLAPNDVTLVSVLSACAHLGALDLGKWIHKFIRRSKMDVGLFLGNALADMYAKCGCILEAKGVFNEMQERDVISWSIII
MGLAMYGYADEAFNCFAEMIEDGLKPNDISFMGLLTACTHAGLVDKGLEYFDMMAQVYGITPKVEHYGCIVDLLSRAGRLDQAESLINSMPMQPNVIVWGALLGG
CRIYKDAERGERVVRHILELDSNHSGSLVYLANVYASMGRLDDAATCRLRMRDNKSMKTPGCSWIEIDNSVYEFFMGDLSHPQSAKIYSMIRELKWKMKVAGYKP
KTDLVIHNIDEEEKEDALSTHSEKLAIAFGLINTSEGTTIRIVKNLRVCNDCHDAIKIISKIVEREIVVRDRSRFHHFKNGKCSCNDYW