; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC05G080800 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC05G080800
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCiama_Chr05:1716008..1725294
RNA-Seq ExpressionCaUC05G080800
SyntenyCaUC05G080800
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR010839 - Acyclic terpene utilisation
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK17586.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0078.1Show/hide
Query:  MQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARK
        MQ DGFYPDNFTFPFLLK CTGN WLPVV+ VHAQIEKFGFMSDVFVPNSLIDSYSKCGS GISAAKKLFVSMGA RDVVSWNSMISG AKGGLYEEARK
Subjt:  MQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARK

Query:  VFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEK
        VFDEMP+RDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAG MEMARMLFDKMPVKNLVSWTII+SGFAEKGLAREAI LFDQMEK
Subjt:  VFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEK

Query:  ARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMK
        A LKLDNGT+I IL ACAESGLLGLGE+IHASIKNNN KCTTEISNALVDMYAKCGRLNIAY+VF+DIKNKDVVSWNAMLQGLAMHGHG+KALELFK+MK
Subjt:  ARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMK

Query:  EEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLD
        EEGFSP++VTMIGVLCACTHAGLIDDGI+YFSTMERDY LVPEVEHYGCMVDLLGRKGRLEEA+RLIR+MPM PN IIWGTLLGACRMHNAVELAREVLD
Subjt:  EEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLD

Query:  HLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERLGKDDIHDCTIKLRVNPQKQR
        HLV+LEPSDSGNLSMLSNIYAAAGDW+CVA+TRLRMRSIGT+KPSGASSIEVDNE                   VLMER  + DIHDCTIKLRVNP+KQR
Subjt:  HLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERLGKDDIHDCTIKLRVNPQKQR

Query:  DKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSR--NWMKLLLPLAIKRNICIITNMGAMDPPGAQRNVIEIADS
        DKV IGCGAGFGGDRPTAALKLLQRVK LNYLVLECLAERTLAD  QVMLSGGDGYDSR   WMKLLLPL++KRNICIITNMGAMDP  AQ+ VIE+A S
Subjt:  DKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSR--NWMKLLLPLAIKRNICIITNMGAMDPPGAQRNVIEIADS

Query:  LGLNVSVAVAYEVSVKEPEVLWIICPVRIDFQELFSCPPKWWPRISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVGKAIHTYVSFFNTTFVSF
        LGLNVSVAVAYE SVKE                           ISTYMG APIVECLEKYHPNVIITSRVADAALFLAPM                   
Subjt:  LGLNVSVAVAYEVSVKEPEVLWIICPVRIDFQELFSCPPKWWPRISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVGKAIHTYVSFFNTTFVSF

Query:  RNNAVLKENISHFREVALCLLWGTENFVDFRDFSLFYGLQPRSYRKYLQLIVIEEMLDFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGLP
                                                                   VYELGWNWDD P LAQGILAGHLLECGCQLTGGYFMHP   
Subjt:  RNNAVLKENISHFREVALCLLWGTENFVDFRDFSLFYGLQPRSYRKYLQLIVIEEMLDFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGLP

Query:  LLHTQFFVCGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVLCSGAKPSIQ
                 GDKYRSMSFQQLLNISLPYAEVE DGK+TVAK EE+GGLLNFSTCAEQLLYE+GDPSAYITPD+VVDFSNVSF SISSSRV+CSGAKPSIQ
Subjt:  LLHTQFFVCGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVLCSGAKPSIQ

Query:  GVPEKLLQLAPKVYKSIRVKSSELGNLTSQDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRM
        GVPEKLLQLAPK                  DCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEE LIGIN+HIVSYTIGLDSLKASSNSSN +EDIRLRM
Subjt:  GVPEKLLQLAPKVYKSIRVKSSELGNLTSQDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRM

Query:  DGLFKQKEHALLFVREFTALYTNGPAGGGGIS
        DGLF+QKEHALLFV+EFTALYTNGPAGGGGIS
Subjt:  DGLFKQKEHALLFVREFTALYTNGPAGGGGIS

XP_022980258.1 pentatricopeptide repeat-containing protein At3g29230 [Cucurbita maxima]0.0e+0089.21Show/hide
Query:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPS
        MQMCSV  RTPSWFSTRKLFEQKLSDLHKCTDLNQVKQ+HAQILKSNLHLDLYVVPKLISAFSL RQMPLATN FNQVQYPN HLYNT+IRAH  NSQPS
Subjt:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPS

Query:  QAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKG
        QAF+TFF MQ DG YPDNFTFPFLLKACTGN WLPV++MVHAQIEKFGFMSDVFVPNSLIDSYSKCGS GI  AKKLFVSMG CRDVVSWN MISGFAKG
Subjt:  QAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKG

Query:  GLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAI
        GLYEEARKVFD+MP RD ISWNTMLDGYVKVGKMDDAFKLFD MPERNVVSWSTM+LGYCK GDMEMA+MLF+KMP +NLVSWTI+ISGFAEKGLA+ AI
Subjt:  GLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAI

Query:  GLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKA
        GLFDQME+A +KLDNG VI ILAACAESGLLGLGE+IHASI+N+N KCTTEISNALVDMYAKCGRL+IAYNVFNDI+NKDVVSWNAML GLAMHGHG KA
Subjt:  GLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKA

Query:  LELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAV
        LELFKRMKE+GFSPDKVTMIGVLCAC+HAGLIDDGI+YFS+ME+DY LV E+EHYGCMVDLLGRKGRLEEA+RLIR+MPMEPNVIIWGTLLGACRMHNAV
Subjt:  LELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAV

Query:  ELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL
        ELAREVLDHLVKLEPSD GNLSMLSNIYAAAGDWDCVAD RLRMRSIGTQKPSGASSIEV+NEVHEFTVFDRSHPKSD IYQ++
Subjt:  ELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL

XP_031738476.1 pentatricopeptide repeat-containing protein At3g29230 [Cucumis sativus]0.0e+0092.81Show/hide
Query:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPS
        MQMCSVPIRTPSWFSTRKL EQKLSDLHKCT+LNQVKQ+HAQILKSNLH+DL+VVPKLISAFSL RQM LATN FNQVQYPNVHLYNTMIRAH+HNSQPS
Subjt:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPS

Query:  QAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKG
        QAFATFFAMQ DG Y DNFTFPFLLK CTGN WLPV++ VHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGA RDVVSWNSMISG AKG
Subjt:  QAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKG

Query:  GLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAI
        GLYEEARKVFDEMPE+DGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTII+SGFAEKGLAREAI
Subjt:  GLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAI

Query:  GLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKA
         LFDQMEKA LKLDNGTV+ ILAACAESGLLGLGE+IHASIKNNN KCTTEISNALVDMYAKCGRLNIAY+VFNDIKNKDVVSWNAMLQGLAMHGHGVKA
Subjt:  GLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKA

Query:  LELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAV
        LELFKRMKEEGFSP+KVTMIGVLCACTHAGLIDDGI+YFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEA+RLIR+MPM PN IIWGTLLGACRMHNAV
Subjt:  LELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAV

Query:  ELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL
        ELAREVLDHLV+LEP+DSGN SMLSNIYAAAGDW+CVA+TRLRMRSIGT+KPSGASSIEV+NEVHEFTVFDRSHPKSDNIYQV+
Subjt:  ELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL

XP_038900159.1 uncharacterized protein LOC120087281 isoform X1 [Benincasa hispida]0.0e+0071.09Show/hide
Query:  MERLGKDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSR--NWMKLLLPLAIKRNI
        MER GKDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVK+LNYL+LECLAERTLADR QVMLSGGDGYDSR  +WMKLLLPLA+KRNI
Subjt:  MERLGKDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSR--NWMKLLLPLAIKRNI

Query:  CIITNMGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVKEPEVLWIICPVRIDFQELFSCPPKWWPRISTYMGAAPIVECLEKYHPNVIITSRVADAAL
        CIITNMGAMDPP AQRNVIEIA SLGLNVSVAVAYEVSVKE                         P ISTY+GAAPIVECLEKYHPNVIITSR+ADAAL
Subjt:  CIITNMGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVKEPEVLWIICPVRIDFQELFSCPPKWWPRISTYMGAAPIVECLEKYHPNVIITSRVADAAL

Query:  FLAPMVGKAIHTYVSFFNTTFVSFRNNAVLKENISHFREVALCLLWGTENFVDFRDFSLFYGLQPRSYRKYLQLIVIEEMLDFVYELGWNWDDLPRLAQG
        FLAPM                                                                              VYELGWNWDD PRLAQG
Subjt:  FLAPMVGKAIHTYVSFFNTTFVSFRNNAVLKENISHFREVALCLLWGTENFVDFRDFSLFYGLQPRSYRKYLQLIVIEEMLDFVYELGWNWDDLPRLAQG

Query:  ILAGHLLECGCQLTGGYFMHPGLPLLHTQFFVCGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVD
        ILAGHLLECGCQLTGGYFMHP            GDKYRSMS QQLLNISLPYAE+E DG++TVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVD
Subjt:  ILAGHLLECGCQLTGGYFMHPGLPLLHTQFFVCGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVD

Query:  FSNVSFYSISSSRVLCSGAKPSIQGVPEKLLQLAPKVYKSIRVKSSELGNLTSQDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSY
        FSNVSF SISSSRV CSGAKPSIQG PEKLLQLAPK                  DCGWKGWGEISYGGRECVLRAKAA+YLVRSW+EE LIG+NQ IVSY
Subjt:  FSNVSFYSISSSRVLCSGAKPSIQGVPEKLLQLAPKVYKSIRVKSSELGNLTSQDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSY

Query:  TIGLDSLKASSNSSNSVEDIRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLIFGSTVFHFLHRVLLITCGCSGVMICEYCCN
        TIGLDSLKAS+NSS SVEDIRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISTGYKKEIVL+KQL                               
Subjt:  TIGLDSLKASSNSSNSVEDIRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLIFGSTVFHFLHRVLLITCGCSGVMICEYCCN

Query:  LSSSSSSSSSCVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPETGHSPFPSGQEIALYNVAHSRAGDKGND
                   VGRENIFWQTGVKCTEAVK + QP D R+DPAE  SSP+V LPCPI+A AEKP  G FPPE GHSP PS QEIALYNVAHSRAGDKGND
Subjt:  LSSSSSSSSSCVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPETGHSPFPSGQEIALYNVAHSRAGDKGND

Query:  LNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNSTLFPSSDADKKRDEGVDEHVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRMDRHGKTISDLILNQ
        LNFSVIPHYPSDIERLKM+ITPEWV RVLSVLHNST FPSSDA+KKRDE VDE VKVEIYEV+GIHSLNVVVRNILDGGVNCSRR+DRHGK ISDLILNQ
Subjt:  LNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNSTLFPSSDADKKRDEGVDEHVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRMDRHGKTISDLILNQ

Query:  QIVLPP
         IVLPP
Subjt:  QIVLPP

XP_038900175.1 pentatricopeptide repeat-containing protein At3g29230-like isoform X2 [Benincasa hispida]0.0e+0094.86Show/hide
Query:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPS
        MQMCSVPIR PSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLH+DLYVVPKLISAFSL RQMPLAT+ FNQVQYPNVHLYNTMIRAHTHNSQPS
Subjt:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPS

Query:  QAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKG
        QAFATFFAMQCDGFYPDNFTFPFLLKACTGN WL VV+MVHAQIEKFGFMSDVFVPNSLIDSYSKCGS GISAAKKLFVSMGACRDVVSWNSMISGFAKG
Subjt:  QAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKG

Query:  GLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAI
        GLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMAR+LFDKMPVKNLVSWTIIISGFAEKGLAREA+
Subjt:  GLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAI

Query:  GLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKA
        GLF+QMEKA LK D+GTVI IL+ACAESGLLGLGE+IHASIKNNN KCT EISNALV+MYAKCGRLNIAY VFNDIKNKDVVSWNAMLQGLAMHGHGVKA
Subjt:  GLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKA

Query:  LELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAV
        LELFKRMKEEGFSPDK+TMIGVLCACTHAGLIDDGIQYFSTMERDY LVPEVEHYGCMVDLLGRKGRLEEAVRLIR+MPMEPNVIIWG LLGACRMHNAV
Subjt:  LELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAV

Query:  ELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL
        ELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVAD RLRMRS GTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIY V+
Subjt:  ELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL

TrEMBL top hitse value%identityAlignment
A0A0A0L7H7 Uncharacterized protein0.0e+0079.57Show/hide
Query:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPS
        MQMCSVPIRTPSWFSTRKL EQKLSDLHKCT+LNQVKQ+HAQILKSNLH+DL+VVPKLISAFSL RQM LATN FNQVQYPNVHLYNTMIRAH+HNSQPS
Subjt:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPS

Query:  QAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKG
        QAFATFFAMQ DG Y DNFTFPFLLK CTGN WLPV++ VHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGA RDVVSWNSMISG AKG
Subjt:  QAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKG

Query:  GLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAI
        GLYEEARKVFDEMPE+DGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTII+SGFAEKGLAREAI
Subjt:  GLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAI

Query:  GLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKA
         LFDQMEKA LKLDNGTV+ ILAACAESGLLGLGE+IHASIKNNN KCTTEISNALVDMYAKCGRLNIAY+VFNDIKNKDVVSWNAMLQGLAMHGHGVKA
Subjt:  GLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKA

Query:  LELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAV
        LELFKRMKEEGFSP+KVTMIGVLCACTHAGLIDDGI+YFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEA+RLIR+MPM PN IIWGTLLGACRMHNAV
Subjt:  LELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAV

Query:  ELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERLGKDDIHDCTIKL
        ELAREVLDHLV+LEP+DSGN SMLSNIYAAAGDW+CVA+TRLRMRSIGT+KPSGASSIEV+NEVHEFTVFDRSHPKSDNIYQVLME  G+ DIHDCTIKL
Subjt:  ELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERLGKDDIHDCTIKL

Query:  RVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSR--NWMKLLLPLAIKRNICIITNMGAMDPPGAQR
        RVNPQKQRDKV IGCGAGFGGDRPTAALKLLQRVK LNYLVLECLAERTLAD  QVMLSGGDGYD R  +WMKLLLPLA+KRNICIITNMGAMDPP AQ+
Subjt:  RVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSR--NWMKLLLPLAIKRNICIITNMGAMDPPGAQR

Query:  NVIEIADSLGLNVSVAVAYEVSVKEPEVLWIICPVRIDFQELFSCPPKWWPRISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVGKAIHTYVSF
        NVIE+A SLGLNVSVAVAYE SVKE                           ISTYMG APIVECLEKYHPNVIITSRVADAALFLAPM           
Subjt:  NVIEIADSLGLNVSVAVAYEVSVKEPEVLWIICPVRIDFQELFSCPPKWWPRISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVGKAIHTYVSF

Query:  FNTTFVSFRNNAVLKENISHFREVALCLLWGTENFVDFRDFSLFYGLQPRSYRKYLQLIVIEEMLDFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGG
                                                                           VYELGWNWDD P LAQGILAGHLLECGCQLTGG
Subjt:  FNTTFVSFRNNAVLKENISHFREVALCLLWGTENFVDFRDFSLFYGLQPRSYRKYLQLIVIEEMLDFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGG

Query:  YFMHPGLPLLHTQFFVCGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVLC
        YFMHP            GDKYRSMSFQQLLNISLPYAEVE DGK+TVAK EE+GGLLNFSTCAEQLLYE+G+PSAYITPD+VVDFSNVSF SISSSRVLC
Subjt:  YFMHPGLPLLHTQFFVCGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVLC

Query:  SGAKPSIQGVPEKLLQLAPKVYKSIRVKSSELGNLTSQDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNS
        SGAKPSIQGVPEKLLQLAPK                  DCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEE LIGIN+HIVSYTIGLDSLKASSN SN 
Subjt:  SGAKPSIQGVPEKLLQLAPKVYKSIRVKSSELGNLTSQDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNS

Query:  VEDIRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLIFGSTVFHFLHRVLLITCGCSGVMICEYCCNLSSSSSSSSSCVGREN
        VEDIRLRMDGLF+QKEHALLFV+EFTALYTNGPAGGGGISTGYKKEIVL+KQL                                          VGREN
Subjt:  VEDIRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLIFGSTVFHFLHRVLLITCGCSGVMICEYCCNLSSSSSSSSSCVGREN

Query:  IFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPETGHSPFPSGQEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERL
        IFWQT V CTEAVK + Q TDL+KDPAE CSSPRVTLPCPI+ +A++ C+GS PPE GHSP PSGQEIALYNVAHSRAGDKGNDLNFS+IPH PSDIERL
Subjt:  IFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPETGHSPFPSGQEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERL

Query:  KMIITPEWVMRVLSVLHNSTLFPSSDADKKRDEGVDEHVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRMDRHGKTISDLILNQQIVLPP
        KMIITPEWVMRVLSVLHNST F SS+AD+KR+E V E VKVEIYEVKGIHSLNVVVRNILDGGVNCSRR+DRHGKTISDLILNQ IVLPP
Subjt:  KMIITPEWVMRVLSVLHNSTLFPSSDADKKRDEGVDEHVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRMDRHGKTISDLILNQQIVLPP

A0A5D3D3E7 Pentatricopeptide repeat-containing protein0.0e+0078.1Show/hide
Query:  MQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARK
        MQ DGFYPDNFTFPFLLK CTGN WLPVV+ VHAQIEKFGFMSDVFVPNSLIDSYSKCGS GISAAKKLFVSMGA RDVVSWNSMISG AKGGLYEEARK
Subjt:  MQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARK

Query:  VFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEK
        VFDEMP+RDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAG MEMARMLFDKMPVKNLVSWTII+SGFAEKGLAREAI LFDQMEK
Subjt:  VFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEK

Query:  ARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMK
        A LKLDNGT+I IL ACAESGLLGLGE+IHASIKNNN KCTTEISNALVDMYAKCGRLNIAY+VF+DIKNKDVVSWNAMLQGLAMHGHG+KALELFK+MK
Subjt:  ARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMK

Query:  EEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLD
        EEGFSP++VTMIGVLCACTHAGLIDDGI+YFSTMERDY LVPEVEHYGCMVDLLGRKGRLEEA+RLIR+MPM PN IIWGTLLGACRMHNAVELAREVLD
Subjt:  EEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLD

Query:  HLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERLGKDDIHDCTIKLRVNPQKQR
        HLV+LEPSDSGNLSMLSNIYAAAGDW+CVA+TRLRMRSIGT+KPSGASSIEVDNE                   VLMER  + DIHDCTIKLRVNP+KQR
Subjt:  HLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERLGKDDIHDCTIKLRVNPQKQR

Query:  DKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSR--NWMKLLLPLAIKRNICIITNMGAMDPPGAQRNVIEIADS
        DKV IGCGAGFGGDRPTAALKLLQRVK LNYLVLECLAERTLAD  QVMLSGGDGYDSR   WMKLLLPL++KRNICIITNMGAMDP  AQ+ VIE+A S
Subjt:  DKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSR--NWMKLLLPLAIKRNICIITNMGAMDPPGAQRNVIEIADS

Query:  LGLNVSVAVAYEVSVKEPEVLWIICPVRIDFQELFSCPPKWWPRISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVGKAIHTYVSFFNTTFVSF
        LGLNVSVAVAYE SVKE                           ISTYMG APIVECLEKYHPNVIITSRVADAALFLAPM                   
Subjt:  LGLNVSVAVAYEVSVKEPEVLWIICPVRIDFQELFSCPPKWWPRISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVGKAIHTYVSFFNTTFVSF

Query:  RNNAVLKENISHFREVALCLLWGTENFVDFRDFSLFYGLQPRSYRKYLQLIVIEEMLDFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGLP
                                                                   VYELGWNWDD P LAQGILAGHLLECGCQLTGGYFMHP   
Subjt:  RNNAVLKENISHFREVALCLLWGTENFVDFRDFSLFYGLQPRSYRKYLQLIVIEEMLDFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGLP

Query:  LLHTQFFVCGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVLCSGAKPSIQ
                 GDKYRSMSFQQLLNISLPYAEVE DGK+TVAK EE+GGLLNFSTCAEQLLYE+GDPSAYITPD+VVDFSNVSF SISSSRV+CSGAKPSIQ
Subjt:  LLHTQFFVCGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVLCSGAKPSIQ

Query:  GVPEKLLQLAPKVYKSIRVKSSELGNLTSQDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRM
        GVPEKLLQLAPK                  DCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEE LIGIN+HIVSYTIGLDSLKASSNSSN +EDIRLRM
Subjt:  GVPEKLLQLAPKVYKSIRVKSSELGNLTSQDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRM

Query:  DGLFKQKEHALLFVREFTALYTNGPAGGGGIS
        DGLF+QKEHALLFV+EFTALYTNGPAGGGGIS
Subjt:  DGLFKQKEHALLFVREFTALYTNGPAGGGGIS

A0A6J1CU14 pentatricopeptide repeat-containing protein At3g292301.1e-30587.33Show/hide
Query:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPS
        MQMC VP+RTPSWFSTR+LFEQKLSDLHKCTDLNQVKQ+HAQILKSNLHLDLYVVPKLISAFSL RQ  LAT+ FNQ+Q PNVHLYNT+IRAH  NSQPS
Subjt:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPS

Query:  QAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKG
        QAFA FFAMQC GFYPDNFTFPFLLKAC+G  WLPVV+MVHAQIEKFGFMSD+FVPNSLIDSYSKCGS GISAAKK F+SMG  RD+VSWNSMISG A+ 
Subjt:  QAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKG

Query:  GLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAI
        G Y EARKVFDEMP+RD ISWNTMLD YVK G+MDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAI
Subjt:  GLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAI

Query:  GLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKA
        GL+DQME+A LKLDNGTVI ILAACAESGLL LGE++H+SI  NN  CTTEISNALVDMYAKCGRL++A+NVFN  +NKDVVSWNAMLQGLAMHGHG KA
Subjt:  GLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKA

Query:  LELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAV
        LELFKRMKEEGFSPD+VTMIGVLCACTHAGLIDDG +YF  MERDY +VPE+EHYGCMVDLLGRKGRLEEA+RLI SMPMEPN +IWGTLLGACRMHNAV
Subjt:  LELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAV

Query:  ELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL
        ELAREVLDHLVKLEPSD GNLSMLSNIYAAAGDWDCVAD RLRMRSIG QKPSGASSIEVD+EVHEFTVFDRSHPKSD IYQV+
Subjt:  ELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL

A0A6J1EA54 pentatricopeptide repeat-containing protein At3g292301.2e-30788.32Show/hide
Query:  MCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQA
        MCSV  RTPSWFSTRKLFEQKLSDLHKCTDLNQVKQ+HAQILKS+LHLDLYVVPKLISAFSL RQMPLATN FNQVQYPNVHLYNT+IRAH  NSQPSQA
Subjt:  MCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQA

Query:  FATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGL
        F+TFF MQ DG YPDNFTFPFLLKACTGN WLPV++MVHAQIEKFGFMS+V VPNSLIDSYSKCGS GI  AKKLFVSMG CRDVVSWNSMISGFAKGGL
Subjt:  FATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGL

Query:  YEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGL
        YEEARKVFD+MP RD ISWNTMLDGYVK GKMDDAFKLFD MPERNVVSWSTM+LGYCK GDMEMA+ LFDKMP +NLVSWTI+ISGFAEKGLA+ AIGL
Subjt:  YEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGL

Query:  FDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALE
        FDQME+A +KLDNG VI ILAA AESGLLGLGE+IHASIKN+N KCTTEISNALVDMYAKCGRL+IAYNVFNDI+NKDVVSWNAML GLAMHGHG KALE
Subjt:  FDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALE

Query:  LFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVEL
        LFKRMKE+GFSPDKVTMIGVLCAC+HAGLIDDGI+YFS+ME++Y LV E+EHYGCMVDLLGRKGRLEEA+RLIR+MPMEPNVIIWGTLLGACRMHNAVEL
Subjt:  LFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVEL

Query:  AREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL
        AREVLDHLVKLEPS+ GNLSMLSNIYAAAGDWDCVAD RLRMRS GTQKPSGASSIEV+NEVHEFTVFDRSHPKSD IYQ++
Subjt:  AREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL

A0A6J1IYS0 pentatricopeptide repeat-containing protein At3g292300.0e+0089.21Show/hide
Query:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPS
        MQMCSV  RTPSWFSTRKLFEQKLSDLHKCTDLNQVKQ+HAQILKSNLHLDLYVVPKLISAFSL RQMPLATN FNQVQYPN HLYNT+IRAH  NSQPS
Subjt:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPS

Query:  QAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKG
        QAF+TFF MQ DG YPDNFTFPFLLKACTGN WLPV++MVHAQIEKFGFMSDVFVPNSLIDSYSKCGS GI  AKKLFVSMG CRDVVSWN MISGFAKG
Subjt:  QAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKG

Query:  GLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAI
        GLYEEARKVFD+MP RD ISWNTMLDGYVKVGKMDDAFKLFD MPERNVVSWSTM+LGYCK GDMEMA+MLF+KMP +NLVSWTI+ISGFAEKGLA+ AI
Subjt:  GLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAI

Query:  GLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKA
        GLFDQME+A +KLDNG VI ILAACAESGLLGLGE+IHASI+N+N KCTTEISNALVDMYAKCGRL+IAYNVFNDI+NKDVVSWNAML GLAMHGHG KA
Subjt:  GLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKA

Query:  LELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAV
        LELFKRMKE+GFSPDKVTMIGVLCAC+HAGLIDDGI+YFS+ME+DY LV E+EHYGCMVDLLGRKGRLEEA+RLIR+MPMEPNVIIWGTLLGACRMHNAV
Subjt:  LELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAV

Query:  ELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL
        ELAREVLDHLVKLEPSD GNLSMLSNIYAAAGDWDCVAD RLRMRSIGTQKPSGASSIEV+NEVHEFTVFDRSHPKSD IYQ++
Subjt:  ELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic1.1e-11836.92Show/hide
Query:  PSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLS--RQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFA
        P+  +T     + +S + +C  L Q+KQ H  ++++    D Y   KL +  +LS    +  A   F+++  PN   +NT+IRA+     P  +   F  
Subjt:  PSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLS--RQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFA

Query:  MQCDG-FYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEAR
        M  +   YP+ +TFPFL+KA    + L + Q +H    K    SDVFV NSLI  Y  CG   + +A K+F ++   +DVVSWNSMI+GF + G  ++A 
Subjt:  MQCDG-FYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEAR

Query:  KVFDEMPERD---------GI------------------------------SWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMA
        ++F +M   D         G+                                N MLD Y K G ++DA +LFD M E++ V+W+TM+ GY  + D E A
Subjt:  KVFDEMPERD---------GI------------------------------SWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMA

Query:  RMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQME-KARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLN
        R + + MP K++V+W  +IS + + G   EA+ +F +++ +  +KL+  T++  L+ACA+ G L LG  IH+ IK + ++    +++AL+ MY+KCG L 
Subjt:  RMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQME-KARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLN

Query:  IAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGR
         +  VFN ++ +DV  W+AM+ GLAMHG G +A+++F +M+E    P+ VT   V CAC+H GL+D+    F  ME +Y +VPE +HY C+VD+LGR G 
Subjt:  IAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGR

Query:  LEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEF
        LE+AV+ I +MP+ P+  +WG LLGAC++H  + LA      L++LEP + G   +LSNIYA  G W+ V++ R  MR  G +K  G SSIE+D  +HEF
Subjt:  LEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEF

Query:  TVFDRSHPKSDNIYQVLMERLGK
           D +HP S+ +Y  L E + K
Subjt:  TVFDRSHPKSDNIYQVLMERLGK

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.0e-12239.06Show/hide
Query:  LSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLS---RQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFT
        LS LH C  L  ++ IHAQ++K  LH   Y + KLI    LS     +P A + F  +Q PN+ ++NTM R H  +S P  A   +  M   G  P+++T
Subjt:  LSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLS---RQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFT

Query:  FPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPERDGIS
        FPF+LK+C  +      Q +H  + K G   D++V  SLI  Y + G   +  A K+F      RDVVS+ ++I G+A  G  E A+K+FDE+P +D +S
Subjt:  FPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPERDGIS

Query:  WNTMLDGYVKVGKMDDAFKLFDEMPERNV-VSWSTMV-------------LG-------------------------YCKAGDMEMARMLFDKMPVKNLV
        WN M+ GY + G   +A +LF +M + NV    STMV             LG                         Y K G++E A  LF+++P K+++
Subjt:  WNTMLDGYVKVGKMDDAFKLFDEMPERNV-VSWSTMV-------------LG-------------------------YCKAGDMEMARMLFDKMPVKNLV

Query:  SWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISN---ALVDMYAKCGRLNIAYNVFNDIKN
        SW  +I G+    L +EA+ LF +M ++    ++ T++ IL ACA  G + +G  IH  I +  LK  T  S+   +L+DMYAKCG +  A+ VFN I +
Subjt:  SWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISN---ALVDMYAKCGRLNIAYNVFNDIKN

Query:  KDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSM
        K + SWNAM+ G AMHG    + +LF RM++ G  PD +T +G+L AC+H+G++D G   F TM +DY + P++EHYGCM+DLLG  G  +EA  +I  M
Subjt:  KDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSM

Query:  PMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSD
         MEP+ +IW +LL AC+MH  VEL     ++L+K+EP + G+  +LSNIYA+AG W+ VA TR  +   G +K  G SSIE+D+ VHEF + D+ HP++ 
Subjt:  PMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSD

Query:  NIY------QVLMERLG
         IY      +VL+E+ G
Subjt:  NIY------QVLMERLG

Q9LS72 Pentatricopeptide repeat-containing protein At3g292303.0e-22864.09Show/hide
Query:  SVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFA
        S+P+R PSW S+R++FE++L DL KC +LNQVKQ+HAQI++ NLH DL++ PKLISA SL RQ  LA   FNQVQ PNVHL N++IRAH  NSQP QAF 
Subjt:  SVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFA

Query:  TFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYE
         F  MQ  G + DNFT+PFLLKAC+G +WLPVV+M+H  IEK G  SD++VPN+LID YS+CG  G+  A KLF  M   RD VSWNSM+ G  K G   
Subjt:  TFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYE

Query:  EARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPV--KNLVSWTIIISGFAEKGLAREAIGL
        +AR++FDEMP+RD ISWNTMLDGY +  +M  AF+LF++MPERN VSWSTMV+GY KAGDMEMAR++FDKMP+  KN+V+WTIII+G+AEKGL +EA  L
Subjt:  EARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPV--KNLVSWTIIISGFAEKGLAREAIGL

Query:  FDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALE
         DQM  + LK D   VI ILAAC ESGLL LG RIH+ +K +NL     + NAL+DMYAKCG L  A++VFNDI  KD+VSWN ML GL +HGHG +A+E
Subjt:  FDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALE

Query:  LFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVEL
        LF RM+ EG  PDKVT I VLC+C HAGLID+GI YF +ME+ Y LVP+VEHYGC+VDLLGR GRL+EA++++++MPMEPNV+IWG LLGACRMHN V++
Subjt:  LFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVEL

Query:  AREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL
        A+EVLD+LVKL+P D GN S+LSNIYAAA DW+ VAD R +M+S+G +KPSGASS+E+++ +HEFTVFD+SHPKSD IYQ+L
Subjt:  AREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL

Q9SR82 Putative pentatricopeptide repeat-containing protein At3g088203.3e-11039.65Show/hide
Query:  CTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFTFPFLLKACT
        CT +N +KQIH  ++  +LH D ++V  L+      RQ   +   F+  Q+PN+ LYN++I    +N    +    F +++  G Y   FTFP +LKACT
Subjt:  CTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFTFPFLLKACT

Query:  GNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPER----DGISWNTML
          +   +   +H+ + K GF  DV    SL+  YS  GS  ++ A KLF  +   R VV+W ++ SG+   G + EA  +F +M E     D      +L
Subjt:  GNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPER----DGISWNTML

Query:  DGYVKVGKMDDA---FKLFDEMP-ERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGI
           V VG +D      K  +EM  ++N    +T+V  Y K G ME AR +FD M  K++V+W+ +I G+A     +E I LF QM +  LK D  +++G 
Subjt:  DGYVKVGKMDDA---FKLFDEMP-ERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGI

Query:  LAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIG
        L++CA  G L LGE   + I  +       ++NAL+DMYAKCG +   + VF ++K KD+V  NA + GLA +GH   +  +F + ++ G SPD  T +G
Subjt:  LAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIG

Query:  VLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNL
        +LC C HAGLI DG+++F+ +   Y L   VEHYGCMVDL GR G L++A RLI  MPM PN I+WG LL  CR+    +LA  VL  L+ LEP ++GN 
Subjt:  VLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNL

Query:  SMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERLGKD
          LSNIY+  G WD  A+ R  M   G +K  G S IE++ +VHEF   D+SHP SD IY  L E LG +
Subjt:  SMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERLGKD

Q9SY02 Pentatricopeptide repeat-containing protein At4g027505.2e-11139.17Show/hide
Query:  LISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMS--DVFV
        ++S ++ +  +  A + F+++   N   +N ++ A+  NS+  +A   F + +          +  +   C    ++   ++V A+ + F  M+  DV  
Subjt:  LISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMS--DVFV

Query:  PNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTM
         N++I  Y++ G   I  A++LF      +DV +W +M+SG+ +  + EEAR++FD+MPER+ +SWN ML GYV+  +M+ A +LFD MP RNV +W+TM
Subjt:  PNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTM

Query:  VLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNA
        + GY + G +  A+ LFDKMP ++ VSW  +I+G+++ G + EA+ LF QME+   +L+  +    L+ CA+   L LG+++H  +     +    + NA
Subjt:  VLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNA

Query:  LVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHY
        L+ MY KCG +  A ++F ++  KD+VSWN M+ G + HG G  AL  F+ MK EG  PD  TM+ VL AC+H GL+D G QYF TM +DY ++P  +HY
Subjt:  LVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHY

Query:  GCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGA
         CMVDLLGR G LE+A  L+++MP EP+  IWGTLLGA R+H   ELA    D +  +EP +SG   +LSN+YA++G W  V   R+RMR  G +K  G 
Subjt:  GCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGA

Query:  SSIEVDNEVHEFTVFDRSHPKSDNIYQVLME
        S IE+ N+ H F+V D  HP+ D I+  L E
Subjt:  SSIEVDNEVHEFTVFDRSHPKSDNIYQVLME

Arabidopsis top hitse value%identityAlignment
AT1G01770.1 unknown protein6.5e-21051.43Show/hide
Query:  KDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSR--NWMKLLLPLAIKRNICIITN
        K+ + DC I LR NP+++R+ VY+GCGAGFGGDRP AALKLLQRV+ LNYLVLECLAERTLADR   M SGG GYD R   WM+LLLPLA++R  CIITN
Subjt:  KDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSR--NWMKLLLPLAIKRNICIITN

Query:  MGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVKEPEVLWIICPVRIDFQELFSCPPKWWPRISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPM
        MGA+DP GAQ+ V+E+A  LGL +SVAVA+EV  +                  F          STY+GAAPIVECLEKY PNVIITSRVADAALFLAPM
Subjt:  MGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVKEPEVLWIICPVRIDFQELFSCPPKWWPRISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPM

Query:  VGKAIHTYVSFFNTTFVSFRNNAVLKENISHFREVALCLLWGTENFVDFRDFSLFYGLQPRSYRKYLQLIVIEEMLDFVYELGWNWDDLPRLAQGILAGH
                                                                                      VYELGWNW+DL  LAQG LAGH
Subjt:  VGKAIHTYVSFFNTTFVSFRNNAVLKENISHFREVALCLLWGTENFVDFRDFSLFYGLQPRSYRKYLQLIVIEEMLDFVYELGWNWDDLPRLAQGILAGH

Query:  LLECGCQLTGGYFMHPGLPLLHTQFFVCGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVS
        LLECGCQLTGGYFMHP            GD+YR M+F  L ++SLPYAE+ YDGKV V+K E +GG+LN STCAEQLLYE+ DPSAYITPD+V+D   VS
Subjt:  LLECGCQLTGGYFMHPGLPLLHTQFFVCGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVS

Query:  FYSISSSRVLCSGAKPSIQ-GVPEKLLQLAPKVYKSIRVKSSELGNLTSQDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGL
        F  +S  +V CSGAKPS    VPEKLL+L PK                  +CGWKGWGEISYGG   + RAKA+E+LVRSWMEE + G+N  I+SY IG+
Subjt:  FYSISSSRVLCSGAKPSIQ-GVPEKLLQLAPKVYKSIRVKSSELGNLTSQDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGL

Query:  DSLKASSNSSNSVE---DIRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLIFGSTVFHFLHRVLLITCGCSGVMICEYCCNL
        DSLKA+SN + S +   DIRLRMDGLFK KEHA+   +EFTALYTNGPAGGGGISTG+K EIVL+K+L                                
Subjt:  DSLKASSNSSNSVE---DIRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLIFGSTVFHFLHRVLLITCGCSGVMICEYCCNL

Query:  SSSSSSSSSCVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPETGHSPFPSGQEIALYNVAHSRAGDKGNDL
                  V RE++ W+TG+          Q T+  +    E  SP      P          G +     HSP PSGQ+I LY+VAHSRAGDKGND+
Subjt:  SSSSSSSSSCVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPETGHSPFPSGQEIALYNVAHSRAGDKGNDL

Query:  NFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNSTLFPSSDADKKRDEGVDEHVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRMDRHGKTISDLILNQQ
        NFS+IPHY  D+ERLK+IITP+WV  V+SVL +++ F   DA     + +DE+V VEIY+V+GIH++NVVVRNILDGGVNCSRR+DRHGKTISDLIL QQ
Subjt:  NFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNSTLFPSSDADKKRDEGVDEHVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRMDRHGKTISDLILNQQ

Query:  IVL
        +VL
Subjt:  IVL

AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.1e-12439.06Show/hide
Query:  LSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLS---RQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFT
        LS LH C  L  ++ IHAQ++K  LH   Y + KLI    LS     +P A + F  +Q PN+ ++NTM R H  +S P  A   +  M   G  P+++T
Subjt:  LSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLS---RQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFT

Query:  FPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPERDGIS
        FPF+LK+C  +      Q +H  + K G   D++V  SLI  Y + G   +  A K+F      RDVVS+ ++I G+A  G  E A+K+FDE+P +D +S
Subjt:  FPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPERDGIS

Query:  WNTMLDGYVKVGKMDDAFKLFDEMPERNV-VSWSTMV-------------LG-------------------------YCKAGDMEMARMLFDKMPVKNLV
        WN M+ GY + G   +A +LF +M + NV    STMV             LG                         Y K G++E A  LF+++P K+++
Subjt:  WNTMLDGYVKVGKMDDAFKLFDEMPERNV-VSWSTMV-------------LG-------------------------YCKAGDMEMARMLFDKMPVKNLV

Query:  SWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISN---ALVDMYAKCGRLNIAYNVFNDIKN
        SW  +I G+    L +EA+ LF +M ++    ++ T++ IL ACA  G + +G  IH  I +  LK  T  S+   +L+DMYAKCG +  A+ VFN I +
Subjt:  SWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISN---ALVDMYAKCGRLNIAYNVFNDIKN

Query:  KDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSM
        K + SWNAM+ G AMHG    + +LF RM++ G  PD +T +G+L AC+H+G++D G   F TM +DY + P++EHYGCM+DLLG  G  +EA  +I  M
Subjt:  KDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSM

Query:  PMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSD
         MEP+ +IW +LL AC+MH  VEL     ++L+K+EP + G+  +LSNIYA+AG W+ VA TR  +   G +K  G SSIE+D+ VHEF + D+ HP++ 
Subjt:  PMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSD

Query:  NIY------QVLMERLG
         IY      +VL+E+ G
Subjt:  NIY------QVLMERLG

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.1e-12036.92Show/hide
Query:  PSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLS--RQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFA
        P+  +T     + +S + +C  L Q+KQ H  ++++    D Y   KL +  +LS    +  A   F+++  PN   +NT+IRA+     P  +   F  
Subjt:  PSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLS--RQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFA

Query:  MQCDG-FYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEAR
        M  +   YP+ +TFPFL+KA    + L + Q +H    K    SDVFV NSLI  Y  CG   + +A K+F ++   +DVVSWNSMI+GF + G  ++A 
Subjt:  MQCDG-FYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEAR

Query:  KVFDEMPERD---------GI------------------------------SWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMA
        ++F +M   D         G+                                N MLD Y K G ++DA +LFD M E++ V+W+TM+ GY  + D E A
Subjt:  KVFDEMPERD---------GI------------------------------SWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMA

Query:  RMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQME-KARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLN
        R + + MP K++V+W  +IS + + G   EA+ +F +++ +  +KL+  T++  L+ACA+ G L LG  IH+ IK + ++    +++AL+ MY+KCG L 
Subjt:  RMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQME-KARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLN

Query:  IAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGR
         +  VFN ++ +DV  W+AM+ GLAMHG G +A+++F +M+E    P+ VT   V CAC+H GL+D+    F  ME +Y +VPE +HY C+VD+LGR G 
Subjt:  IAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGR

Query:  LEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEF
        LE+AV+ I +MP+ P+  +WG LLGAC++H  + LA      L++LEP + G   +LSNIYA  G W+ V++ R  MR  G +K  G SSIE+D  +HEF
Subjt:  LEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEF

Query:  TVFDRSHPKSDNIYQVLMERLGK
           D +HP S+ +Y  L E + K
Subjt:  TVFDRSHPKSDNIYQVLMERLGK

AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.1e-22964.09Show/hide
Query:  SVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFA
        S+P+R PSW S+R++FE++L DL KC +LNQVKQ+HAQI++ NLH DL++ PKLISA SL RQ  LA   FNQVQ PNVHL N++IRAH  NSQP QAF 
Subjt:  SVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFA

Query:  TFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYE
         F  MQ  G + DNFT+PFLLKAC+G +WLPVV+M+H  IEK G  SD++VPN+LID YS+CG  G+  A KLF  M   RD VSWNSM+ G  K G   
Subjt:  TFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYE

Query:  EARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPV--KNLVSWTIIISGFAEKGLAREAIGL
        +AR++FDEMP+RD ISWNTMLDGY +  +M  AF+LF++MPERN VSWSTMV+GY KAGDMEMAR++FDKMP+  KN+V+WTIII+G+AEKGL +EA  L
Subjt:  EARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPV--KNLVSWTIIISGFAEKGLAREAIGL

Query:  FDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALE
         DQM  + LK D   VI ILAAC ESGLL LG RIH+ +K +NL     + NAL+DMYAKCG L  A++VFNDI  KD+VSWN ML GL +HGHG +A+E
Subjt:  FDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALE

Query:  LFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVEL
        LF RM+ EG  PDKVT I VLC+C HAGLID+GI YF +ME+ Y LVP+VEHYGC+VDLLGR GRL+EA++++++MPMEPNV+IWG LLGACRMHN V++
Subjt:  LFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVEL

Query:  AREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL
        A+EVLD+LVKL+P D GN S+LSNIYAAA DW+ VAD R +M+S+G +KPSGASS+E+++ +HEFTVFD+SHPKSD IYQ+L
Subjt:  AREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.7e-11239.17Show/hide
Query:  LISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMS--DVFV
        ++S ++ +  +  A + F+++   N   +N ++ A+  NS+  +A   F + +          +  +   C    ++   ++V A+ + F  M+  DV  
Subjt:  LISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMS--DVFV

Query:  PNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTM
         N++I  Y++ G   I  A++LF      +DV +W +M+SG+ +  + EEAR++FD+MPER+ +SWN ML GYV+  +M+ A +LFD MP RNV +W+TM
Subjt:  PNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTM

Query:  VLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNA
        + GY + G +  A+ LFDKMP ++ VSW  +I+G+++ G + EA+ LF QME+   +L+  +    L+ CA+   L LG+++H  +     +    + NA
Subjt:  VLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNA

Query:  LVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHY
        L+ MY KCG +  A ++F ++  KD+VSWN M+ G + HG G  AL  F+ MK EG  PD  TM+ VL AC+H GL+D G QYF TM +DY ++P  +HY
Subjt:  LVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHY

Query:  GCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGA
         CMVDLLGR G LE+A  L+++MP EP+  IWGTLLGA R+H   ELA    D +  +EP +SG   +LSN+YA++G W  V   R+RMR  G +K  G 
Subjt:  GCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGA

Query:  SSIEVDNEVHEFTVFDRSHPKSDNIYQVLME
        S IE+ N+ H F+V D  HP+ D I+  L E
Subjt:  SSIEVDNEVHEFTVFDRSHPKSDNIYQVLME


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TACTTATCTAACTTATCTCGTAATTATGCGTTTTCTCCTTCTGTCAAGACAGGAAAGGGCGTGGCTAAAGCTTCTCCGTCTGCCATTTTTGCGTGCGAAGCCATGCAAAT
GTGCAGTGTCCCAATTCGAACCCCCTCCTGGTTTTCCACTCGAAAGCTCTTCGAGCAGAAGCTTTCAGATCTCCACAAGTGTACAGACCTCAACCAAGTGAAGCAAATCC
ACGCTCAAATCCTCAAATCCAATCTCCACCTCGACCTCTATGTTGTTCCCAAACTCATATCTGCCTTCTCCCTTTCTCGCCAGATGCCTCTCGCCACCAACACTTTCAAT
CAAGTTCAATATCCAAACGTCCATTTGTACAACACTATGATTCGAGCCCACACCCATAACTCACAACCTTCACAAGCCTTCGCCACTTTCTTTGCTATGCAATGTGATGG
ATTTTACCCCGATAATTTCACTTTCCCGTTTCTTTTGAAAGCTTGTACTGGGAATGCATGGTTGCCTGTTGTTCAAATGGTGCATGCCCAAATCGAGAAATTTGGGTTCA
TGTCGGATGTATTCGTGCCAAATTCTCTTATTGATTCATATTCCAAATGTGGGTCTTGTGGAATTTCAGCAGCAAAGAAGTTGTTTGTGTCAATGGGAGCTTGTAGGGAT
GTTGTGTCATGGAACTCAATGATCTCTGGATTTGCGAAGGGTGGGTTATATGAAGAAGCTCGAAAGGTGTTCGATGAAATGCCTGAAAGGGATGGTATTAGTTGGAACAC
AATGTTGGATGGGTATGTTAAAGTTGGGAAAATGGATGATGCGTTTAAACTGTTTGATGAAATGCCTGAGAGGAATGTTGTCTCTTGGTCGACAATGGTGTTAGGGTATT
GCAAGGCAGGGGATATGGAGATGGCACGAATGTTGTTTGATAAAATGCCTGTGAAGAATTTGGTTTCTTGGACCATAATTATCTCTGGGTTTGCTGAGAAAGGGCTAGCT
AGGGAGGCCATTGGTTTGTTTGATCAAATGGAAAAGGCTCGCTTGAAATTAGACAATGGGACGGTAATAGGTATTTTGGCTGCTTGTGCTGAGTCTGGTTTGCTTGGGCT
TGGTGAGAGAATACATGCTTCCATTAAGAACAACAATTTGAAATGTACTACTGAAATCTCCAATGCTTTGGTTGATATGTATGCAAAATGTGGTAGGTTGAATATTGCTT
ACAATGTTTTTAATGACATAAAAAATAAAGATGTCGTGTCTTGGAATGCTATGCTTCAAGGGCTAGCAATGCATGGACATGGAGTGAAAGCACTCGAGCTTTTCAAAAGA
ATGAAAGAAGAGGGTTTCTCACCTGACAAAGTTACAATGATCGGAGTCTTGTGTGCTTGTACGCATGCAGGATTGATCGACGATGGCATTCAATACTTCTCTACAATGGA
AAGGGACTACACCCTTGTTCCTGAAGTTGAGCATTATGGTTGCATGGTAGACCTTTTGGGTCGCAAGGGAAGGCTTGAGGAAGCCGTAAGGCTCATTCGCAGCATGCCAA
TGGAACCAAATGTCATCATTTGGGGAACCCTTTTAGGGGCATGTCGTATGCATAATGCTGTTGAACTTGCGAGGGAGGTTCTTGATCATTTGGTTAAGCTAGAGCCATCT
GATTCAGGTAATTTGTCCATGTTGTCGAACATATATGCTGCGGCAGGGGACTGGGATTGTGTCGCTGACACGAGGTTAAGAATGCGGAGTATTGGAACTCAAAAACCGTC
GGGTGCTAGTTCCATTGAGGTCGACAATGAGGTTCATGAATTTACAGTATTTGATCGATCGCATCCAAAGTCTGATAATATATATCAAGTGCTAATGGAGAGGCTGGGTA
AAGATGACATCCATGACTGCACAATTAAGCTGAGAGTAAATCCTCAAAAACAGAGAGACAAGGTGTACATTGGCTGTGGTGCTGGATTTGGAGGCGATAGGCCAACAGCA
GCTCTTAAATTGCTTCAAAGGGTCAAAACCCTAAACTATCTCGTACTTGAATGCCTAGCAGAACGCACTCTTGCAGATCGCTGTCAAGTTATGTTGTCTGGTGGTGATGG
TTATGATTCAAGGAATTGGATGAAATTGCTTCTTCCATTGGCTATAAAGAGAAATATTTGCATAATTACCAACATGGGTGCAATGGACCCCCCTGGGGCTCAGCGAAACG
TTATAGAAATTGCAGACAGTCTGGGGTTGAATGTTTCAGTTGCAGTTGCTTATGAGGTTTCAGTAAAAGAACCAGAGGTTTTATGGATTATCTGTCCAGTAAGGATTGAT
TTCCAAGAATTGTTCTCGTGCCCTCCCAAATGGTGGCCAAGGATAAGCACGTATATGGGAGCAGCTCCTATTGTTGAGTGTCTGGAAAAGTACCATCCAAATGTCATAAT
TACTTCACGTGTCGCAGATGCTGCCCTATTCTTGGCTCCAATGGTAGGAAAAGCGATTCATACTTATGTTTCTTTTTTCAATACAACCTTTGTTTCCTTTAGAAACAATG
CTGTTTTAAAGGAGAATATCAGTCACTTTAGAGAAGTTGCTTTGTGCCTCCTTTGGGGGACGGAAAACTTTGTGGATTTTAGAGATTTCAGCCTCTTTTATGGGCTTCAA
CCGAGGAGTTACAGAAAATATCTCCAGTTGATTGTGATAGAAGAAATGTTAGATTTTGTCTATGAACTTGGTTGGAACTGGGATGATCTTCCACGGCTAGCACAGGGAAT
ACTGGCTGGTCATCTTCTGGAATGTGGCTGTCAACTTACAGGGGGATACTTTATGCATCCAGGTCTACCTCTCCTTCATACCCAATTCTTTGTATGTGGGGACAAGTATA
GAAGCATGTCTTTCCAACAGCTTCTGAATATATCACTGCCTTATGCGGAAGTTGAGTATGATGGAAAAGTCACTGTAGCCAAGGCAGAAGAGACTGGAGGTCTTTTGAAT
TTCAGTACATGTGCCGAACAACTTCTGTATGAGGTTGGTGATCCATCGGCTTATATCACCCCTGATATGGTGGTTGACTTCAGCAATGTTTCGTTTTACTCTATATCCAG
TTCTAGGGTTTTATGTTCCGGAGCTAAACCATCTATTCAAGGAGTGCCTGAGAAACTCTTGCAGTTGGCTCCAAAGGTATACAAGTCTATTAGAGTAAAAAGTTCCGAAC
TTGGAAATCTAACTTCTCAGGACTGTGGATGGAAAGGATGGGGAGAGATATCGTATGGAGGACGTGAATGTGTTCTGCGTGCTAAGGCTGCAGAATATCTGGTTCGGTCG
TGGATGGAAGAACAGTTGATTGGTATTAATCAGCATATAGTTTCTTACACAATTGGACTCGACAGCCTTAAAGCATCCAGCAATAGTAGCAATAGTGTTGAAGATATTAG
GTTGCGCATGGATGGTCTCTTCAAGCAGAAGGAGCACGCTCTCCTGTTTGTTAGAGAATTTACAGCTTTATACACAAATGGGCCAGCTGGTGGTGGAGGCATCAGCACTG
GCTACAAGAAAGAAATTGTGCTTGATAAACAACTGATATTTGGATCAACAGTATTTCATTTTCTCCATCGAGTTCTTTTAATAACTTGTGGATGTTCTGGAGTTATGATT
TGTGAATATTGCTGTAATCTCTCGTCTTCTTCTTCTTCTTCTTCTTCGTGTGTTGGGCGTGAGAATATTTTCTGGCAAACAGGAGTGAAGTGCACTGAAGCAGTAAAATT
CAACAGACAACCAACAGATCTTCGAAAGGATCCAGCAGAGGAATGTTCTTCGCCCCGAGTAACATTGCCATGTCCGATAACTGCTTATGCCGAGAAACCTTGTTCAGGCT
CCTTTCCACCAGAAACGGGTCATTCCCCTTTTCCATCTGGCCAGGAGATTGCGCTGTACAATGTAGCCCATAGCAGAGCTGGAGACAAAGGGAATGACTTGAACTTCTCT
GTCATTCCTCATTATCCTTCCGATATTGAGCGATTGAAGATGATCATCACGCCTGAATGGGTGATGAGAGTTCTCTCGGTTCTGCATAATTCGACTCTGTTTCCTTCTTC
GGATGCCGATAAGAAGAGAGACGAGGGGGTAGATGAACATGTGAAGGTTGAAATATACGAAGTTAAAGGTATCCATTCTTTGAATGTTGTTGTTCGTAACATTCTAGACG
GTGGCGTAAATTGCTCGCGGAGAATGGATCGCCATGGAAAGACCATATCGGATCTCATCTTGAACCAGCAAATTGTTTTGCCACCATAG
mRNA sequenceShow/hide mRNA sequence
TACTTATCTAACTTATCTCGTAATTATGCGTTTTCTCCTTCTGTCAAGACAGGAAAGGGCGTGGCTAAAGCTTCTCCGTCTGCCATTTTTGCGTGCGAAGCCATGCAAAT
GTGCAGTGTCCCAATTCGAACCCCCTCCTGGTTTTCCACTCGAAAGCTCTTCGAGCAGAAGCTTTCAGATCTCCACAAGTGTACAGACCTCAACCAAGTGAAGCAAATCC
ACGCTCAAATCCTCAAATCCAATCTCCACCTCGACCTCTATGTTGTTCCCAAACTCATATCTGCCTTCTCCCTTTCTCGCCAGATGCCTCTCGCCACCAACACTTTCAAT
CAAGTTCAATATCCAAACGTCCATTTGTACAACACTATGATTCGAGCCCACACCCATAACTCACAACCTTCACAAGCCTTCGCCACTTTCTTTGCTATGCAATGTGATGG
ATTTTACCCCGATAATTTCACTTTCCCGTTTCTTTTGAAAGCTTGTACTGGGAATGCATGGTTGCCTGTTGTTCAAATGGTGCATGCCCAAATCGAGAAATTTGGGTTCA
TGTCGGATGTATTCGTGCCAAATTCTCTTATTGATTCATATTCCAAATGTGGGTCTTGTGGAATTTCAGCAGCAAAGAAGTTGTTTGTGTCAATGGGAGCTTGTAGGGAT
GTTGTGTCATGGAACTCAATGATCTCTGGATTTGCGAAGGGTGGGTTATATGAAGAAGCTCGAAAGGTGTTCGATGAAATGCCTGAAAGGGATGGTATTAGTTGGAACAC
AATGTTGGATGGGTATGTTAAAGTTGGGAAAATGGATGATGCGTTTAAACTGTTTGATGAAATGCCTGAGAGGAATGTTGTCTCTTGGTCGACAATGGTGTTAGGGTATT
GCAAGGCAGGGGATATGGAGATGGCACGAATGTTGTTTGATAAAATGCCTGTGAAGAATTTGGTTTCTTGGACCATAATTATCTCTGGGTTTGCTGAGAAAGGGCTAGCT
AGGGAGGCCATTGGTTTGTTTGATCAAATGGAAAAGGCTCGCTTGAAATTAGACAATGGGACGGTAATAGGTATTTTGGCTGCTTGTGCTGAGTCTGGTTTGCTTGGGCT
TGGTGAGAGAATACATGCTTCCATTAAGAACAACAATTTGAAATGTACTACTGAAATCTCCAATGCTTTGGTTGATATGTATGCAAAATGTGGTAGGTTGAATATTGCTT
ACAATGTTTTTAATGACATAAAAAATAAAGATGTCGTGTCTTGGAATGCTATGCTTCAAGGGCTAGCAATGCATGGACATGGAGTGAAAGCACTCGAGCTTTTCAAAAGA
ATGAAAGAAGAGGGTTTCTCACCTGACAAAGTTACAATGATCGGAGTCTTGTGTGCTTGTACGCATGCAGGATTGATCGACGATGGCATTCAATACTTCTCTACAATGGA
AAGGGACTACACCCTTGTTCCTGAAGTTGAGCATTATGGTTGCATGGTAGACCTTTTGGGTCGCAAGGGAAGGCTTGAGGAAGCCGTAAGGCTCATTCGCAGCATGCCAA
TGGAACCAAATGTCATCATTTGGGGAACCCTTTTAGGGGCATGTCGTATGCATAATGCTGTTGAACTTGCGAGGGAGGTTCTTGATCATTTGGTTAAGCTAGAGCCATCT
GATTCAGGTAATTTGTCCATGTTGTCGAACATATATGCTGCGGCAGGGGACTGGGATTGTGTCGCTGACACGAGGTTAAGAATGCGGAGTATTGGAACTCAAAAACCGTC
GGGTGCTAGTTCCATTGAGGTCGACAATGAGGTTCATGAATTTACAGTATTTGATCGATCGCATCCAAAGTCTGATAATATATATCAAGTGCTAATGGAGAGGCTGGGTA
AAGATGACATCCATGACTGCACAATTAAGCTGAGAGTAAATCCTCAAAAACAGAGAGACAAGGTGTACATTGGCTGTGGTGCTGGATTTGGAGGCGATAGGCCAACAGCA
GCTCTTAAATTGCTTCAAAGGGTCAAAACCCTAAACTATCTCGTACTTGAATGCCTAGCAGAACGCACTCTTGCAGATCGCTGTCAAGTTATGTTGTCTGGTGGTGATGG
TTATGATTCAAGGAATTGGATGAAATTGCTTCTTCCATTGGCTATAAAGAGAAATATTTGCATAATTACCAACATGGGTGCAATGGACCCCCCTGGGGCTCAGCGAAACG
TTATAGAAATTGCAGACAGTCTGGGGTTGAATGTTTCAGTTGCAGTTGCTTATGAGGTTTCAGTAAAAGAACCAGAGGTTTTATGGATTATCTGTCCAGTAAGGATTGAT
TTCCAAGAATTGTTCTCGTGCCCTCCCAAATGGTGGCCAAGGATAAGCACGTATATGGGAGCAGCTCCTATTGTTGAGTGTCTGGAAAAGTACCATCCAAATGTCATAAT
TACTTCACGTGTCGCAGATGCTGCCCTATTCTTGGCTCCAATGGTAGGAAAAGCGATTCATACTTATGTTTCTTTTTTCAATACAACCTTTGTTTCCTTTAGAAACAATG
CTGTTTTAAAGGAGAATATCAGTCACTTTAGAGAAGTTGCTTTGTGCCTCCTTTGGGGGACGGAAAACTTTGTGGATTTTAGAGATTTCAGCCTCTTTTATGGGCTTCAA
CCGAGGAGTTACAGAAAATATCTCCAGTTGATTGTGATAGAAGAAATGTTAGATTTTGTCTATGAACTTGGTTGGAACTGGGATGATCTTCCACGGCTAGCACAGGGAAT
ACTGGCTGGTCATCTTCTGGAATGTGGCTGTCAACTTACAGGGGGATACTTTATGCATCCAGGTCTACCTCTCCTTCATACCCAATTCTTTGTATGTGGGGACAAGTATA
GAAGCATGTCTTTCCAACAGCTTCTGAATATATCACTGCCTTATGCGGAAGTTGAGTATGATGGAAAAGTCACTGTAGCCAAGGCAGAAGAGACTGGAGGTCTTTTGAAT
TTCAGTACATGTGCCGAACAACTTCTGTATGAGGTTGGTGATCCATCGGCTTATATCACCCCTGATATGGTGGTTGACTTCAGCAATGTTTCGTTTTACTCTATATCCAG
TTCTAGGGTTTTATGTTCCGGAGCTAAACCATCTATTCAAGGAGTGCCTGAGAAACTCTTGCAGTTGGCTCCAAAGGTATACAAGTCTATTAGAGTAAAAAGTTCCGAAC
TTGGAAATCTAACTTCTCAGGACTGTGGATGGAAAGGATGGGGAGAGATATCGTATGGAGGACGTGAATGTGTTCTGCGTGCTAAGGCTGCAGAATATCTGGTTCGGTCG
TGGATGGAAGAACAGTTGATTGGTATTAATCAGCATATAGTTTCTTACACAATTGGACTCGACAGCCTTAAAGCATCCAGCAATAGTAGCAATAGTGTTGAAGATATTAG
GTTGCGCATGGATGGTCTCTTCAAGCAGAAGGAGCACGCTCTCCTGTTTGTTAGAGAATTTACAGCTTTATACACAAATGGGCCAGCTGGTGGTGGAGGCATCAGCACTG
GCTACAAGAAAGAAATTGTGCTTGATAAACAACTGATATTTGGATCAACAGTATTTCATTTTCTCCATCGAGTTCTTTTAATAACTTGTGGATGTTCTGGAGTTATGATT
TGTGAATATTGCTGTAATCTCTCGTCTTCTTCTTCTTCTTCTTCTTCGTGTGTTGGGCGTGAGAATATTTTCTGGCAAACAGGAGTGAAGTGCACTGAAGCAGTAAAATT
CAACAGACAACCAACAGATCTTCGAAAGGATCCAGCAGAGGAATGTTCTTCGCCCCGAGTAACATTGCCATGTCCGATAACTGCTTATGCCGAGAAACCTTGTTCAGGCT
CCTTTCCACCAGAAACGGGTCATTCCCCTTTTCCATCTGGCCAGGAGATTGCGCTGTACAATGTAGCCCATAGCAGAGCTGGAGACAAAGGGAATGACTTGAACTTCTCT
GTCATTCCTCATTATCCTTCCGATATTGAGCGATTGAAGATGATCATCACGCCTGAATGGGTGATGAGAGTTCTCTCGGTTCTGCATAATTCGACTCTGTTTCCTTCTTC
GGATGCCGATAAGAAGAGAGACGAGGGGGTAGATGAACATGTGAAGGTTGAAATATACGAAGTTAAAGGTATCCATTCTTTGAATGTTGTTGTTCGTAACATTCTAGACG
GTGGCGTAAATTGCTCGCGGAGAATGGATCGCCATGGAAAGACCATATCGGATCTCATCTTGAACCAGCAAATTGTTTTGCCACCATAGTTTGGTGGTTCACACAAATCA
CACAGAAGAGAGCTAAAATTTCTGAACCAGAAGGCTCTTCTAGGCAATAAAGTATCTTGCCTTCTGCCGTTGGATTACTTTACCTTGTAATTCTCAACTCCAACTTGTGC
TTCTAATGCCTTCAAAATCTTTTGCGATCAAGTACTGATTATCCTTAAATTTCAAAACTTTTCGGATTAATTAATTATTCCATATATTCTCACATTTCTCTTTGCTTAAA
GATTTAG
Protein sequenceShow/hide protein sequence
YLSNLSRNYAFSPSVKTGKGVAKASPSAIFACEAMQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFN
QVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRD
VVSWNSMISGFAKGGLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLA
REAIGLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKR
MKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPS
DSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERLGKDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTA
ALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRNWMKLLLPLAIKRNICIITNMGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVKEPEVLWIICPVRID
FQELFSCPPKWWPRISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVGKAIHTYVSFFNTTFVSFRNNAVLKENISHFREVALCLLWGTENFVDFRDFSLFYGLQ
PRSYRKYLQLIVIEEMLDFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGLPLLHTQFFVCGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLN
FSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVLCSGAKPSIQGVPEKLLQLAPKVYKSIRVKSSELGNLTSQDCGWKGWGEISYGGRECVLRAKAAEYLVRS
WMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLIFGSTVFHFLHRVLLITCGCSGVMI
CEYCCNLSSSSSSSSSCVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPETGHSPFPSGQEIALYNVAHSRAGDKGNDLNFS
VIPHYPSDIERLKMIITPEWVMRVLSVLHNSTLFPSSDADKKRDEGVDEHVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRMDRHGKTISDLILNQQIVLPP