; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G001680 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G001680
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCG_Chr05:1653007..1663843
RNA-Seq ExpressionClCG05G001680
SyntenyClCG05G001680
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR010839 - Acyclic terpene utilisation
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK17586.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0089.66Show/hide
Query:  MQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARK
        MQ DGFYPDNFTFPFLLK CTGN WLPVV+ VHAQIEKFGFMSDVFVPNSLIDSYSKCGS GISAAKKLFVSMGA RDVVSWNSMISG AKGGLYEEARK
Subjt:  MQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARK

Query:  VFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARVLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEK
        VFDEMP+RDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAG MEMAR+LFDKMPVKNLVSWTII+SGFAEKGLAREAI LFDQMEK
Subjt:  VFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARVLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEK

Query:  ARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMK
        A LKLDNGT+I IL ACAESGLLGLGE+IHASIKNNN KCTTEISNALVDMYAKCGRLNIAY+VF+DIKNKDVVSWNAMLQGLAMHGHG+KALELFK+MK
Subjt:  ARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMK

Query:  EEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLD
        EEGFSP++VTMIGVLCACTHAGLIDDGI+YFSTMERDY LVPEVEHYGCMVDLLGRKGRLEEA+RLIR+MPM PN IIWGTLLGACRMHNAVELAREVLD
Subjt:  EEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLD

Query:  HLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERQGKDDIHDCTIKLRVNPQKQR
        HLV+LEPSDSGNLSMLSNIYAAAGDW+CVA+TRLRMRSIGT+KPSGASSIEVDNE                   VLMER  + DIHDCTIKLRVNP+KQR
Subjt:  HLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERQGKDDIHDCTIKLRVNPQKQR

Query:  DKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKRNICIITNMGAMDPPGAQRNVIEIADS
        DKV IGCGAGFGGDRPTAALKLLQRVK LNYLVLECLAERTLAD  QVMLSGGDGYDSRIA+WMKLLLPL++KRNICIITNMGAMDP  AQ+ VIE+A S
Subjt:  DKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKRNICIITNMGAMDPPGAQRNVIEIADS

Query:  LGLNVSVAVAYEVSVKEPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGDKYRS
        LGLNVSVAVAYE SVKE GISTYMG APIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDD P LAQGILAGHLLECGCQLTGGYFMHPGDKYRS
Subjt:  LGLNVSVAVAYEVSVKEPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGDKYRS

Query:  MSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVLCSGAKPSIQGVPGKLLQLAPKDCG
        MSFQQLLNISLPYAEVE DGK+TVAK EE+GGLLNFSTCAEQLLYE+GDPSAYITPD+VVDFSNVSF SISSSRV+CSGAKPSIQGVP KLLQLAPKDCG
Subjt:  MSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVLCSGAKPSIQGVPGKLLQLAPKDCG

Query:  WKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFKQKGHALLFVREFTALYTNGPAGGGGIS
        WKGWGEISYGGRECVLRAKAAEYLVRSWMEE LIGIN+HIVSYTIGLDSLKASSNSSN +EDIRLRMDGLF+QK HALLFV+EFTALYTNGPAGGGGIS
Subjt:  WKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFKQKGHALLFVREFTALYTNGPAGGGGIS

XP_004134329.1 uncharacterized protein LOC101212841 isoform X2 [Cucumis sativus]0.0e+0089.54Show/hide
Query:  MERQGKDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKRNI
        ME  G+ DIHDCTIKLRVNPQKQRDKV IGCGAGFGGDRPTAALKLLQRVK LNYLVLECLAERTLAD  QVMLSGGDGYD RIADWMKLLLPLA+KRNI
Subjt:  MERQGKDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKRNI

Query:  CIITNMGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVKEPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQGILA
        CIITNMGAMDPP AQ+NVIE+A SLGLNVSVAVAYE SVKE GISTYMG APIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDD P LAQGILA
Subjt:  CIITNMGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVKEPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQGILA

Query:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVL
        GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVE DGK+TVAK EE+GGLLNFSTCAEQLLYE+G+PSAYITPD+VVDFSNVSF SISSSRVL
Subjt:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVL

Query:  CSGAKPSIQGVPGKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFKQKGH
        CSGAKPSIQGVP KLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEE LIGIN+HIVSYTIGLDSLKASSN SN VEDIRLRMDGLF+QK H
Subjt:  CSGAKPSIQGVPGKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFKQKGH

Query:  ALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPETGH
        ALLFV+EFTALYTNGPAGGGGISTGYKKEIVL+KQLVGRENIFWQT V CTEAVK + Q TDL+KDPAE CSSPRVTLPCPI+ +A++ C+GS PPE GH
Subjt:  ALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPETGH

Query:  SPFPSGQEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNLTLFPSSDADKKRDEVVDEHVKVEIYEVKGIHSLNVVVRNI
        SP PSGQEIALYNVAHSRAGDKGNDLNFS+IPH PSDIERLKMIITPEWVMRVLSVLHN T F SS+AD+KR+E V E VKVEIYEVKGIHSLNVVVRNI
Subjt:  SPFPSGQEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNLTLFPSSDADKKRDEVVDEHVKVEIYEVKGIHSLNVVVRNI

Query:  LDGGVNCSRRIDRHGKTISDLILNQQIVLPP
        LDGGVNCSRRIDRHGKTISDLILNQ IVLPP
Subjt:  LDGGVNCSRRIDRHGKTISDLILNQQIVLPP

XP_008438065.1 PREDICTED: uncharacterized protein LOC103483286 [Cucumis melo]0.0e+0089.22Show/hide
Query:  MERQGKDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKRNI
        MER  + DIHDCTIKLRVNP+KQRDKV IGCGAGFGGDRPTAALKLLQRVK LNYLVLECLAERTLAD  QVMLSGGDGYDSRIA+WMKLLLPL++KRNI
Subjt:  MERQGKDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKRNI

Query:  CIITNMGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVKEPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQGILA
        CIITNMGAMDP  AQ+ VIE+A SLGLNVSVAVAYE SVKE GISTYMG APIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDD P LAQGILA
Subjt:  CIITNMGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVKEPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQGILA

Query:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVL
        GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVE DGK+TVAK EE+GGLLNFSTCAEQLLYE+GDPSAYITPD+VVDFSNVSF SISSSRV+
Subjt:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVL

Query:  CSGAKPSIQGVPGKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFKQKGH
        CSGAKPSIQGVP KLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEE LIGIN+HIVSYTIGLDSLKASSNSSN +EDIRLRMDGLF+QK H
Subjt:  CSGAKPSIQGVPGKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFKQKGH

Query:  ALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPETGH
        ALLFV+EFTALYTNGPAGGGGISTGYKKEIVL+KQLVGRENIFWQT VKC+EAVK + Q TDL+KDPAE CSSPRVTLPCPI+++AEK C+GSFPPETGH
Subjt:  ALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPETGH

Query:  SPFPSGQEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNLTLFPSSDADKKRDEVVDEHVKVEIYEVKGIHSLNVVVRNI
        SP PSGQEIALY+VAHSRAGDKGNDLNFS+IPHYPSDIERLKMIITPEWVMRVLS LHNLT F SS+A +KR+E V+E VKVEIYEVK IHSLNVVVRNI
Subjt:  SPFPSGQEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNLTLFPSSDADKKRDEVVDEHVKVEIYEVKGIHSLNVVVRNI

Query:  LDGGVNCSRRIDRHGKTISDLILNQQIVLPP
        LDGGVNCSRRIDRHGKTISDLILNQ IVLPP
Subjt:  LDGGVNCSRRIDRHGKTISDLILNQQIVLPP

XP_038900159.1 uncharacterized protein LOC120087281 isoform X1 [Benincasa hispida]0.0e+0091.28Show/hide
Query:  MERQGKDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKRNI
        MERQGKDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVK+LNYL+LECLAERTLADR QVMLSGGDGYDSRIADWMKLLLPLA+KRNI
Subjt:  MERQGKDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKRNI

Query:  CIITNMGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVKEPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQGILA
        CIITNMGAMDPP AQRNVIEIA SLGLNVSVAVAYEVSVKEPGISTY+GAAPIVECLEKYHPNVIITSR+ADAALFLAPMVYELGWNWDD PRLAQGILA
Subjt:  CIITNMGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVKEPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQGILA

Query:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVL
        GHLLECGCQLTGGYFMHPGDKYRSMS QQLLNISLPYAE+E DG++TVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSF SISSSRV 
Subjt:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVL

Query:  CSGAKPSIQGVPGKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFKQKGH
        CSGAKPSIQG P KLLQLAPKDCGWKGWGEISYGGRECVLRAKAA+YLVRSW+EE LIG+NQ IVSYTIGLDSLKAS+NSS SVEDIRLRMDGLFKQK H
Subjt:  CSGAKPSIQGVPGKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFKQKGH

Query:  ALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPETGH
        ALLFVREFTALYTNGPAGGGGISTGYKKEIVL+KQLVGRENIFWQTGVKCTEAVK + QP D R+DPAE  SSP+V LPCPI+A AEKP  G FPPE GH
Subjt:  ALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPETGH

Query:  SPFPSGQEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNLTLFPSSDADKKRDEVVDEHVKVEIYEVKGIHSLNVVVRNI
        SP PS QEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERLKM+ITPEWV RVLSVLHN T FPSSDA+KKRDE VDE VKVEIYEV+GIHSLNVVVRNI
Subjt:  SPFPSGQEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNLTLFPSSDADKKRDEVVDEHVKVEIYEVKGIHSLNVVVRNI

Query:  LDGGVNCSRRIDRHGKTISDLILNQQIVLPP
        LDGGVNCSRRIDRHGK ISDLILNQ IVLPP
Subjt:  LDGGVNCSRRIDRHGKTISDLILNQQIVLPP

XP_038900175.1 pentatricopeptide repeat-containing protein At3g29230-like isoform X2 [Benincasa hispida]0.0e+0094.86Show/hide
Query:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPS
        MQMCSVPIR PSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLH+DLYVVPKLISAFSL RQMPLAT+ FNQVQYPNVHLYNTMIRAHTHNSQPS
Subjt:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPS

Query:  QAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKG
        QAFATFFAMQCDGFYPDNFTFPFLLKACTGN WL VV+MVHAQIEKFGFMSDVFVPNSLIDSYSKCGS GISAAKKLFVSMGACRDVVSWNSMISGFAKG
Subjt:  QAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKG

Query:  GLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARVLFDKMPVKNLVSWTIIISGFAEKGLAREAI
        GLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMAR+LFDKMPVKNLVSWTIIISGFAEKGLAREA+
Subjt:  GLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARVLFDKMPVKNLVSWTIIISGFAEKGLAREAI

Query:  GLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKA
        GLF+QMEKA LK D+GTVI IL+ACAESGLLGLGE+IHASIKNNN KCT EISNALV+MYAKCGRLNIAY VFNDIKNKDVVSWNAMLQGLAMHGHGVKA
Subjt:  GLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKA

Query:  LELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAV
        LELFKRMKEEGFSPDK+TMIGVLCACTHAGLIDDGIQYFSTMERDY LVPEVEHYGCMVDLLGRKGRLEEAVRLIR+MPMEPNVIIWG LLGACRMHNAV
Subjt:  LELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAV

Query:  ELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL
        ELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVAD RLRMRS GTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIY V+
Subjt:  ELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL

TrEMBL top hitse value%identityAlignment
A0A0A0L7H7 Uncharacterized protein0.0e+0091.11Show/hide
Query:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPS
        MQMCSVPIRTPSWFSTRKL EQKLSDLHKCT+LNQVKQ+HAQILKSNLH+DL+VVPKLISAFSL RQM LATN FNQVQYPNVHLYNTMIRAH+HNSQPS
Subjt:  MQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPS

Query:  QAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKG
        QAFATFFAMQ DG Y DNFTFPFLLK CTGN WLPV++ VHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGA RDVVSWNSMISG AKG
Subjt:  QAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKG

Query:  GLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARVLFDKMPVKNLVSWTIIISGFAEKGLAREAI
        GLYEEARKVFDEMPE+DGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMAR+LFDKMPVKNLVSWTII+SGFAEKGLAREAI
Subjt:  GLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARVLFDKMPVKNLVSWTIIISGFAEKGLAREAI

Query:  GLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKA
         LFDQMEKA LKLDNGTV+ ILAACAESGLLGLGE+IHASIKNNN KCTTEISNALVDMYAKCGRLNIAY+VFNDIKNKDVVSWNAMLQGLAMHGHGVKA
Subjt:  GLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKA

Query:  LELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAV
        LELFKRMKEEGFSP+KVTMIGVLCACTHAGLIDDGI+YFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEA+RLIR+MPM PN IIWGTLLGACRMHNAV
Subjt:  LELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAV

Query:  ELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERQGKDDIHDCTIKL
        ELAREVLDHLV+LEP+DSGN SMLSNIYAAAGDW+CVA+TRLRMRSIGT+KPSGASSIEV+NEVHEFTVFDRSHPKSDNIYQVLME  G+ DIHDCTIKL
Subjt:  ELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERQGKDDIHDCTIKL

Query:  RVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKRNICIITNMGAMDPPGAQR
        RVNPQKQRDKV IGCGAGFGGDRPTAALKLLQRVK LNYLVLECLAERTLAD  QVMLSGGDGYD RIADWMKLLLPLA+KRNICIITNMGAMDPP AQ+
Subjt:  RVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKRNICIITNMGAMDPPGAQR

Query:  NVIEIADSLGLNVSVAVAYEVSVKEPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFM
        NVIE+A SLGLNVSVAVAYE SVKE GISTYMG APIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDD P LAQGILAGHLLECGCQLTGGYFM
Subjt:  NVIEIADSLGLNVSVAVAYEVSVKEPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFM

Query:  HPGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVLCSGAKPSIQGVPGKLL
        HPGDKYRSMSFQQLLNISLPYAEVE DGK+TVAK EE+GGLLNFSTCAEQLLYE+G+PSAYITPD+VVDFSNVSF SISSSRVLCSGAKPSIQGVP KLL
Subjt:  HPGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVLCSGAKPSIQGVPGKLL

Query:  QLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFKQKGHALLFVREFTALYTNGP
        QLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEE LIGIN+HIVSYTIGLDSLKASSN SN VEDIRLRMDGLF+QK HALLFV+EFTALYTNGP
Subjt:  QLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFKQKGHALLFVREFTALYTNGP

Query:  AGGGGISTGYKKEIVLDKQLVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPETGHSPFPSGQEIALYNVAH
        AGGGGISTGYKKEIVL+KQLVGRENIFWQT V CTEAVK + Q TDL+KDPAE CSSPRVTLPCPI+ +A++ C+GS PPE GHSP PSGQEIALYNVAH
Subjt:  AGGGGISTGYKKEIVLDKQLVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPETGHSPFPSGQEIALYNVAH

Query:  SRAGDKGNDLNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNLTLFPSSDADKKRDEVVDEHVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGK
        SRAGDKGNDLNFS+IPH PSDIERLKMIITPEWVMRVLSVLHN T F SS+AD+KR+E V E VKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGK
Subjt:  SRAGDKGNDLNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNLTLFPSSDADKKRDEVVDEHVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGK

Query:  TISDLILNQQIVLPP
        TISDLILNQ IVLPP
Subjt:  TISDLILNQQIVLPP

A0A1S3AV50 uncharacterized protein LOC1034832860.0e+0089.22Show/hide
Query:  MERQGKDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKRNI
        MER  + DIHDCTIKLRVNP+KQRDKV IGCGAGFGGDRPTAALKLLQRVK LNYLVLECLAERTLAD  QVMLSGGDGYDSRIA+WMKLLLPL++KRNI
Subjt:  MERQGKDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKRNI

Query:  CIITNMGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVKEPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQGILA
        CIITNMGAMDP  AQ+ VIE+A SLGLNVSVAVAYE SVKE GISTYMG APIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDD P LAQGILA
Subjt:  CIITNMGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVKEPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQGILA

Query:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVL
        GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVE DGK+TVAK EE+GGLLNFSTCAEQLLYE+GDPSAYITPD+VVDFSNVSF SISSSRV+
Subjt:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVL

Query:  CSGAKPSIQGVPGKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFKQKGH
        CSGAKPSIQGVP KLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEE LIGIN+HIVSYTIGLDSLKASSNSSN +EDIRLRMDGLF+QK H
Subjt:  CSGAKPSIQGVPGKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFKQKGH

Query:  ALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPETGH
        ALLFV+EFTALYTNGPAGGGGISTGYKKEIVL+KQLVGRENIFWQT VKC+EAVK + Q TDL+KDPAE CSSPRVTLPCPI+++AEK C+GSFPPETGH
Subjt:  ALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPETGH

Query:  SPFPSGQEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNLTLFPSSDADKKRDEVVDEHVKVEIYEVKGIHSLNVVVRNI
        SP PSGQEIALY+VAHSRAGDKGNDLNFS+IPHYPSDIERLKMIITPEWVMRVLS LHNLT F SS+A +KR+E V+E VKVEIYEVK IHSLNVVVRNI
Subjt:  SPFPSGQEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNLTLFPSSDADKKRDEVVDEHVKVEIYEVKGIHSLNVVVRNI

Query:  LDGGVNCSRRIDRHGKTISDLILNQQIVLPP
        LDGGVNCSRRIDRHGKTISDLILNQ IVLPP
Subjt:  LDGGVNCSRRIDRHGKTISDLILNQQIVLPP

A0A5D3D3E7 Pentatricopeptide repeat-containing protein0.0e+0089.66Show/hide
Query:  MQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARK
        MQ DGFYPDNFTFPFLLK CTGN WLPVV+ VHAQIEKFGFMSDVFVPNSLIDSYSKCGS GISAAKKLFVSMGA RDVVSWNSMISG AKGGLYEEARK
Subjt:  MQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARK

Query:  VFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARVLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEK
        VFDEMP+RDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAG MEMAR+LFDKMPVKNLVSWTII+SGFAEKGLAREAI LFDQMEK
Subjt:  VFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARVLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEK

Query:  ARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMK
        A LKLDNGT+I IL ACAESGLLGLGE+IHASIKNNN KCTTEISNALVDMYAKCGRLNIAY+VF+DIKNKDVVSWNAMLQGLAMHGHG+KALELFK+MK
Subjt:  ARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMK

Query:  EEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLD
        EEGFSP++VTMIGVLCACTHAGLIDDGI+YFSTMERDY LVPEVEHYGCMVDLLGRKGRLEEA+RLIR+MPM PN IIWGTLLGACRMHNAVELAREVLD
Subjt:  EEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLD

Query:  HLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERQGKDDIHDCTIKLRVNPQKQR
        HLV+LEPSDSGNLSMLSNIYAAAGDW+CVA+TRLRMRSIGT+KPSGASSIEVDNE                   VLMER  + DIHDCTIKLRVNP+KQR
Subjt:  HLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERQGKDDIHDCTIKLRVNPQKQR

Query:  DKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKRNICIITNMGAMDPPGAQRNVIEIADS
        DKV IGCGAGFGGDRPTAALKLLQRVK LNYLVLECLAERTLAD  QVMLSGGDGYDSRIA+WMKLLLPL++KRNICIITNMGAMDP  AQ+ VIE+A S
Subjt:  DKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKRNICIITNMGAMDPPGAQRNVIEIADS

Query:  LGLNVSVAVAYEVSVKEPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGDKYRS
        LGLNVSVAVAYE SVKE GISTYMG APIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDD P LAQGILAGHLLECGCQLTGGYFMHPGDKYRS
Subjt:  LGLNVSVAVAYEVSVKEPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGDKYRS

Query:  MSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVLCSGAKPSIQGVPGKLLQLAPKDCG
        MSFQQLLNISLPYAEVE DGK+TVAK EE+GGLLNFSTCAEQLLYE+GDPSAYITPD+VVDFSNVSF SISSSRV+CSGAKPSIQGVP KLLQLAPKDCG
Subjt:  MSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVLCSGAKPSIQGVPGKLLQLAPKDCG

Query:  WKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFKQKGHALLFVREFTALYTNGPAGGGGIS
        WKGWGEISYGGRECVLRAKAAEYLVRSWMEE LIGIN+HIVSYTIGLDSLKASSNSSN +EDIRLRMDGLF+QK HALLFV+EFTALYTNGPAGGGGIS
Subjt:  WKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFKQKGHALLFVREFTALYTNGPAGGGGIS

A0A6J1IGN9 uncharacterized protein LOC111474742 isoform X10.0e+0086.3Show/hide
Query:  YQVLMERQGKDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAI
        + +LMERQG+DD+HDCTIKLRVNP+K+RDKVYIGCGAGFGGDRPTAALKLLQRVK LNYLVLECLAERTLADR Q M SGGDGYDSRIADWMKLLLPLA+
Subjt:  YQVLMERQGKDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAI

Query:  KRNICIITNMGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVKEPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQ
        KRNICIITNMGAMDPPGAQ+NVIEIA SLGL+VSVAVAYEVSVKE GISTY+GAAPIV+CLEKYHPNVIITSRVADAALF+APMVYELGWNWDD PRL+Q
Subjt:  KRNICIITNMGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVKEPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQ

Query:  GILAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISS
        G LAGHLLECGCQLTGGYFMHPGDK+RSM FQQLL+ISLPYAE++ DGKV VAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPD+VVD SNVSF SISS
Subjt:  GILAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISS

Query:  SRVLCSGAKPSIQGVPGKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFK
        S+V CSGAKPSIQ VP KLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEE L G+NQHIVSY IGLDSLKAS NSS SVEDIRLRMDGLF+
Subjt:  SRVLCSGAKPSIQGVPGKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFK

Query:  QKGHALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPP
         K HALLFVREFTALYTNGPAGGGGISTGYKKEIVL+KQLVGRE++FW+ GVKCT+AV+ + +PTDLR+DPA+  +SPRVTLPC I AYA+ PC+ S  P
Subjt:  QKGHALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPP

Query:  ETGHSPFPSGQEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNLTLFPSSDADKKRDEVVDEHVKVEIYEVKGIHSLNVV
        ETGHSP PSGQ++ALYNVAHSRAGDKGND+NFSV+PHYPSDIERLKMIITPEWV RVLS L N + F   DADKKRDE V+EHVKVEIYEVKGIHSLNVV
Subjt:  ETGHSPFPSGQEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNLTLFPSSDADKKRDEVVDEHVKVEIYEVKGIHSLNVV

Query:  VRNILDGGVNCSRRIDRHGKTISDLILNQQIVLPP
        VRNILDGGVNCSRRIDRHGKTISDL+LNQQ+VLPP
Subjt:  VRNILDGGVNCSRRIDRHGKTISDLILNQQIVLPP

A0A6J1IJ63 uncharacterized protein LOC111474742 isoform X30.0e+0086.57Show/hide
Query:  VLMERQGKDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKR
        +LMERQG+DD+HDCTIKLRVNP+K+RDKVYIGCGAGFGGDRPTAALKLLQRVK LNYLVLECLAERTLADR Q M SGGDGYDSRIADWMKLLLPLA+KR
Subjt:  VLMERQGKDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKR

Query:  NICIITNMGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVKEPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQGI
        NICIITNMGAMDPPGAQ+NVIEIA SLGL+VSVAVAYEVSVKE GISTY+GAAPIV+CLEKYHPNVIITSRVADAALF+APMVYELGWNWDD PRL+QG 
Subjt:  NICIITNMGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVKEPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQGI

Query:  LAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSR
        LAGHLLECGCQLTGGYFMHPGDK+RSM FQQLL+ISLPYAE++ DGKV VAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPD+VVD SNVSF SISSS+
Subjt:  LAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSR

Query:  VLCSGAKPSIQGVPGKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFKQK
        V CSGAKPSIQ VP KLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEE L G+NQHIVSY IGLDSLKAS NSS SVEDIRLRMDGLF+ K
Subjt:  VLCSGAKPSIQGVPGKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVEDIRLRMDGLFKQK

Query:  GHALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPET
         HALLFVREFTALYTNGPAGGGGISTGYKKEIVL+KQLVGRE++FW+ GVKCT+AV+ + +PTDLR+DPA+  +SPRVTLPC I AYA+ PC+ S  PET
Subjt:  GHALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEKPCSGSFPPET

Query:  GHSPFPSGQEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNLTLFPSSDADKKRDEVVDEHVKVEIYEVKGIHSLNVVVR
        GHSP PSGQ++ALYNVAHSRAGDKGND+NFSV+PHYPSDIERLKMIITPEWV RVLS L N + F   DADKKRDE V+EHVKVEIYEVKGIHSLNVVVR
Subjt:  GHSPFPSGQEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNLTLFPSSDADKKRDEVVDEHVKVEIYEVKGIHSLNVVVR

Query:  NILDGGVNCSRRIDRHGKTISDLILNQQIVLPP
        NILDGGVNCSRRIDRHGKTISDL+LNQQ+VLPP
Subjt:  NILDGGVNCSRRIDRHGKTISDLILNQQIVLPP

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic1.7e-11837Show/hide
Query:  PSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLS--RQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFA
        P+  +T     + +S + +C  L Q+KQ H  ++++    D Y   KL +  +LS    +  A   F+++  PN   +NT+IRA+     P  +   F  
Subjt:  PSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLS--RQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFA

Query:  MQCDG-FYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEAR
        M  +   YP+ +TFPFL+KA    + L + Q +H    K    SDVFV NSLI  Y  CG   + +A K+F ++   +DVVSWNSMI+GF + G  ++A 
Subjt:  MQCDG-FYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEAR

Query:  KVFDEMPERD---------GI------------------------------SWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMA
        ++F +M   D         G+                                N MLD Y K G ++DA +LFD M E++ V+W+TM+ GY  + D E A
Subjt:  KVFDEMPERD---------GI------------------------------SWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMA

Query:  RVLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQME-KARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLN
        R + + MP K++V+W  +IS + + G   EA+ +F +++ +  +KL+  T++  L+ACA+ G L LG  IH+ IK + ++    +++AL+ MY+KCG L 
Subjt:  RVLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQME-KARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLN

Query:  IAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGR
         +  VFN ++ +DV  W+AM+ GLAMHG G +A+++F +M+E    P+ VT   V CAC+H GL+D+    F  ME +Y +VPE +HY C+VD+LGR G 
Subjt:  IAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGR

Query:  LEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEF
        LE+AV+ I +MP+ P+  +WG LLGAC++H  + LA      L++LEP + G   +LSNIYA  G W+ V++ R  MR  G +K  G SSIE+D  +HEF
Subjt:  LEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEF

Query:  TVFDRSHPKSDNIYQVLME
           D +HP S+ +Y  L E
Subjt:  TVFDRSHPKSDNIYQVLME

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.2e-12239.18Show/hide
Query:  LSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLS---RQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFT
        LS LH C  L  ++ IHAQ++K  LH   Y + KLI    LS     +P A + F  +Q PN+ ++NTM R H  +S P  A   +  M   G  P+++T
Subjt:  LSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLS---RQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFT

Query:  FPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPERDGIS
        FPF+LK+C  +      Q +H  + K G   D++V  SLI  Y + G   +  A K+F      RDVVS+ ++I G+A  G  E A+K+FDE+P +D +S
Subjt:  FPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPERDGIS

Query:  WNTMLDGYVKVGKMDDAFKLFDEMPERNV-VSWSTMV-------------LG-------------------------YCKAGDMEMARVLFDKMPVKNLV
        WN M+ GY + G   +A +LF +M + NV    STMV             LG                         Y K G++E A  LF+++P K+++
Subjt:  WNTMLDGYVKVGKMDDAFKLFDEMPERNV-VSWSTMV-------------LG-------------------------YCKAGDMEMARVLFDKMPVKNLV

Query:  SWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISN---ALVDMYAKCGRLNIAYNVFNDIKN
        SW  +I G+    L +EA+ LF +M ++    ++ T++ IL ACA  G + +G  IH  I +  LK  T  S+   +L+DMYAKCG +  A+ VFN I +
Subjt:  SWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISN---ALVDMYAKCGRLNIAYNVFNDIKN

Query:  KDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSM
        K + SWNAM+ G AMHG    + +LF RM++ G  PD +T +G+L AC+H+G++D G   F TM +DY + P++EHYGCM+DLLG  G  +EA  +I  M
Subjt:  KDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSM

Query:  PMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSD
         MEP+ +IW +LL AC+MH  VEL     ++L+K+EP + G+  +LSNIYA+AG W+ VA TR  +   G +K  G SSIE+D+ VHEF + D+ HP++ 
Subjt:  PMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSD

Query:  NIYQVLMERQ
         IY +L E +
Subjt:  NIYQVLMERQ

Q9LS72 Pentatricopeptide repeat-containing protein At3g292301.2e-22864.26Show/hide
Query:  SVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFA
        S+P+R PSW S+R++FE++L DL KC +LNQVKQ+HAQI++ NLH DL++ PKLISA SL RQ  LA   FNQVQ PNVHL N++IRAH  NSQP QAF 
Subjt:  SVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFA

Query:  TFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYE
         F  MQ  G + DNFT+PFLLKAC+G +WLPVV+M+H  IEK G  SD++VPN+LID YS+CG  G+  A KLF  M   RD VSWNSM+ G  K G   
Subjt:  TFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYE

Query:  EARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARVLFDKMPV--KNLVSWTIIISGFAEKGLAREAIGL
        +AR++FDEMP+RD ISWNTMLDGY +  +M  AF+LF++MPERN VSWSTMV+GY KAGDMEMARV+FDKMP+  KN+V+WTIII+G+AEKGL +EA  L
Subjt:  EARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARVLFDKMPV--KNLVSWTIIISGFAEKGLAREAIGL

Query:  FDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALE
         DQM  + LK D   VI ILAAC ESGLL LG RIH+ +K +NL     + NAL+DMYAKCG L  A++VFNDI  KD+VSWN ML GL +HGHG +A+E
Subjt:  FDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALE

Query:  LFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVEL
        LF RM+ EG  PDKVT I VLC+C HAGLID+GI YF +ME+ Y LVP+VEHYGC+VDLLGR GRL+EA++++++MPMEPNV+IWG LLGACRMHN V++
Subjt:  LFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVEL

Query:  AREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL
        A+EVLD+LVKL+P D GN S+LSNIYAAA DW+ VAD R +M+S+G +KPSGASS+E+++ +HEFTVFD+SHPKSD IYQ+L
Subjt:  AREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL

Q9SR82 Putative pentatricopeptide repeat-containing protein At3g088206.6e-11039.61Show/hide
Query:  CTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFTFPFLLKACT
        CT +N +KQIH  ++  +LH D ++V  L+      RQ   +   F+  Q+PN+ LYN++I    +N    +    F +++  G Y   FTFP +LKACT
Subjt:  CTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFTFPFLLKACT

Query:  GNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPER----DGISWNTML
          +   +   +H+ + K GF  DV    SL+  YS  GS  ++ A KLF  +   R VV+W ++ SG+   G + EA  +F +M E     D      +L
Subjt:  GNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPER----DGISWNTML

Query:  DGYVKVGKMDDA---FKLFDEMP-ERNVVSWSTMVLGYCKAGDMEMARVLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGI
           V VG +D      K  +EM  ++N    +T+V  Y K G ME AR +FD M  K++V+W+ +I G+A     +E I LF QM +  LK D  +++G 
Subjt:  DGYVKVGKMDDA---FKLFDEMP-ERNVVSWSTMVLGYCKAGDMEMARVLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGI

Query:  LAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIG
        L++CA  G L LGE   + I  +       ++NAL+DMYAKCG +   + VF ++K KD+V  NA + GLA +GH   +  +F + ++ G SPD  T +G
Subjt:  LAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIG

Query:  VLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNL
        +LC C HAGLI DG+++F+ +   Y L   VEHYGCMVDL GR G L++A RLI  MPM PN I+WG LL  CR+    +LA  VL  L+ LEP ++GN 
Subjt:  VLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNL

Query:  SMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL
          LSNIY+  G WD  A+ R  M   G +K  G S IE++ +VHEF   D+SHP SD IY  L
Subjt:  SMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL

Q9SY02 Pentatricopeptide repeat-containing protein At4g027505.9e-11139.17Show/hide
Query:  LISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMS--DVFV
        ++S ++ +  +  A + F+++   N   +N ++ A+  NS+  +A   F + +          +  +   C    ++   ++V A+ + F  M+  DV  
Subjt:  LISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMS--DVFV

Query:  PNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTM
         N++I  Y++ G   I  A++LF      +DV +W +M+SG+ +  + EEAR++FD+MPER+ +SWN ML GYV+  +M+ A +LFD MP RNV +W+TM
Subjt:  PNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTM

Query:  VLGYCKAGDMEMARVLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNA
        + GY + G +  A+ LFDKMP ++ VSW  +I+G+++ G + EA+ LF QME+   +L+  +    L+ CA+   L LG+++H  +     +    + NA
Subjt:  VLGYCKAGDMEMARVLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNA

Query:  LVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHY
        L+ MY KCG +  A ++F ++  KD+VSWN M+ G + HG G  AL  F+ MK EG  PD  TM+ VL AC+H GL+D G QYF TM +DY ++P  +HY
Subjt:  LVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHY

Query:  GCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGA
         CMVDLLGR G LE+A  L+++MP EP+  IWGTLLGA R+H   ELA    D +  +EP +SG   +LSN+YA++G W  V   R+RMR  G +K  G 
Subjt:  GCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGA

Query:  SSIEVDNEVHEFTVFDRSHPKSDNIYQVLME
        S IE+ N+ H F+V D  HP+ D I+  L E
Subjt:  SSIEVDNEVHEFTVFDRSHPKSDNIYQVLME

Arabidopsis top hitse value%identityAlignment
AT1G01770.1 unknown protein7.9e-23664.27Show/hide
Query:  KDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKRNICIITN
        K+ + DC I LR NP+++R+ VY+GCGAGFGGDRP AALKLLQRV+ LNYLVLECLAERTLADR   M SGG GYD R+++WM+LLLPLA++R  CIITN
Subjt:  KDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKRNICIITN

Query:  MGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVK-------------EPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLP
        MGA+DP GAQ+ V+E+A  LGL +SVAVA+EV  +               G STY+GAAPIVECLEKY PNVIITSRVADAALFLAPMVYELGWNW+DL 
Subjt:  MGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVK-------------EPGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLP

Query:  RLAQGILAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFY
         LAQG LAGHLLECGCQLTGGYFMHPGD+YR M+F  L ++SLPYAE+ YDGKV V+K E +GG+LN STCAEQLLYE+ DPSAYITPD+V+D   VSF 
Subjt:  RLAQGILAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFY

Query:  SISSSRVLCSGAKPSIQ-GVPGKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVE---DIR
         +S  +V CSGAKPS    VP KLL+L PK+CGWKGWGEISYGG   + RAKA+E+LVRSWMEE + G+N  I+SY IG+DSLKA+SN + S +   DIR
Subjt:  SISSSRVLCSGAKPSIQ-GVPGKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYTIGLDSLKASSNSSNSVE---DIR

Query:  LRMDGLFKQKGHALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEK
        LRMDGLFK K HA+   +EFTALYTNGPAGGGGISTG+K EIVL+K+LV RE++ W+TG+          Q T+  +    E  SP      P       
Subjt:  LRMDGLFKQKGHALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSPRVTLPCPITAYAEK

Query:  PCSGSFPPETGHSPFPSGQEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNLTLFPSSDADKKRDEVVDEHVKVEIYEVK
           G +     HSP PSGQ+I LY+VAHSRAGDKGND+NFS+IPHY  D+ERLK+IITP+WV  V+SVL + + F   DA     + +DE+V VEIY+V+
Subjt:  PCSGSFPPETGHSPFPSGQEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNLTLFPSSDADKKRDEVVDEHVKVEIYEVK

Query:  GIHSLNVVVRNILDGGVNCSRRIDRHGKTISDLILNQQIVL
        GIH++NVVVRNILDGGVNCSRRIDRHGKTISDLIL QQ+VL
Subjt:  GIHSLNVVVRNILDGGVNCSRRIDRHGKTISDLILNQQIVL

AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.2e-12439.18Show/hide
Query:  LSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLS---RQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFT
        LS LH C  L  ++ IHAQ++K  LH   Y + KLI    LS     +P A + F  +Q PN+ ++NTM R H  +S P  A   +  M   G  P+++T
Subjt:  LSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLS---RQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFT

Query:  FPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPERDGIS
        FPF+LK+C  +      Q +H  + K G   D++V  SLI  Y + G   +  A K+F      RDVVS+ ++I G+A  G  E A+K+FDE+P +D +S
Subjt:  FPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPERDGIS

Query:  WNTMLDGYVKVGKMDDAFKLFDEMPERNV-VSWSTMV-------------LG-------------------------YCKAGDMEMARVLFDKMPVKNLV
        WN M+ GY + G   +A +LF +M + NV    STMV             LG                         Y K G++E A  LF+++P K+++
Subjt:  WNTMLDGYVKVGKMDDAFKLFDEMPERNV-VSWSTMV-------------LG-------------------------YCKAGDMEMARVLFDKMPVKNLV

Query:  SWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISN---ALVDMYAKCGRLNIAYNVFNDIKN
        SW  +I G+    L +EA+ LF +M ++    ++ T++ IL ACA  G + +G  IH  I +  LK  T  S+   +L+DMYAKCG +  A+ VFN I +
Subjt:  SWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISN---ALVDMYAKCGRLNIAYNVFNDIKN

Query:  KDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSM
        K + SWNAM+ G AMHG    + +LF RM++ G  PD +T +G+L AC+H+G++D G   F TM +DY + P++EHYGCM+DLLG  G  +EA  +I  M
Subjt:  KDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSM

Query:  PMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSD
         MEP+ +IW +LL AC+MH  VEL     ++L+K+EP + G+  +LSNIYA+AG W+ VA TR  +   G +K  G SSIE+D+ VHEF + D+ HP++ 
Subjt:  PMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSD

Query:  NIYQVLMERQ
         IY +L E +
Subjt:  NIYQVLMERQ

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-11937Show/hide
Query:  PSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLS--RQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFA
        P+  +T     + +S + +C  L Q+KQ H  ++++    D Y   KL +  +LS    +  A   F+++  PN   +NT+IRA+     P  +   F  
Subjt:  PSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLS--RQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFA

Query:  MQCDG-FYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEAR
        M  +   YP+ +TFPFL+KA    + L + Q +H    K    SDVFV NSLI  Y  CG   + +A K+F ++   +DVVSWNSMI+GF + G  ++A 
Subjt:  MQCDG-FYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEAR

Query:  KVFDEMPERD---------GI------------------------------SWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMA
        ++F +M   D         G+                                N MLD Y K G ++DA +LFD M E++ V+W+TM+ GY  + D E A
Subjt:  KVFDEMPERD---------GI------------------------------SWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMA

Query:  RVLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQME-KARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLN
        R + + MP K++V+W  +IS + + G   EA+ +F +++ +  +KL+  T++  L+ACA+ G L LG  IH+ IK + ++    +++AL+ MY+KCG L 
Subjt:  RVLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQME-KARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLN

Query:  IAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGR
         +  VFN ++ +DV  W+AM+ GLAMHG G +A+++F +M+E    P+ VT   V CAC+H GL+D+    F  ME +Y +VPE +HY C+VD+LGR G 
Subjt:  IAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGR

Query:  LEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEF
        LE+AV+ I +MP+ P+  +WG LLGAC++H  + LA      L++LEP + G   +LSNIYA  G W+ V++ R  MR  G +K  G SSIE+D  +HEF
Subjt:  LEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEF

Query:  TVFDRSHPKSDNIYQVLME
           D +HP S+ +Y  L E
Subjt:  TVFDRSHPKSDNIYQVLME

AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.5e-23064.26Show/hide
Query:  SVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFA
        S+P+R PSW S+R++FE++L DL KC +LNQVKQ+HAQI++ NLH DL++ PKLISA SL RQ  LA   FNQVQ PNVHL N++IRAH  NSQP QAF 
Subjt:  SVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFA

Query:  TFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYE
         F  MQ  G + DNFT+PFLLKAC+G +WLPVV+M+H  IEK G  SD++VPN+LID YS+CG  G+  A KLF  M   RD VSWNSM+ G  K G   
Subjt:  TFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYE

Query:  EARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARVLFDKMPV--KNLVSWTIIISGFAEKGLAREAIGL
        +AR++FDEMP+RD ISWNTMLDGY +  +M  AF+LF++MPERN VSWSTMV+GY KAGDMEMARV+FDKMP+  KN+V+WTIII+G+AEKGL +EA  L
Subjt:  EARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARVLFDKMPV--KNLVSWTIIISGFAEKGLAREAIGL

Query:  FDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALE
         DQM  + LK D   VI ILAAC ESGLL LG RIH+ +K +NL     + NAL+DMYAKCG L  A++VFNDI  KD+VSWN ML GL +HGHG +A+E
Subjt:  FDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALE

Query:  LFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVEL
        LF RM+ EG  PDKVT I VLC+C HAGLID+GI YF +ME+ Y LVP+VEHYGC+VDLLGR GRL+EA++++++MPMEPNV+IWG LLGACRMHN V++
Subjt:  LFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVEL

Query:  AREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL
        A+EVLD+LVKL+P D GN S+LSNIYAAA DW+ VAD R +M+S+G +KPSGASS+E+++ +HEFTVFD+SHPKSD IYQ+L
Subjt:  AREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.2e-11239.17Show/hide
Query:  LISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMS--DVFV
        ++S ++ +  +  A + F+++   N   +N ++ A+  NS+  +A   F + +          +  +   C    ++   ++V A+ + F  M+  DV  
Subjt:  LISAFSLSRQMPLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMS--DVFV

Query:  PNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTM
         N++I  Y++ G   I  A++LF      +DV +W +M+SG+ +  + EEAR++FD+MPER+ +SWN ML GYV+  +M+ A +LFD MP RNV +W+TM
Subjt:  PNSLIDSYSKCGSCGISAAKKLFVSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTM

Query:  VLGYCKAGDMEMARVLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNA
        + GY + G +  A+ LFDKMP ++ VSW  +I+G+++ G + EA+ LF QME+   +L+  +    L+ CA+   L LG+++H  +     +    + NA
Subjt:  VLGYCKAGDMEMARVLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNA

Query:  LVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHY
        L+ MY KCG +  A ++F ++  KD+VSWN M+ G + HG G  AL  F+ MK EG  PD  TM+ VL AC+H GL+D G QYF TM +DY ++P  +HY
Subjt:  LVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHY

Query:  GCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGA
         CMVDLLGR G LE+A  L+++MP EP+  IWGTLLGA R+H   ELA    D +  +EP +SG   +LSN+YA++G W  V   R+RMR  G +K  G 
Subjt:  GCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGA

Query:  SSIEVDNEVHEFTVFDRSHPKSDNIYQVLME
        S IE+ N+ H F+V D  HP+ D I+  L E
Subjt:  SSIEVDNEVHEFTVFDRSHPKSDNIYQVLME


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCTGCGATCAGTCCTTGCAAGTTTCGCAGGTTAATTGGCTTTCGAGTGGTAGGCGGCGGCAGGCTTTGAGTGGAAAGGGTGTGGCTAAAGCTTCTCCGTCTGCCAT
TTTTGCGTGCGAAGCCATGCAAATGTGCAGTGTCCCAATTCGAACCCCCTCCTGGTTTTCCACTCGAAAGCTCTTCGAGCAGAAGCTTTCAGATCTCCACAAGTGTACAG
ACCTCAACCAAGTGAAGCAAATCCACGCTCAAATCCTCAAATCCAATCTCCACCTCGACCTCTATGTTGTTCCCAAACTCATATCTGCCTTCTCCCTTTCTCGCCAGATG
CCTCTCGCCACCAACACTTTCAATCAAGTTCAATATCCAAATGTCCATTTGTACAACACTATGATTCGAGCCCACACCCATAACTCACAACCTTCACAAGCCTTCGCCAC
TTTCTTTGCTATGCAATGTGATGGATTCTACCCCGATAATTTCACTTTCCCGTTTCTTTTGAAAGCTTGTACTGGGAATGCATGGTTGCCTGTTGTTCAAATGGTGCATG
CCCAAATCGAGAAATTTGGGTTCATGTCGGATGTATTCGTGCCAAATTCTCTTATTGATTCATATTCCAAATGTGGTTCTTGTGGAATTTCAGCAGCGAAGAAGTTGTTT
GTGTCAATGGGAGCTTGTAGGGATGTTGTGTCATGGAACTCAATGATCTCTGGATTTGCAAAGGGTGGGTTATATGAAGAAGCTCGAAAGGTGTTCGATGAAATGCCTGA
AAGGGATGGTATTAGTTGGAACACAATGTTGGATGGGTATGTTAAAGTTGGGAAAATGGACGATGCGTTTAAACTGTTTGATGAAATGCCTGAGAGGAATGTTGTCTCTT
GGTCGACAATGGTGTTAGGGTATTGCAAGGCAGGGGATATGGAGATGGCACGAGTGTTGTTTGATAAAATGCCTGTGAAGAATTTGGTTTCTTGGACCATAATTATCTCT
GGGTTTGCTGAGAAAGGGCTAGCTAGGGAGGCCATTGGTTTGTTTGATCAAATGGAAAAGGCTCGCTTGAAATTAGACAATGGGACGGTAATAGGTATTTTGGCTGCTTG
TGCCGAGTCTGGTTTGCTTGGGCTTGGTGAGAGAATACATGCTTCCATTAAGAACAATAATTTGAAATGTACTACTGAAATCTCCAATGCTTTGGTTGATATGTATGCAA
AATGTGGTAGGTTGAATATTGCTTACAATGTTTTTAATGACATAAAAAATAAAGATGTCGTGTCTTGGAATGCTATGCTTCAAGGGCTAGCAATGCATGGACATGGAGTG
AAAGCACTCGAGCTTTTCAAAAGAATGAAAGAAGAGGGTTTCTCACCTGACAAAGTTACAATGATCGGAGTCTTGTGTGCTTGTACGCATGCAGGATTGATCGACGATGG
CATTCAATACTTCTCTACGATGGAAAGGGACTACACCCTTGTTCCTGAAGTTGAGCATTATGGTTGCATGGTAGACCTTTTGGGTCGCAAGGGAAGGCTTGAGGAAGCCG
TAAGGCTCATTCGCAGCATGCCAATGGAACCAAATGTCATCATTTGGGGAACCCTTTTAGGGGCATGTCGTATGCATAATGCTGTTGAACTTGCGAGGGAGGTTCTTGAT
CATTTGGTTAAGCTAGAGCCATCTGATTCGGGTAATTTGTCCATGTTGTCGAACATATATGCTGCGGCAGGGGACTGGGATTGTGTCGCTGACACGAGGTTAAGAATGCG
GAGTATTGGAACTCAAAAACCGTCGGGTGCTAGTTCCATTGAGGTCGACAATGAGGTTCATGAATTTACAGTATTTGATCGATCGCATCCAAAGTCTGATAATATATATC
AGGTGCTAATGGAGAGGCAGGGTAAAGATGACATCCATGACTGCACAATTAAGCTGAGAGTAAATCCTCAAAAACAGAGAGACAAGGTGTACATTGGCTGTGGTGCTGGA
TTTGGAGGCGATAGGCCAACAGCAGCTCTTAAATTGCTTCAGAGGGTCAAAACCCTAAACTATCTCGTACTTGAATGCCTAGCAGAACGCACTCTTGCAGATCGCTGTCA
AGTTATGTTGTCTGGTGGTGATGGTTATGATTCAAGGATTGCAGATTGGATGAAATTGCTTCTTCCATTGGCTATAAAGAGAAATATTTGCATAATTACCAACATGGGTG
CAATGGACCCCCCTGGGGCTCAGCGAAACGTTATAGAAATTGCAGACAGTCTGGGGTTGAATGTTTCAGTTGCAGTTGCTTATGAGGTTTCAGTAAAAGAACCAGGGATA
AGCACATATATGGGAGCAGCTCCTATTGTTGAGTGTCTGGAAAAGTACCATCCAAATGTCATAATTACTTCACGTGTTGCAGATGCTGCCCTATTCTTGGCTCCAATGGT
CTATGAACTTGGTTGGAACTGGGATGATCTTCCACGGCTAGCACAGGGAATACTGGCTGGTCATCTTCTGGAATGTGGCTGTCAACTTACAGGGGGATACTTTATGCATC
CAGGGGACAAGTATAGAAGCATGTCTTTCCAACAGCTTCTGAATATATCACTGCCTTATGCGGAAGTTGAGTATGATGGAAAAGTCACTGTAGCCAAGGCAGAAGAGACT
GGAGGTCTTTTGAATTTCAGTACATGTGCCGAACAACTTCTGTATGAGGTTGGTGATCCATCGGCTTATATCACCCCTGATATGGTGGTTGACTTCAGCAATGTTTCGTT
TTACTCTATATCCAGTTCTAGGGTTTTATGTTCCGGAGCTAAACCATCTATTCAAGGAGTGCCTGGGAAACTCTTGCAGTTGGCTCCAAAGGACTGTGGATGGAAAGGAT
GGGGAGAGATATCGTATGGAGGACGTGAATGTGTTCTGCGTGCTAAGGCTGCAGAATATCTGGTTCGGTCGTGGATGGAAGAACAGTTGATTGGTATTAATCAGCATATA
GTTTCTTACACAATTGGACTCGACAGCCTTAAAGCATCCAGCAATAGTAGCAATAGTGTTGAAGATATTAGGTTGCGCATGGATGGTCTCTTCAAGCAGAAGGGGCACGC
TCTCCTGTTTGTTAGAGAATTTACAGCTTTATACACAAATGGGCCAGCTGGTGGTGGAGGCATCAGCACTGGCTACAAGAAAGAAATTGTGCTTGATAAACAACTGGTTG
GGCGTGAGAATATTTTCTGGCAAACAGGAGTGAAGTGCACTGAAGCAGTAAAATTCAACAGACAACCAACAGATCTTCGAAAGGATCCAGCAGAGGAATGTTCTTCGCCC
CGAGTAACATTGCCATGTCCGATAACTGCTTATGCTGAGAAACCTTGTTCAGGCTCCTTTCCACCAGAAACGGGTCATTCCCCTTTTCCATCTGGCCAGGAGATTGCGCT
GTACAATGTAGCCCATAGCAGAGCTGGAGACAAAGGGAATGACTTGAACTTCTCTGTCATTCCTCATTATCCTTCCGATATTGAGCGATTGAAGATGATCATCACACCTG
AATGGGTGATGAGAGTTCTCTCGGTTCTGCATAATTTGACTCTGTTTCCTTCTTCGGATGCCGATAAGAAGAGAGACGAGGTGGTAGATGAACATGTGAAGGTTGAAATA
TACGAAGTTAAAGGTATCCATTCTTTGAATGTTGTTGTTCGTAACATTCTAGACGGTGGCGTAAATTGCTCGCGGAGAATCGATCGCCATGGAAAGACCATATCGGATCT
CATCTTGAACCAGCAAATTGTTTTGCCACCATAG
mRNA sequenceShow/hide mRNA sequence
ATGACCTGCGATCAGTCCTTGCAAGTTTCGCAGGTTAATTGGCTTTCGAGTGGTAGGCGGCGGCAGGCTTTGAGTGGAAAGGGTGTGGCTAAAGCTTCTCCGTCTGCCAT
TTTTGCGTGCGAAGCCATGCAAATGTGCAGTGTCCCAATTCGAACCCCCTCCTGGTTTTCCACTCGAAAGCTCTTCGAGCAGAAGCTTTCAGATCTCCACAAGTGTACAG
ACCTCAACCAAGTGAAGCAAATCCACGCTCAAATCCTCAAATCCAATCTCCACCTCGACCTCTATGTTGTTCCCAAACTCATATCTGCCTTCTCCCTTTCTCGCCAGATG
CCTCTCGCCACCAACACTTTCAATCAAGTTCAATATCCAAATGTCCATTTGTACAACACTATGATTCGAGCCCACACCCATAACTCACAACCTTCACAAGCCTTCGCCAC
TTTCTTTGCTATGCAATGTGATGGATTCTACCCCGATAATTTCACTTTCCCGTTTCTTTTGAAAGCTTGTACTGGGAATGCATGGTTGCCTGTTGTTCAAATGGTGCATG
CCCAAATCGAGAAATTTGGGTTCATGTCGGATGTATTCGTGCCAAATTCTCTTATTGATTCATATTCCAAATGTGGTTCTTGTGGAATTTCAGCAGCGAAGAAGTTGTTT
GTGTCAATGGGAGCTTGTAGGGATGTTGTGTCATGGAACTCAATGATCTCTGGATTTGCAAAGGGTGGGTTATATGAAGAAGCTCGAAAGGTGTTCGATGAAATGCCTGA
AAGGGATGGTATTAGTTGGAACACAATGTTGGATGGGTATGTTAAAGTTGGGAAAATGGACGATGCGTTTAAACTGTTTGATGAAATGCCTGAGAGGAATGTTGTCTCTT
GGTCGACAATGGTGTTAGGGTATTGCAAGGCAGGGGATATGGAGATGGCACGAGTGTTGTTTGATAAAATGCCTGTGAAGAATTTGGTTTCTTGGACCATAATTATCTCT
GGGTTTGCTGAGAAAGGGCTAGCTAGGGAGGCCATTGGTTTGTTTGATCAAATGGAAAAGGCTCGCTTGAAATTAGACAATGGGACGGTAATAGGTATTTTGGCTGCTTG
TGCCGAGTCTGGTTTGCTTGGGCTTGGTGAGAGAATACATGCTTCCATTAAGAACAATAATTTGAAATGTACTACTGAAATCTCCAATGCTTTGGTTGATATGTATGCAA
AATGTGGTAGGTTGAATATTGCTTACAATGTTTTTAATGACATAAAAAATAAAGATGTCGTGTCTTGGAATGCTATGCTTCAAGGGCTAGCAATGCATGGACATGGAGTG
AAAGCACTCGAGCTTTTCAAAAGAATGAAAGAAGAGGGTTTCTCACCTGACAAAGTTACAATGATCGGAGTCTTGTGTGCTTGTACGCATGCAGGATTGATCGACGATGG
CATTCAATACTTCTCTACGATGGAAAGGGACTACACCCTTGTTCCTGAAGTTGAGCATTATGGTTGCATGGTAGACCTTTTGGGTCGCAAGGGAAGGCTTGAGGAAGCCG
TAAGGCTCATTCGCAGCATGCCAATGGAACCAAATGTCATCATTTGGGGAACCCTTTTAGGGGCATGTCGTATGCATAATGCTGTTGAACTTGCGAGGGAGGTTCTTGAT
CATTTGGTTAAGCTAGAGCCATCTGATTCGGGTAATTTGTCCATGTTGTCGAACATATATGCTGCGGCAGGGGACTGGGATTGTGTCGCTGACACGAGGTTAAGAATGCG
GAGTATTGGAACTCAAAAACCGTCGGGTGCTAGTTCCATTGAGGTCGACAATGAGGTTCATGAATTTACAGTATTTGATCGATCGCATCCAAAGTCTGATAATATATATC
AGGTGCTAATGGAGAGGCAGGGTAAAGATGACATCCATGACTGCACAATTAAGCTGAGAGTAAATCCTCAAAAACAGAGAGACAAGGTGTACATTGGCTGTGGTGCTGGA
TTTGGAGGCGATAGGCCAACAGCAGCTCTTAAATTGCTTCAGAGGGTCAAAACCCTAAACTATCTCGTACTTGAATGCCTAGCAGAACGCACTCTTGCAGATCGCTGTCA
AGTTATGTTGTCTGGTGGTGATGGTTATGATTCAAGGATTGCAGATTGGATGAAATTGCTTCTTCCATTGGCTATAAAGAGAAATATTTGCATAATTACCAACATGGGTG
CAATGGACCCCCCTGGGGCTCAGCGAAACGTTATAGAAATTGCAGACAGTCTGGGGTTGAATGTTTCAGTTGCAGTTGCTTATGAGGTTTCAGTAAAAGAACCAGGGATA
AGCACATATATGGGAGCAGCTCCTATTGTTGAGTGTCTGGAAAAGTACCATCCAAATGTCATAATTACTTCACGTGTTGCAGATGCTGCCCTATTCTTGGCTCCAATGGT
CTATGAACTTGGTTGGAACTGGGATGATCTTCCACGGCTAGCACAGGGAATACTGGCTGGTCATCTTCTGGAATGTGGCTGTCAACTTACAGGGGGATACTTTATGCATC
CAGGGGACAAGTATAGAAGCATGTCTTTCCAACAGCTTCTGAATATATCACTGCCTTATGCGGAAGTTGAGTATGATGGAAAAGTCACTGTAGCCAAGGCAGAAGAGACT
GGAGGTCTTTTGAATTTCAGTACATGTGCCGAACAACTTCTGTATGAGGTTGGTGATCCATCGGCTTATATCACCCCTGATATGGTGGTTGACTTCAGCAATGTTTCGTT
TTACTCTATATCCAGTTCTAGGGTTTTATGTTCCGGAGCTAAACCATCTATTCAAGGAGTGCCTGGGAAACTCTTGCAGTTGGCTCCAAAGGACTGTGGATGGAAAGGAT
GGGGAGAGATATCGTATGGAGGACGTGAATGTGTTCTGCGTGCTAAGGCTGCAGAATATCTGGTTCGGTCGTGGATGGAAGAACAGTTGATTGGTATTAATCAGCATATA
GTTTCTTACACAATTGGACTCGACAGCCTTAAAGCATCCAGCAATAGTAGCAATAGTGTTGAAGATATTAGGTTGCGCATGGATGGTCTCTTCAAGCAGAAGGGGCACGC
TCTCCTGTTTGTTAGAGAATTTACAGCTTTATACACAAATGGGCCAGCTGGTGGTGGAGGCATCAGCACTGGCTACAAGAAAGAAATTGTGCTTGATAAACAACTGGTTG
GGCGTGAGAATATTTTCTGGCAAACAGGAGTGAAGTGCACTGAAGCAGTAAAATTCAACAGACAACCAACAGATCTTCGAAAGGATCCAGCAGAGGAATGTTCTTCGCCC
CGAGTAACATTGCCATGTCCGATAACTGCTTATGCTGAGAAACCTTGTTCAGGCTCCTTTCCACCAGAAACGGGTCATTCCCCTTTTCCATCTGGCCAGGAGATTGCGCT
GTACAATGTAGCCCATAGCAGAGCTGGAGACAAAGGGAATGACTTGAACTTCTCTGTCATTCCTCATTATCCTTCCGATATTGAGCGATTGAAGATGATCATCACACCTG
AATGGGTGATGAGAGTTCTCTCGGTTCTGCATAATTTGACTCTGTTTCCTTCTTCGGATGCCGATAAGAAGAGAGACGAGGTGGTAGATGAACATGTGAAGGTTGAAATA
TACGAAGTTAAAGGTATCCATTCTTTGAATGTTGTTGTTCGTAACATTCTAGACGGTGGCGTAAATTGCTCGCGGAGAATCGATCGCCATGGAAAGACCATATCGGATCT
CATCTTGAACCAGCAAATTGTTTTGCCACCATAG
Protein sequenceShow/hide protein sequence
MTCDQSLQVSQVNWLSSGRRRQALSGKGVAKASPSAIFACEAMQMCSVPIRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQIHAQILKSNLHLDLYVVPKLISAFSLSRQM
PLATNTFNQVQYPNVHLYNTMIRAHTHNSQPSQAFATFFAMQCDGFYPDNFTFPFLLKACTGNAWLPVVQMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLF
VSMGACRDVVSWNSMISGFAKGGLYEEARKVFDEMPERDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARVLFDKMPVKNLVSWTIIIS
GFAEKGLAREAIGLFDQMEKARLKLDNGTVIGILAACAESGLLGLGERIHASIKNNNLKCTTEISNALVDMYAKCGRLNIAYNVFNDIKNKDVVSWNAMLQGLAMHGHGV
KALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIQYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAVRLIRSMPMEPNVIIWGTLLGACRMHNAVELAREVLD
HLVKLEPSDSGNLSMLSNIYAAAGDWDCVADTRLRMRSIGTQKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERQGKDDIHDCTIKLRVNPQKQRDKVYIGCGAG
FGGDRPTAALKLLQRVKTLNYLVLECLAERTLADRCQVMLSGGDGYDSRIADWMKLLLPLAIKRNICIITNMGAMDPPGAQRNVIEIADSLGLNVSVAVAYEVSVKEPGI
STYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVEYDGKVTVAKAEET
GGLLNFSTCAEQLLYEVGDPSAYITPDMVVDFSNVSFYSISSSRVLCSGAKPSIQGVPGKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHI
VSYTIGLDSLKASSNSSNSVEDIRLRMDGLFKQKGHALLFVREFTALYTNGPAGGGGISTGYKKEIVLDKQLVGRENIFWQTGVKCTEAVKFNRQPTDLRKDPAEECSSP
RVTLPCPITAYAEKPCSGSFPPETGHSPFPSGQEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERLKMIITPEWVMRVLSVLHNLTLFPSSDADKKRDEVVDEHVKVEI
YEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTISDLILNQQIVLPP