; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0009228 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0009228
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00000267:376674..385535
RNA-Seq ExpressionIVF0009228
SyntenyIVF0009228
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR010839 - Acyclic terpene utilisation
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK17586.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.097.89Show/hide
Query:  MQRDGFYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARK
        MQRDGFYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARK
Subjt:  MQRDGFYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARK

Query:  VFDEMPKRDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMARMLFDKMPVKNLVSWTIIVSGFAEKGLAREAIDLFDQMEK
        VFDEMPKRDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMARMLFDKMPVKNLVSWTIIVSGFAEKGLAREAIDLFDQMEK
Subjt:  VFDEMPKRDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMARMLFDKMPVKNLVSWTIIVSGFAEKGLAREAIDLFDQMEK

Query:  ACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNIAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALELFKKMK
        ACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNIAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALELFKKMK
Subjt:  ACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNIAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALELFKKMK

Query:  EEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVELAREVLD
        EEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVELAREVLD
Subjt:  EEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVELAREVLD

Query:  HLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERHSQADIHDCTIKLRVNPKKQR
        HLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEV                   LMERHSQADIHDCTIKLRVNPKKQR
Subjt:  HLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERHSQADIHDCTIKLRVNPKKQR

Query:  DKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNICIITNMGAMDPLAAQQKVIEVAGS
        DKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNICIITNMGAMDPLAAQQKVIEVAGS
Subjt:  DKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNICIITNMGAMDPLAAQQKVIEVAGS

Query:  LGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFMHPGDKYRS
        LGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFMHPGDKYRS
Subjt:  LGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFMHPGDKYRS

Query:  MSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVVCSGAKPSIQGVPEKLLQLAPKDCG
        MSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVVCSGAKPSIQGVPEKLLQLAPKDCG
Subjt:  MSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVVCSGAKPSIQGVPEKLLQLAPKDCG

Query:  WKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEHALLFVKEFTALYTNGPAGGGGIS
        WKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEHALLFVKEFTALYTNGPAGGGGIS
Subjt:  WKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEHALLFVKEFTALYTNGPAGGGGIS

XP_004134329.1 uncharacterized protein LOC101212841 isoform X2 [Cucumis sativus]0.095.58Show/hide
Query:  MERHSQADIHDCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNI
        ME H QADIHDCTIKLRVNP+KQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYD RIA+WMKLLLPL+MKRNI
Subjt:  MERHSQADIHDCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNI

Query:  CIITNMGAMDPLAAQQKVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILA
        CIITNMGAMDP AAQQ VIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILA
Subjt:  CIITNMGAMDPLAAQQKVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILA

Query:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVV
        GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIG+PSAYITPDLVVDFSNVSFCSISSSRV+
Subjt:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVV

Query:  CSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEH
        CSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGIN HIVSYTIGLDSLKASSN SNC+EDIRLRMDGLFEQKEH
Subjt:  CSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEH

Query:  ALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPETGH
        ALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEV C+EAVKLDSQSTDLQKDPAEACSSPRVTLPCPIS HA++LCTGS PPE GH
Subjt:  ALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPETGH

Query:  SPIPSGQEIALYDVAHSRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVVVRNI
        SPIPSGQEIALY+VAHSRAGDKGNDLNFSLIPH PSDIERLKMIITPEWVMRVLS LHN TRFHSSNA EKRNEWV+EDVKVEIYEVK IHSLNVVVRNI
Subjt:  SPIPSGQEIALYDVAHSRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVVVRNI

Query:  LDGGVNCSRRIDRHGKTISDLILNQLIVLPPGQ
        LDGGVNCSRRIDRHGKTISDLILNQLIVLPP Q
Subjt:  LDGGVNCSRRIDRHGKTISDLILNQLIVLPPGQ

XP_008438065.1 PREDICTED: uncharacterized protein LOC103483286 [Cucumis melo]0.0100Show/hide
Query:  MERHSQADIHDCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNI
        MERHSQADIHDCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNI
Subjt:  MERHSQADIHDCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNI

Query:  CIITNMGAMDPLAAQQKVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILA
        CIITNMGAMDPLAAQQKVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILA
Subjt:  CIITNMGAMDPLAAQQKVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILA

Query:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVV
        GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVV
Subjt:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVV

Query:  CSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEH
        CSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEH
Subjt:  CSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEH

Query:  ALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPETGH
        ALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPETGH
Subjt:  ALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPETGH

Query:  SPIPSGQEIALYDVAHSRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVVVRNI
        SPIPSGQEIALYDVAHSRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVVVRNI
Subjt:  SPIPSGQEIALYDVAHSRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVVVRNI

Query:  LDGGVNCSRRIDRHGKTISDLILNQLIVLPPGQ
        LDGGVNCSRRIDRHGKTISDLILNQLIVLPPGQ
Subjt:  LDGGVNCSRRIDRHGKTISDLILNQLIVLPPGQ

XP_031738474.1 uncharacterized protein LOC101212841 isoform X1 [Cucumis sativus]0.095.02Show/hide
Query:  DCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNICIITNMGAMD
        +C    RVNP+KQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYD RIA+WMKLLLPL+MKRNICIITNMGAMD
Subjt:  DCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNICIITNMGAMD

Query:  PLAAQQKVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQL
        P AAQQ VIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQL
Subjt:  PLAAQQKVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQL

Query:  TGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVVCSGAKPSIQG
        TGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIG+PSAYITPDLVVDFSNVSFCSISSSRV+CSGAKPSIQG
Subjt:  TGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVVCSGAKPSIQG

Query:  VPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEHALLFVKEFTA
        VPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGIN HIVSYTIGLDSLKASSN SNC+EDIRLRMDGLFEQKEHALLFVKEFTA
Subjt:  VPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEHALLFVKEFTA

Query:  LYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPETGHSPIPSGQEIA
        LYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEV C+EAVKLDSQSTDLQKDPAEACSSPRVTLPCPIS HA++LCTGS PPE GHSPIPSGQEIA
Subjt:  LYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPETGHSPIPSGQEIA

Query:  LYDVAHSRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVVVRNILDGGVNCSRR
        LY+VAHSRAGDKGNDLNFSLIPH PSDIERLKMIITPEWVMRVLS LHN TRFHSSNA EKRNEWV+EDVKVEIYEVK IHSLNVVVRNILDGGVNCSRR
Subjt:  LYDVAHSRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVVVRNILDGGVNCSRR

Query:  IDRHGKTISDLILNQLIVLPPGQ
        IDRHGKTISDLILNQLIVLPP Q
Subjt:  IDRHGKTISDLILNQLIVLPPGQ

XP_031738475.1 uncharacterized protein LOC101212841 isoform X3 [Cucumis sativus]0.094.7Show/hide
Query:  DCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNICIITNMGAMD
        +C    RVNP+KQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYD R   WMKLLLPL+MKRNICIITNMGAMD
Subjt:  DCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNICIITNMGAMD

Query:  PLAAQQKVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQL
        P AAQQ VIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQL
Subjt:  PLAAQQKVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQL

Query:  TGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVVCSGAKPSIQG
        TGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIG+PSAYITPDLVVDFSNVSFCSISSSRV+CSGAKPSIQG
Subjt:  TGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVVCSGAKPSIQG

Query:  VPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEHALLFVKEFTA
        VPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGIN HIVSYTIGLDSLKASSN SNC+EDIRLRMDGLFEQKEHALLFVKEFTA
Subjt:  VPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEHALLFVKEFTA

Query:  LYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPETGHSPIPSGQEIA
        LYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEV C+EAVKLDSQSTDLQKDPAEACSSPRVTLPCPIS HA++LCTGS PPE GHSPIPSGQEIA
Subjt:  LYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPETGHSPIPSGQEIA

Query:  LYDVAHSRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVVVRNILDGGVNCSRR
        LY+VAHSRAGDKGNDLNFSLIPH PSDIERLKMIITPEWVMRVLS LHN TRFHSSNA EKRNEWV+EDVKVEIYEVK IHSLNVVVRNILDGGVNCSRR
Subjt:  LYDVAHSRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVVVRNILDGGVNCSRR

Query:  IDRHGKTISDLILNQLIVLPPGQ
        IDRHGKTISDLILNQLIVLPP Q
Subjt:  IDRHGKTISDLILNQLIVLPPGQ

TrEMBL top hitse value%identityAlignment
A0A0A0L7H7 Uncharacterized protein0.0e+0095.56Show/hide
Query:  MQMCSVPIRTPSWFSTRKLFEQKLAELHKCTDLNQVKQLHAQILKSNLHVDLFVVPKLISAFSLCRQMLLATNTFNQVQYPNVHLYNTMIRAHSHNSQPS
        MQMCSVPIRTPSWFSTRKL EQKL++LHKCT+LNQVKQLHAQILKSNLHVDLFVVPKLISAFSLCRQMLLATN FNQVQYPNVHLYNTMIRAHSHNSQPS
Subjt:  MQMCSVPIRTPSWFSTRKLFEQKLAELHKCTDLNQVKQLHAQILKSNLHVDLFVVPKLISAFSLCRQMLLATNTFNQVQYPNVHLYNTMIRAHSHNSQPS

Query:  QAFATFFAMQRDGFYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKG
        QAFATFFAMQRDG Y DNFTFPFLLKVCTGNVWLPV+E VHAQIEKFGFMSDVFVPNSLIDSYSKCGS GISAAKKLFVSMGARRDVVSWNSMISGLAKG
Subjt:  QAFATFFAMQRDGFYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKG

Query:  GLYEEARKVFDEMPKRDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMARMLFDKMPVKNLVSWTIIVSGFAEKGLAREAI
        GLYEEARKVFDEMP++DGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAG MEMARMLFDKMPVKNLVSWTIIVSGFAEKGLAREAI
Subjt:  GLYEEARKVFDEMPKRDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMARMLFDKMPVKNLVSWTIIVSGFAEKGLAREAI

Query:  DLFDQMEKACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNIAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKA
         LFDQMEKACLKLDNGT++SIL ACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNIAYDVF+DIKNKDVVSWNAMLQGLAMHGHG+KA
Subjt:  DLFDQMEKACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNIAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKA

Query:  LELFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAV
        LELFK+MKEEGFSPN+VTMIGVLCACTHAGLIDDGIRYFSTMERDY LVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPM PNAIIWGTLLGACRMHNAV
Subjt:  LELFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAV

Query:  ELAREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERHSQADIHDCTIKL
        ELAREVLDHLVELEP+DSGN SMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEV+NEVHEFTVFDRSHPKSDNIYQVLME H QADIHDCTIKL
Subjt:  ELAREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERHSQADIHDCTIKL

Query:  RVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNICIITNMGAMDPLAAQQ
        RVNP+KQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYD RIA+WMKLLLPL+MKRNICIITNMGAMDP AAQQ
Subjt:  RVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNICIITNMGAMDPLAAQQ

Query:  KVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFM
         VIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFM
Subjt:  KVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFM

Query:  HPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVVCSGAKPSIQGVPEKLL
        HPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIG+PSAYITPDLVVDFSNVSFCSISSSRV+CSGAKPSIQGVPEKLL
Subjt:  HPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVVCSGAKPSIQGVPEKLL

Query:  QLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEHALLFVKEFTALYTNGP
        QLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGIN HIVSYTIGLDSLKASSN SNC+EDIRLRMDGLFEQKEHALLFVKEFTALYTNGP
Subjt:  QLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEHALLFVKEFTALYTNGP

Query:  AGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPETGHSPIPSGQEIALYDVAH
        AGGGGISTGYKKEIVLEKQLVGRENIFWQTEV C+EAVKLDSQSTDLQKDPAEACSSPRVTLPCPIS HA++LCTGS PPE GHSPIPSGQEIALY+VAH
Subjt:  AGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPETGHSPIPSGQEIALYDVAH

Query:  SRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVVVRNILDGGVNCSRRIDRHGK
        SRAGDKGNDLNFSLIPH PSDIERLKMIITPEWVMRVLS LHN TRFHSSNA EKRNEWV+EDVKVEIYEVK IHSLNVVVRNILDGGVNCSRRIDRHGK
Subjt:  SRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVVVRNILDGGVNCSRRIDRHGK

Query:  TISDLILNQLIVLPPGQ
        TISDLILNQLIVLPP Q
Subjt:  TISDLILNQLIVLPPGQ

A0A1S3AV50 uncharacterized protein LOC1034832860.0e+00100Show/hide
Query:  MERHSQADIHDCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNI
        MERHSQADIHDCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNI
Subjt:  MERHSQADIHDCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNI

Query:  CIITNMGAMDPLAAQQKVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILA
        CIITNMGAMDPLAAQQKVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILA
Subjt:  CIITNMGAMDPLAAQQKVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILA

Query:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVV
        GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVV
Subjt:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVV

Query:  CSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEH
        CSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEH
Subjt:  CSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEH

Query:  ALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPETGH
        ALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPETGH
Subjt:  ALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPETGH

Query:  SPIPSGQEIALYDVAHSRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVVVRNI
        SPIPSGQEIALYDVAHSRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVVVRNI
Subjt:  SPIPSGQEIALYDVAHSRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVVVRNI

Query:  LDGGVNCSRRIDRHGKTISDLILNQLIVLPPGQ
        LDGGVNCSRRIDRHGKTISDLILNQLIVLPPGQ
Subjt:  LDGGVNCSRRIDRHGKTISDLILNQLIVLPPGQ

A0A5D3D3E7 Pentatricopeptide repeat-containing protein0.0e+0097.89Show/hide
Query:  MQRDGFYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARK
        MQRDGFYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARK
Subjt:  MQRDGFYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARK

Query:  VFDEMPKRDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMARMLFDKMPVKNLVSWTIIVSGFAEKGLAREAIDLFDQMEK
        VFDEMPKRDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMARMLFDKMPVKNLVSWTIIVSGFAEKGLAREAIDLFDQMEK
Subjt:  VFDEMPKRDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMARMLFDKMPVKNLVSWTIIVSGFAEKGLAREAIDLFDQMEK

Query:  ACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNIAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALELFKKMK
        ACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNIAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALELFKKMK
Subjt:  ACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNIAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALELFKKMK

Query:  EEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVELAREVLD
        EEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVELAREVLD
Subjt:  EEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVELAREVLD

Query:  HLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERHSQADIHDCTIKLRVNPKKQR
        HLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNE                   VLMERHSQADIHDCTIKLRVNPKKQR
Subjt:  HLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERHSQADIHDCTIKLRVNPKKQR

Query:  DKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNICIITNMGAMDPLAAQQKVIEVAGS
        DKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNICIITNMGAMDPLAAQQKVIEVAGS
Subjt:  DKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNICIITNMGAMDPLAAQQKVIEVAGS

Query:  LGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFMHPGDKYRS
        LGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFMHPGDKYRS
Subjt:  LGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFMHPGDKYRS

Query:  MSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVVCSGAKPSIQGVPEKLLQLAPKDCG
        MSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVVCSGAKPSIQGVPEKLLQLAPKDCG
Subjt:  MSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVVCSGAKPSIQGVPEKLLQLAPKDCG

Query:  WKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEHALLFVKEFTALYTNGPAGGGGIS
        WKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEHALLFVKEFTALYTNGPAGGGGIS
Subjt:  WKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEHALLFVKEFTALYTNGPAGGGGIS

A0A6J1IGN9 uncharacterized protein LOC111474742 isoform X10.0e+0083.31Show/hide
Query:  YQVLMERHSQADIHDCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSM
        + +LMER  + D+HDCTIKLRVNPKK+RDKV IGCGAGFGGDRPTAALKLLQRVK+LNYLVLECLAERTLAD +Q M SGGDGYDSRIA+WMKLLLPL++
Subjt:  YQVLMERHSQADIHDCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSM

Query:  KRNICIITNMGAMDPLAAQQKVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQ
        KRNICIITNMGAMDP  AQQ VIE+A SLGL+VSVAVAYE SVKESGISTY+G APIV+CLEKYHPNVIITSRVADAALF+APMVYELGWNWDDFP L+Q
Subjt:  KRNICIITNMGAMDPLAAQQKVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQ

Query:  GILAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISS
        G LAGHLLECGCQLTGGYFMHPGDK+RSM FQQLL+ISLPYAE++CDGK+ VAK EE+GGLLNFSTCAEQLLYE+GDPSAYITPDLVVD SNVSFCSISS
Subjt:  GILAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISS

Query:  SRVVCSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFE
        S+V CSGAKPSIQ VPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEE+L G+N+HIVSY IGLDSLKAS NSS+ +EDIRLRMDGLFE
Subjt:  SRVVCSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFE

Query:  QKEHALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPP
         KEHALLFV+EFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRE++FW+  VKC++AV+LDS+ TDL++DPA+A +SPRVTLPC I ++A+  C  S  P
Subjt:  QKEHALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPP

Query:  ETGHSPIPSGQEIALYDVAHSRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVV
        ETGHSPIPSGQ++ALY+VAHSRAGDKGND+NFS++PHYPSDIERLKMIITPEWV RVLS L N + FH  +A +KR+EWVNE VKVEIYEVK IHSLNVV
Subjt:  ETGHSPIPSGQEIALYDVAHSRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVV

Query:  VRNILDGGVNCSRRIDRHGKTISDLILNQLIVLPP
        VRNILDGGVNCSRRIDRHGKTISDL+LNQ +VLPP
Subjt:  VRNILDGGVNCSRRIDRHGKTISDLILNQLIVLPP

A0A6J1IJ63 uncharacterized protein LOC111474742 isoform X30.0e+0083.57Show/hide
Query:  VLMERHSQADIHDCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKR
        +LMER  + D+HDCTIKLRVNPKK+RDKV IGCGAGFGGDRPTAALKLLQRVK+LNYLVLECLAERTLAD +Q M SGGDGYDSRIA+WMKLLLPL++KR
Subjt:  VLMERHSQADIHDCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKR

Query:  NICIITNMGAMDPLAAQQKVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGI
        NICIITNMGAMDP  AQQ VIE+A SLGL+VSVAVAYE SVKESGISTY+G APIV+CLEKYHPNVIITSRVADAALF+APMVYELGWNWDDFP L+QG 
Subjt:  NICIITNMGAMDPLAAQQKVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGI

Query:  LAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSR
        LAGHLLECGCQLTGGYFMHPGDK+RSM FQQLL+ISLPYAE++CDGK+ VAK EE+GGLLNFSTCAEQLLYE+GDPSAYITPDLVVD SNVSFCSISSS+
Subjt:  LAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSSR

Query:  VVCSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQK
        V CSGAKPSIQ VPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEE+L G+N+HIVSY IGLDSLKAS NSS+ +EDIRLRMDGLFE K
Subjt:  VVCSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQK

Query:  EHALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPET
        EHALLFV+EFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRE++FW+  VKC++AV+LDS+ TDL++DPA+A +SPRVTLPC I ++A+  C  S  PET
Subjt:  EHALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPET

Query:  GHSPIPSGQEIALYDVAHSRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVVVR
        GHSPIPSGQ++ALY+VAHSRAGDKGND+NFS++PHYPSDIERLKMIITPEWV RVLS L N + FH  +A +KR+EWVNE VKVEIYEVK IHSLNVVVR
Subjt:  GHSPIPSGQEIALYDVAHSRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVVVR

Query:  NILDGGVNCSRRIDRHGKTISDLILNQLIVLPP
        NILDGGVNCSRRIDRHGKTISDL+LNQ +VLPP
Subjt:  NILDGGVNCSRRIDRHGKTISDLILNQLIVLPP

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic2.0e-11636.51Show/hide
Query:  PSWFSTRKLFEQKLAELHKCTDLNQVKQLHAQILKSNLHVDLFVVPKL--ISAFSLCRQMLLATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFA
        P+  +T     + ++ + +C  L Q+KQ H  ++++    D +   KL  ++A S    +  A   F+++  PN   +NT+IRA++    P  +   F  
Subjt:  PSWFSTRKLFEQKLAELHKCTDLNQVKQLHAQILKSNLHVDLFVVPKL--ISAFSLCRQMLLATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFA

Query:  MQRDG-FYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEAR
        M  +   YP+ +TFPFL+K       L + + +H    K    SDVFV NSLI  Y  CG   + +A K+F ++   +DVVSWNSMI+G  + G  ++A 
Subjt:  MQRDG-FYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEAR

Query:  KVFDEMPKRD---------GI------------------------------SWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMA
        ++F +M   D         G+                                N MLD Y K G ++DA +LFD M E++ V+W+TM+ GY  +   E A
Subjt:  KVFDEMPKRD---------GI------------------------------SWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMA

Query:  RMLFDKMPVKNLVSWTIIVSGFAEKGLAREAIDLFDQME-KACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLN
        R + + MP K++V+W  ++S + + G   EA+ +F +++ +  +KL+  T++S L ACA+ G L LG  IH+ IK +  +    +++AL+ MY+KCG L 
Subjt:  RMLFDKMPVKNLVSWTIIVSGFAEKGLAREAIDLFDQME-KACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLN

Query:  IAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALELFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGR
         + +VF+ ++ +DV  W+AM+ GLAMHG G +A+++F KM+E    PN VT   V CAC+H GL+D+    F  ME +YG+VPE +HY C+VD+LGR G 
Subjt:  IAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALELFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGR

Query:  LEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVELAREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEF
        LE+A++ I  MP+ P+  +WG LLGAC++H  + LA      L+ELEP + G   +LSNIYA  G W  V+  R  MR  G KK  G SSIE+D  +HEF
Subjt:  LEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVELAREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEF

Query:  TVFDRSHPKSDNIYQVLME
           D +HP S+ +Y  L E
Subjt:  TVFDRSHPKSDNIYQVLME

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic7.0e-11737.66Show/hide
Query:  LAELHKCTDLNQVKQLHAQILKSNLHVDLFVVPKLISAFSLC---RQMLLATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQRDGFYPDNFT
        L+ LH C  L  ++ +HAQ++K  LH   + + KLI    L      +  A + F  +Q PN+ ++NTM R H+ +S P  A   +  M   G  P+++T
Subjt:  LAELHKCTDLNQVKQLHAQILKSNLHVDLFVVPKLISAFSLC---RQMLLATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQRDGFYPDNFT

Query:  FPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARKVFDEMPKRDGIS
        FPF+LK C  +      +++H  + K G   D++V  SLI  Y + G   +  A K+F      RDVVS+ ++I G A  G  E A+K+FDE+P +D +S
Subjt:  FPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARKVFDEMPKRDGIS

Query:  WNTMLDGYVKVGKMDDAFKLFDEMPERNV-VSWSTMV-------------LG-------------------------YCKAGGMEMARMLFDKMPVKNLV
        WN M+ GY + G   +A +LF +M + NV    STMV             LG                         Y K G +E A  LF+++P K+++
Subjt:  WNTMLDGYVKVGKMDDAFKLFDEMPERNV-VSWSTMV-------------LG-------------------------YCKAGGMEMARMLFDKMPVKNLV

Query:  SWTIIVSGFAEKGLAREAIDLFDQMEKACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISN---ALVDMYAKCGRLNIAYDVFSDIKN
        SW  ++ G+    L +EA+ LF +M ++    ++ T++SIL ACA  G + +G  IH  I +   K  T  S+   +L+DMYAKCG +  A+ VF+ I +
Subjt:  SWTIIVSGFAEKGLAREAIDLFDQMEKACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISN---ALVDMYAKCGRLNIAYDVFSDIKN

Query:  KDVVSWNAMLQGLAMHGHGMKALELFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNM
        K + SWNAM+ G AMHG    + +LF +M++ G  P+ +T +G+L AC+H+G++D G   F TM +DY + P++EHYGCM+DLLG  G  +EA  +I  M
Subjt:  KDVVSWNAMLQGLAMHGHGMKALELFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNM

Query:  PMTPNAIIWGTLLGACRMHNAVELAREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFTVFDRSHPKSD
         M P+ +IW +LL AC+MH  VEL     ++L+++EP + G+  +LSNIYA+AG WN VA TR  +   G KK  G SSIE+D+ VHEF + D+ HP++ 
Subjt:  PMTPNAIIWGTLLGACRMHNAVELAREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFTVFDRSHPKSD

Query:  NIYQVLME
         IY +L E
Subjt:  NIYQVLME

Q9LS72 Pentatricopeptide repeat-containing protein At3g292309.5e-22362.37Show/hide
Query:  SVPIRTPSWFSTRKLFEQKLAELHKCTDLNQVKQLHAQILKSNLHVDLFVVPKLISAFSLCRQMLLATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFA
        S+P+R PSW S+R++FE++L +L KC +LNQVKQLHAQI++ NLH DL + PKLISA SLCRQ  LA   FNQVQ PNVHL N++IRAH+ NSQP QAF 
Subjt:  SVPIRTPSWFSTRKLFEQKLAELHKCTDLNQVKQLHAQILKSNLHVDLFVVPKLISAFSLCRQMLLATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFA

Query:  TFFAMQRDGFYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYE
         F  MQR G + DNFT+PFLLK C+G  WLPVV+ +H  IEK G  SD++VPN+LID YS+CG  G+  A KLF  M + RD VSWNSM+ GL K G   
Subjt:  TFFAMQRDGFYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYE

Query:  EARKVFDEMPKRDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMARMLFDKMPV--KNLVSWTIIVSGFAEKGLAREAIDL
        +AR++FDEMP+RD ISWNTMLDGY +  +M  AF+LF++MPERN VSWSTMV+GY KAG MEMAR++FDKMP+  KN+V+WTII++G+AEKGL +EA  L
Subjt:  EARKVFDEMPKRDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMARMLFDKMPV--KNLVSWTIIVSGFAEKGLAREAIDL

Query:  FDQMEKACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNIAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALE
         DQM  + LK D   +ISIL AC ESGLL LG +IH+ +K +N      + NAL+DMYAKCG L  A+DVF+DI  KD+VSWN ML GL +HGHG +A+E
Subjt:  FDQMEKACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNIAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALE

Query:  LFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVEL
        LF +M+ EG  P++VT I VLC+C HAGLID+GI YF +ME+ Y LVP+VEHYGC+VDLLGR GRL+EAI++++ MPM PN +IWG LLGACRMHN V++
Subjt:  LFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVEL

Query:  AREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL
        A+EVLD+LV+L+P D GN S+LSNIYAAA DW  VA+ R +M+S+G +KPSGASS+E+++ +HEFTVFD+SHPKSD IYQ+L
Subjt:  AREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL

Q9SR82 Putative pentatricopeptide repeat-containing protein At3g088201.4e-10939.1Show/hide
Query:  STRKLFEQKLAELHKCTDLNQVKQLHAQILKSNLHVDLFVVPKLISAFSLCRQMLLATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQRDGF
        +T K+ + K      CT +N +KQ+H  ++  +LH D F+V  L+      RQ   +   F+  Q+PN+ LYN++I    +N    +    F ++++ G 
Subjt:  STRKLFEQKLAELHKCTDLNQVKQLHAQILKSNLHVDLFVVPKLISAFSLCRQMLLATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQRDGF

Query:  YPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARKVFDEM-
        Y   FTFP +LK CT      +   +H+ + K GF  DV    SL+  YS  GS  ++ A KLF  +   R VV+W ++ SG    G + EA  +F +M 
Subjt:  YPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARKVFDEM-

Query:  ---PKRDGISWNTMLDGYVKVGKMDDA---FKLFDEMP-ERNVVSWSTMVLGYCKAGGMEMARMLFDKMPVKNLVSWTIIVSGFAEKGLAREAIDLFDQM
            K D      +L   V VG +D      K  +EM  ++N    +T+V  Y K G ME AR +FD M  K++V+W+ ++ G+A     +E I+LF QM
Subjt:  ---PKRDGISWNTMLDGYVKVGKMDDA---FKLFDEMP-ERNVVSWSTMVLGYCKAGGMEMARMLFDKMPVKNLVSWTIIVSGFAEKGLAREAIDLFDQM

Query:  EKACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNIAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALELFKK
         +  LK D  +I+  L +CA  G L LGE   + I  + F     ++NAL+DMYAKCG +   ++VF ++K KD+V  NA + GLA +GH   +  +F +
Subjt:  EKACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNIAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALELFKK

Query:  MKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVELAREV
         ++ G SP+  T +G+LC C HAGLI DG+R+F+ +   Y L   VEHYGCMVDL GR G L++A RLI +MPM PNAI+WG LL  CR+    +LA  V
Subjt:  MKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVELAREV

Query:  LDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL
        L  L+ LEP ++GN   LSNIY+  G W+  A  R  M   G KK  G S IE++ +VHEF   D+SHP SD IY  L
Subjt:  LDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL

Q9SY02 Pentatricopeptide repeat-containing protein At4g027505.8e-11139.96Show/hide
Query:  ATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQRDGFYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMS--DVFVPNSLIDSYSKCGS
        A + F+++   N   +N ++ A+  NS+  +A   F + +       N      +K           +++    + F  M+  DV   N++I  Y++ G 
Subjt:  ATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQRDGFYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMS--DVFVPNSLIDSYSKCGS

Query:  RGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARKVFDEMPKRDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMA
          I  A++LF      +DV +W +M+SG  +  + EEAR++FD+MP+R+ +SWN ML GYV+  +M+ A +LFD MP RNV +W+TM+ GY + G +  A
Subjt:  RGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARKVFDEMPKRDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMA

Query:  RMLFDKMPVKNLVSWTIIVSGFAEKGLAREAIDLFDQMEKACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNI
        + LFDKMP ++ VSW  +++G+++ G + EA+ LF QME+   +L+  +  S L  CA+   L LG+++H  +    ++    + NAL+ MY KCG +  
Subjt:  RMLFDKMPVKNLVSWTIIVSGFAEKGLAREAIDLFDQMEKACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNI

Query:  AYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALELFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRL
        A D+F ++  KD+VSWN M+ G + HG G  AL  F+ MK EG  P+  TM+ VL AC+H GL+D G +YF TM +DYG++P  +HY CMVDLLGR G L
Subjt:  AYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALELFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRL

Query:  EEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVELAREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFT
        E+A  L++NMP  P+A IWGTLLGA R+H   ELA    D +  +EP +SG   +LSN+YA++G W  V   R+RMR  G KK  G S IE+ N+ H F+
Subjt:  EEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVELAREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFT

Query:  VFDRSHPKSDNIYQVLME
        V D  HP+ D I+  L E
Subjt:  VFDRSHPKSDNIYQVLME

Arabidopsis top hitse value%identityAlignment
AT1G01770.1 unknown protein4.4e-23163.99Show/hide
Query:  DCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNICIITNMGAMD
        DC I LR NPK++R+ V +GCGAGFGGDRP AALKLLQRV+ LNYLVLECLAERTLAD +  M SGG GYD R++EWM+LLLPL+++R  CIITNMGA+D
Subjt:  DCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNICIITNMGAMD

Query:  PLAAQQKVIEVAGSLGLNVSVAVAYE-------GS------VKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQG
        P  AQ+KV+EVAG LGL +SVAVA+E       GS          G STY+G APIVECLEKY PNVIITSRVADAALFLAPMVYELGWNW+D  LLAQG
Subjt:  PLAAQQKVIEVAGSLGLNVSVAVAYE-------GS------VKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQG

Query:  ILAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSS
         LAGHLLECGCQLTGGYFMHPGD+YR M+F  L ++SLPYAE+  DGK+ V+K E SGG+LN STCAEQLLYEI DPSAYITPD+V+D   VSF  +S  
Subjt:  ILAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISSS

Query:  RVVCSGAKPSIQ-GVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIE---DIRLRMDG
        +V CSGAKPS    VPEKLL+L PK+CGWKGWGEISYGG   + RAKA+E+LVRSWMEE + G+N  I+SY IG+DSLKA+SN +   +   DIRLRMDG
Subjt:  RVVCSGAKPSIQ-GVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIE---DIRLRMDG

Query:  LFEQKEHALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGS
        LF+ KEHA+   KEFTALYTNGPAGGGGISTG+K EIVLEK+LV RE++ W+T +          Q T+  +       SP   +P     + + L    
Subjt:  LFEQKEHALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGS

Query:  FPPETGHSPIPSGQEIALYDVAHSRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSL
        +     HSP PSGQ+I LY VAHSRAGDKGND+NFS+IPHY  D+ERLK+IITP+WV  V+S L + + F   +A     + ++E+V VEIY+V+ IH++
Subjt:  FPPETGHSPIPSGQEIALYDVAHSRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSL

Query:  NVVVRNILDGGVNCSRRIDRHGKTISDLILNQLIVL
        NVVVRNILDGGVNCSRRIDRHGKTISDLIL Q +VL
Subjt:  NVVVRNILDGGVNCSRRIDRHGKTISDLILNQLIVL

AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.0e-11837.66Show/hide
Query:  LAELHKCTDLNQVKQLHAQILKSNLHVDLFVVPKLISAFSLC---RQMLLATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQRDGFYPDNFT
        L+ LH C  L  ++ +HAQ++K  LH   + + KLI    L      +  A + F  +Q PN+ ++NTM R H+ +S P  A   +  M   G  P+++T
Subjt:  LAELHKCTDLNQVKQLHAQILKSNLHVDLFVVPKLISAFSLC---RQMLLATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQRDGFYPDNFT

Query:  FPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARKVFDEMPKRDGIS
        FPF+LK C  +      +++H  + K G   D++V  SLI  Y + G   +  A K+F      RDVVS+ ++I G A  G  E A+K+FDE+P +D +S
Subjt:  FPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARKVFDEMPKRDGIS

Query:  WNTMLDGYVKVGKMDDAFKLFDEMPERNV-VSWSTMV-------------LG-------------------------YCKAGGMEMARMLFDKMPVKNLV
        WN M+ GY + G   +A +LF +M + NV    STMV             LG                         Y K G +E A  LF+++P K+++
Subjt:  WNTMLDGYVKVGKMDDAFKLFDEMPERNV-VSWSTMV-------------LG-------------------------YCKAGGMEMARMLFDKMPVKNLV

Query:  SWTIIVSGFAEKGLAREAIDLFDQMEKACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISN---ALVDMYAKCGRLNIAYDVFSDIKN
        SW  ++ G+    L +EA+ LF +M ++    ++ T++SIL ACA  G + +G  IH  I +   K  T  S+   +L+DMYAKCG +  A+ VF+ I +
Subjt:  SWTIIVSGFAEKGLAREAIDLFDQMEKACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISN---ALVDMYAKCGRLNIAYDVFSDIKN

Query:  KDVVSWNAMLQGLAMHGHGMKALELFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNM
        K + SWNAM+ G AMHG    + +LF +M++ G  P+ +T +G+L AC+H+G++D G   F TM +DY + P++EHYGCM+DLLG  G  +EA  +I  M
Subjt:  KDVVSWNAMLQGLAMHGHGMKALELFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNM

Query:  PMTPNAIIWGTLLGACRMHNAVELAREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFTVFDRSHPKSD
         M P+ +IW +LL AC+MH  VEL     ++L+++EP + G+  +LSNIYA+AG WN VA TR  +   G KK  G SSIE+D+ VHEF + D+ HP++ 
Subjt:  PMTPNAIIWGTLLGACRMHNAVELAREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFTVFDRSHPKSD

Query:  NIYQVLME
         IY +L E
Subjt:  NIYQVLME

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-11736.51Show/hide
Query:  PSWFSTRKLFEQKLAELHKCTDLNQVKQLHAQILKSNLHVDLFVVPKL--ISAFSLCRQMLLATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFA
        P+  +T     + ++ + +C  L Q+KQ H  ++++    D +   KL  ++A S    +  A   F+++  PN   +NT+IRA++    P  +   F  
Subjt:  PSWFSTRKLFEQKLAELHKCTDLNQVKQLHAQILKSNLHVDLFVVPKL--ISAFSLCRQMLLATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFA

Query:  MQRDG-FYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEAR
        M  +   YP+ +TFPFL+K       L + + +H    K    SDVFV NSLI  Y  CG   + +A K+F ++   +DVVSWNSMI+G  + G  ++A 
Subjt:  MQRDG-FYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEAR

Query:  KVFDEMPKRD---------GI------------------------------SWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMA
        ++F +M   D         G+                                N MLD Y K G ++DA +LFD M E++ V+W+TM+ GY  +   E A
Subjt:  KVFDEMPKRD---------GI------------------------------SWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMA

Query:  RMLFDKMPVKNLVSWTIIVSGFAEKGLAREAIDLFDQME-KACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLN
        R + + MP K++V+W  ++S + + G   EA+ +F +++ +  +KL+  T++S L ACA+ G L LG  IH+ IK +  +    +++AL+ MY+KCG L 
Subjt:  RMLFDKMPVKNLVSWTIIVSGFAEKGLAREAIDLFDQME-KACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLN

Query:  IAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALELFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGR
         + +VF+ ++ +DV  W+AM+ GLAMHG G +A+++F KM+E    PN VT   V CAC+H GL+D+    F  ME +YG+VPE +HY C+VD+LGR G 
Subjt:  IAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALELFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGR

Query:  LEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVELAREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEF
        LE+A++ I  MP+ P+  +WG LLGAC++H  + LA      L+ELEP + G   +LSNIYA  G W  V+  R  MR  G KK  G SSIE+D  +HEF
Subjt:  LEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVELAREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEF

Query:  TVFDRSHPKSDNIYQVLME
           D +HP S+ +Y  L E
Subjt:  TVFDRSHPKSDNIYQVLME

AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.8e-22462.37Show/hide
Query:  SVPIRTPSWFSTRKLFEQKLAELHKCTDLNQVKQLHAQILKSNLHVDLFVVPKLISAFSLCRQMLLATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFA
        S+P+R PSW S+R++FE++L +L KC +LNQVKQLHAQI++ NLH DL + PKLISA SLCRQ  LA   FNQVQ PNVHL N++IRAH+ NSQP QAF 
Subjt:  SVPIRTPSWFSTRKLFEQKLAELHKCTDLNQVKQLHAQILKSNLHVDLFVVPKLISAFSLCRQMLLATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFA

Query:  TFFAMQRDGFYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYE
         F  MQR G + DNFT+PFLLK C+G  WLPVV+ +H  IEK G  SD++VPN+LID YS+CG  G+  A KLF  M + RD VSWNSM+ GL K G   
Subjt:  TFFAMQRDGFYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYE

Query:  EARKVFDEMPKRDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMARMLFDKMPV--KNLVSWTIIVSGFAEKGLAREAIDL
        +AR++FDEMP+RD ISWNTMLDGY +  +M  AF+LF++MPERN VSWSTMV+GY KAG MEMAR++FDKMP+  KN+V+WTII++G+AEKGL +EA  L
Subjt:  EARKVFDEMPKRDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMARMLFDKMPV--KNLVSWTIIVSGFAEKGLAREAIDL

Query:  FDQMEKACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNIAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALE
         DQM  + LK D   +ISIL AC ESGLL LG +IH+ +K +N      + NAL+DMYAKCG L  A+DVF+DI  KD+VSWN ML GL +HGHG +A+E
Subjt:  FDQMEKACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNIAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALE

Query:  LFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVEL
        LF +M+ EG  P++VT I VLC+C HAGLID+GI YF +ME+ Y LVP+VEHYGC+VDLLGR GRL+EAI++++ MPM PN +IWG LLGACRMHN V++
Subjt:  LFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVEL

Query:  AREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL
        A+EVLD+LV+L+P D GN S+LSNIYAAA DW  VA+ R +M+S+G +KPSGASS+E+++ +HEFTVFD+SHPKSD IYQ+L
Subjt:  AREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVL

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.1e-11239.96Show/hide
Query:  ATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQRDGFYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMS--DVFVPNSLIDSYSKCGS
        A + F+++   N   +N ++ A+  NS+  +A   F + +       N      +K           +++    + F  M+  DV   N++I  Y++ G 
Subjt:  ATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQRDGFYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMS--DVFVPNSLIDSYSKCGS

Query:  RGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARKVFDEMPKRDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMA
          I  A++LF      +DV +W +M+SG  +  + EEAR++FD+MP+R+ +SWN ML GYV+  +M+ A +LFD MP RNV +W+TM+ GY + G +  A
Subjt:  RGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARKVFDEMPKRDGISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMA

Query:  RMLFDKMPVKNLVSWTIIVSGFAEKGLAREAIDLFDQMEKACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNI
        + LFDKMP ++ VSW  +++G+++ G + EA+ LF QME+   +L+  +  S L  CA+   L LG+++H  +    ++    + NAL+ MY KCG +  
Subjt:  RMLFDKMPVKNLVSWTIIVSGFAEKGLAREAIDLFDQMEKACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNI

Query:  AYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALELFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRL
        A D+F ++  KD+VSWN M+ G + HG G  AL  F+ MK EG  P+  TM+ VL AC+H GL+D G +YF TM +DYG++P  +HY CMVDLLGR G L
Subjt:  AYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALELFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRL

Query:  EEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVELAREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFT
        E+A  L++NMP  P+A IWGTLLGA R+H   ELA    D +  +EP +SG   +LSN+YA++G W  V   R+RMR  G KK  G S IE+ N+ H F+
Subjt:  EEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVELAREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTKKPSGASSIEVDNEVHEFT

Query:  VFDRSHPKSDNIYQVLME
        V D  HP+ D I+  L E
Subjt:  VFDRSHPKSDNIYQVLME


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAATGTGCAGCGTCCCAATTCGAACCCCATCTTGGTTTTCCACTCGAAAGCTCTTCGAGCAGAAGCTTGCAGAGCTCCACAAGTGCACAGACCTCAACCAAGTGAA
GCAACTCCACGCCCAAATCCTCAAATCCAATCTCCACGTCGACCTCTTCGTTGTTCCCAAACTCATATCTGCCTTCTCCCTCTGTCGCCAAATGCTCCTCGCCACTAACA
CTTTCAATCAAGTACAATATCCGAACGTCCATTTATACAACACCATGATTCGTGCCCACTCCCATAACTCCCAACCCTCACAAGCCTTTGCCACTTTCTTTGCCATGCAA
CGTGATGGATTCTACCCCGATAATTTCACTTTCCCGTTTCTTTTGAAAGTTTGTACAGGGAATGTGTGGTTGCCTGTTGTTGAAAGGGTACATGCCCAAATCGAGAAATT
TGGGTTCATGTCGGATGTATTCGTGCCAAATTCTCTAATCGATTCATATTCCAAATGTGGGTCTCGTGGAATTTCGGCAGCAAAGAAATTGTTTGTGTCAATGGGGGCTC
GTAGGGATGTTGTGTCATGGAATTCAATGATCTCTGGATTAGCGAAAGGTGGGTTGTATGAAGAAGCTCGAAAGGTGTTTGATGAAATGCCTAAAAGGGATGGTATTAGT
TGGAACACAATGTTGGACGGGTACGTTAAAGTTGGCAAAATGGATGATGCGTTTAAGTTGTTTGATGAAATGCCTGAGAGGAATGTCGTCTCTTGGTCAACAATGGTGTT
AGGGTATTGCAAGGCAGGGGGTATGGAGATGGCACGAATGTTGTTCGATAAAATGCCAGTAAAGAATTTGGTTTCTTGGACTATAATTGTATCTGGGTTTGCTGAGAAAG
GGCTAGCTAGAGAGGCCATTGACTTGTTTGATCAAATGGAAAAGGCTTGCTTGAAGTTAGACAATGGGACGATAATAAGTATTTTGGATGCTTGTGCTGAGTCTGGTTTG
CTTGGGCTCGGTGAGAAAATACATGCTTCCATTAAGAACAATAATTTCAAATGTACTACTGAAATCTCCAATGCTTTGGTTGATATGTATGCAAAATGTGGTAGGTTGAA
TATTGCATATGATGTTTTTAGTGACATAAAAAATAAAGATGTCGTGTCTTGGAATGCTATGCTTCAAGGGCTTGCAATGCATGGACATGGCATGAAAGCACTTGAGCTTT
TCAAAAAAATGAAAGAAGAGGGTTTCTCACCCAATAGAGTTACAATGATTGGAGTCTTGTGTGCTTGTACGCATGCAGGATTGATCGACGATGGCATTCGCTACTTCTCT
ACGATGGAAAGGGACTACGGCCTTGTTCCTGAGGTTGAGCATTATGGCTGCATGGTAGACCTTTTGGGTCGCAAGGGAAGGCTTGAGGAAGCCATAAGGCTCATTCGCAA
CATGCCAATGACACCAAATGCCATCATTTGGGGAACCCTTTTGGGGGCATGTCGCATGCATAATGCTGTTGAACTTGCAAGGGAGGTTCTAGATCATTTGGTTGAGCTGG
AACCGTCTGATTCGGGTAATCTTTCCATGTTGTCTAACATATATGCTGCGGCAGGGGACTGGAACTGCGTTGCCAACACGAGGTTGAGAATGCGGAGTATTGGAACTAAA
AAACCGTCGGGTGCTAGTTCCATTGAGGTCGACAATGAGGTTCATGAATTTACAGTTTTTGATCGATCACATCCAAAATCTGATAATATATATCAGGTTCTAATGGAGAG
GCATAGTCAAGCTGACATACATGACTGCACAATTAAACTGAGAGTAAATCCTAAAAAACAGAGAGACAAGGTGTGCATTGGCTGTGGTGCTGGATTTGGAGGCGATAGGC
CAACTGCGGCTCTTAAATTGCTTCAGAGGGTCAAAAACCTAAACTATCTTGTACTTGAATGCCTAGCAGAACGCACTCTTGCAGATCACTATCAAGTTATGTTGTCTGGT
GGTGATGGTTACGATTCAAGGATTGCAGAATGGATGAAATTGCTTCTTCCCTTGTCTATGAAGAGAAATATTTGCATAATTACCAACATGGGTGCAATGGACCCCCTTGC
GGCTCAGCAAAAAGTTATAGAAGTAGCAGGTAGTCTGGGGCTGAATGTTTCAGTTGCAGTTGCTTATGAGGGTTCGGTAAAAGAATCAGGAATTAGCACGTATATGGGAG
GAGCACCTATTGTTGAGTGTCTGGAGAAGTACCATCCAAATGTCATAATTACTTCACGTGTTGCAGATGCTGCCCTATTCTTGGCTCCAATGGTCTATGAACTTGGTTGG
AACTGGGATGATTTTCCATTGCTAGCACAGGGAATACTGGCTGGTCATCTTCTGGAATGTGGCTGTCAACTTACAGGGGGATACTTTATGCATCCAGGAGATAAGTATAG
GAGCATGTCTTTCCAACAGCTTCTGAATATATCACTGCCTTATGCGGAAGTTGAATGTGATGGAAAGTTAACTGTAGCCAAGCCTGAAGAGAGTGGAGGTCTTTTGAATT
TCAGTACATGTGCTGAACAACTTCTGTACGAGATTGGTGATCCATCAGCTTATATCACCCCTGATTTGGTGGTTGACTTCAGCAATGTTTCGTTTTGCTCTATATCCAGC
TCTAGGGTTGTATGTTCCGGAGCAAAACCGTCTATTCAAGGAGTGCCGGAGAAACTCTTGCAGTTGGCCCCAAAGGACTGTGGATGGAAAGGATGGGGAGAGATTTCCTA
TGGGGGACGTGAATGTGTTCTGCGTGCTAAAGCTGCAGAATATCTGGTTCGGTCATGGATGGAAGAACTGTTGATTGGTATTAATGAGCATATAGTTTCTTACACAATTG
GACTCGACAGCCTTAAAGCATCCAGCAATAGTAGCAATTGTATTGAAGATATTAGGTTGCGCATGGATGGACTCTTTGAGCAGAAGGAGCACGCTCTCCTGTTTGTTAAA
GAATTTACAGCTTTATACACAAATGGGCCAGCTGGTGGTGGCGGCATCAGCACTGGCTACAAGAAAGAAATTGTGCTTGAAAAACAACTGGTTGGGCGTGAAAATATTTT
CTGGCAAACAGAAGTGAAGTGCAGTGAAGCAGTAAAATTAGACAGCCAATCAACAGATCTTCAAAAGGATCCAGCAGAGGCATGTTCTTCGCCCCGAGTAACGTTGCCAT
GTCCGATATCTTCTCATGCAGAGAAACTTTGTACAGGCTCCTTCCCACCAGAAACGGGTCATTCTCCTATTCCATCTGGCCAGGAGATTGCTCTTTACGATGTAGCCCAT
AGCAGAGCTGGAGACAAAGGGAATGACTTGAACTTCTCTCTCATTCCTCATTATCCTTCTGATATCGAGCGATTGAAGATGATCATCACGCCTGAATGGGTGATGAGAGT
TCTCTCGGGTCTGCATAATTTGACTCGGTTTCATTCTTCAAATGCTGGTGAGAAGAGAAACGAGTGGGTAAATGAAGATGTGAAGGTTGAAATATACGAAGTTAAATCTA
TACATTCTTTGAATGTCGTTGTTCGTAACATTCTTGACGGTGGCGTAAATTGCTCACGGAGAATCGATCGCCATGGAAAGACTATATCGGATCTCATCTTGAACCAGCTA
ATTGTTTTGCCACCTGGACAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCAAATGTGCAGCGTCCCAATTCGAACCCCATCTTGGTTTTCCACTCGAAAGCTCTTCGAGCAGAAGCTTGCAGAGCTCCACAAGTGCACAGACCTCAACCAAGTGAA
GCAACTCCACGCCCAAATCCTCAAATCCAATCTCCACGTCGACCTCTTCGTTGTTCCCAAACTCATATCTGCCTTCTCCCTCTGTCGCCAAATGCTCCTCGCCACTAACA
CTTTCAATCAAGTACAATATCCGAACGTCCATTTATACAACACCATGATTCGTGCCCACTCCCATAACTCCCAACCCTCACAAGCCTTTGCCACTTTCTTTGCCATGCAA
CGTGATGGATTCTACCCCGATAATTTCACTTTCCCGTTTCTTTTGAAAGTTTGTACAGGGAATGTGTGGTTGCCTGTTGTTGAAAGGGTACATGCCCAAATCGAGAAATT
TGGGTTCATGTCGGATGTATTCGTGCCAAATTCTCTAATCGATTCATATTCCAAATGTGGGTCTCGTGGAATTTCGGCAGCAAAGAAATTGTTTGTGTCAATGGGGGCTC
GTAGGGATGTTGTGTCATGGAATTCAATGATCTCTGGATTAGCGAAAGGTGGGTTGTATGAAGAAGCTCGAAAGGTGTTTGATGAAATGCCTAAAAGGGATGGTATTAGT
TGGAACACAATGTTGGACGGGTACGTTAAAGTTGGCAAAATGGATGATGCGTTTAAGTTGTTTGATGAAATGCCTGAGAGGAATGTCGTCTCTTGGTCAACAATGGTGTT
AGGGTATTGCAAGGCAGGGGGTATGGAGATGGCACGAATGTTGTTCGATAAAATGCCAGTAAAGAATTTGGTTTCTTGGACTATAATTGTATCTGGGTTTGCTGAGAAAG
GGCTAGCTAGAGAGGCCATTGACTTGTTTGATCAAATGGAAAAGGCTTGCTTGAAGTTAGACAATGGGACGATAATAAGTATTTTGGATGCTTGTGCTGAGTCTGGTTTG
CTTGGGCTCGGTGAGAAAATACATGCTTCCATTAAGAACAATAATTTCAAATGTACTACTGAAATCTCCAATGCTTTGGTTGATATGTATGCAAAATGTGGTAGGTTGAA
TATTGCATATGATGTTTTTAGTGACATAAAAAATAAAGATGTCGTGTCTTGGAATGCTATGCTTCAAGGGCTTGCAATGCATGGACATGGCATGAAAGCACTTGAGCTTT
TCAAAAAAATGAAAGAAGAGGGTTTCTCACCCAATAGAGTTACAATGATTGGAGTCTTGTGTGCTTGTACGCATGCAGGATTGATCGACGATGGCATTCGCTACTTCTCT
ACGATGGAAAGGGACTACGGCCTTGTTCCTGAGGTTGAGCATTATGGCTGCATGGTAGACCTTTTGGGTCGCAAGGGAAGGCTTGAGGAAGCCATAAGGCTCATTCGCAA
CATGCCAATGACACCAAATGCCATCATTTGGGGAACCCTTTTGGGGGCATGTCGCATGCATAATGCTGTTGAACTTGCAAGGGAGGTTCTAGATCATTTGGTTGAGCTGG
AACCGTCTGATTCGGGTAATCTTTCCATGTTGTCTAACATATATGCTGCGGCAGGGGACTGGAACTGCGTTGCCAACACGAGGTTGAGAATGCGGAGTATTGGAACTAAA
AAACCGTCGGGTGCTAGTTCCATTGAGGTCGACAATGAGGTTCATGAATTTACAGTTTTTGATCGATCACATCCAAAATCTGATAATATATATCAGGTTCTAATGGAGAG
GCATAGTCAAGCTGACATACATGACTGCACAATTAAACTGAGAGTAAATCCTAAAAAACAGAGAGACAAGGTGTGCATTGGCTGTGGTGCTGGATTTGGAGGCGATAGGC
CAACTGCGGCTCTTAAATTGCTTCAGAGGGTCAAAAACCTAAACTATCTTGTACTTGAATGCCTAGCAGAACGCACTCTTGCAGATCACTATCAAGTTATGTTGTCTGGT
GGTGATGGTTACGATTCAAGGATTGCAGAATGGATGAAATTGCTTCTTCCCTTGTCTATGAAGAGAAATATTTGCATAATTACCAACATGGGTGCAATGGACCCCCTTGC
GGCTCAGCAAAAAGTTATAGAAGTAGCAGGTAGTCTGGGGCTGAATGTTTCAGTTGCAGTTGCTTATGAGGGTTCGGTAAAAGAATCAGGAATTAGCACGTATATGGGAG
GAGCACCTATTGTTGAGTGTCTGGAGAAGTACCATCCAAATGTCATAATTACTTCACGTGTTGCAGATGCTGCCCTATTCTTGGCTCCAATGGTCTATGAACTTGGTTGG
AACTGGGATGATTTTCCATTGCTAGCACAGGGAATACTGGCTGGTCATCTTCTGGAATGTGGCTGTCAACTTACAGGGGGATACTTTATGCATCCAGGAGATAAGTATAG
GAGCATGTCTTTCCAACAGCTTCTGAATATATCACTGCCTTATGCGGAAGTTGAATGTGATGGAAAGTTAACTGTAGCCAAGCCTGAAGAGAGTGGAGGTCTTTTGAATT
TCAGTACATGTGCTGAACAACTTCTGTACGAGATTGGTGATCCATCAGCTTATATCACCCCTGATTTGGTGGTTGACTTCAGCAATGTTTCGTTTTGCTCTATATCCAGC
TCTAGGGTTGTATGTTCCGGAGCAAAACCGTCTATTCAAGGAGTGCCGGAGAAACTCTTGCAGTTGGCCCCAAAGGACTGTGGATGGAAAGGATGGGGAGAGATTTCCTA
TGGGGGACGTGAATGTGTTCTGCGTGCTAAAGCTGCAGAATATCTGGTTCGGTCATGGATGGAAGAACTGTTGATTGGTATTAATGAGCATATAGTTTCTTACACAATTG
GACTCGACAGCCTTAAAGCATCCAGCAATAGTAGCAATTGTATTGAAGATATTAGGTTGCGCATGGATGGACTCTTTGAGCAGAAGGAGCACGCTCTCCTGTTTGTTAAA
GAATTTACAGCTTTATACACAAATGGGCCAGCTGGTGGTGGCGGCATCAGCACTGGCTACAAGAAAGAAATTGTGCTTGAAAAACAACTGGTTGGGCGTGAAAATATTTT
CTGGCAAACAGAAGTGAAGTGCAGTGAAGCAGTAAAATTAGACAGCCAATCAACAGATCTTCAAAAGGATCCAGCAGAGGCATGTTCTTCGCCCCGAGTAACGTTGCCAT
GTCCGATATCTTCTCATGCAGAGAAACTTTGTACAGGCTCCTTCCCACCAGAAACGGGTCATTCTCCTATTCCATCTGGCCAGGAGATTGCTCTTTACGATGTAGCCCAT
AGCAGAGCTGGAGACAAAGGGAATGACTTGAACTTCTCTCTCATTCCTCATTATCCTTCTGATATCGAGCGATTGAAGATGATCATCACGCCTGAATGGGTGATGAGAGT
TCTCTCGGGTCTGCATAATTTGACTCGGTTTCATTCTTCAAATGCTGGTGAGAAGAGAAACGAGTGGGTAAATGAAGATGTGAAGGTTGAAATATACGAAGTTAAATCTA
TACATTCTTTGAATGTCGTTGTTCGTAACATTCTTGACGGTGGCGTAAATTGCTCACGGAGAATCGATCGCCATGGAAAGACTATATCGGATCTCATCTTGAACCAGCTA
ATTGTTTTGCCACCTGGACAATAA
Protein sequenceShow/hide protein sequence
MQMCSVPIRTPSWFSTRKLFEQKLAELHKCTDLNQVKQLHAQILKSNLHVDLFVVPKLISAFSLCRQMLLATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQ
RDGFYPDNFTFPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVSMGARRDVVSWNSMISGLAKGGLYEEARKVFDEMPKRDGIS
WNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGGMEMARMLFDKMPVKNLVSWTIIVSGFAEKGLAREAIDLFDQMEKACLKLDNGTIISILDACAESGL
LGLGEKIHASIKNNNFKCTTEISNALVDMYAKCGRLNIAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALELFKKMKEEGFSPNRVTMIGVLCACTHAGLIDDGIRYFS
TMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMTPNAIIWGTLLGACRMHNAVELAREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANTRLRMRSIGTK
KPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVLMERHSQADIHDCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADHYQVMLSG
GDGYDSRIAEWMKLLLPLSMKRNICIITNMGAMDPLAAQQKVIEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGW
NWDDFPLLAQGILAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSISS
SRVVCSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIEDIRLRMDGLFEQKEHALLFVK
EFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCSEAVKLDSQSTDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPETGHSPIPSGQEIALYDVAH
SRAGDKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDVKVEIYEVKSIHSLNVVVRNILDGGVNCSRRIDRHGKTISDLILNQL
IVLPPGQ