; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0025315 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0025315
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr06:1344217..1353030
RNA-Seq ExpressionPI0025315
SyntenyPI0025315
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR010839 - Acyclic terpene utilisation
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK17586.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0092.88Show/hide
Query:  MQSDGFYPDNFTFPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEARK
        MQ DGFYPDNFTFPFLLKVCTGNVWLPVVE VHAQIE+FGFMSDVFVPNSLIDSYSKCGSRGI AAKKLFVSMGA RDVVSWNSMISGLAKGGLYEEARK
Subjt:  MQSDGFYPDNFTFPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEARK

Query:  MFDEMPKRDCISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEK
        +FDEMPKRD ISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAG MEMARMLFDKMPVKNLVSWTII+SGFAEKGLAREAI LFDQMEK
Subjt:  MFDEMPKRDCISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEK

Query:  ACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELNIAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMK
        ACLKLDNGTII IL ACAESGLLGLGEKIHASIKNNNFKCT EISNAL+DMYAKCG LNIAY +F+DIKNKDVVSWNAMLQGLAMHGHG+KALELFK+MK
Subjt:  ACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELNIAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMK

Query:  EEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVELAREVLD
        EEGFSP++VTMIGVLCACTHAGLIDDGIRYFS MERDY LVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPM PNAIIWGTLLGACRMHNAVELAREVLD
Subjt:  EEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVELAREVLD

Query:  HLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFTVFDRSHPKSDNIYQVLMERHSQADIHDCTIKLRVNPQKQR
        HLVEL+P DSGN SMLSNIYAAAGDW+CVANTRLRMRSIGT+KPSGASSIE+DNE                   VLMERHSQADIHDCTIKLRVNP+KQR
Subjt:  HLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFTVFDRSHPKSDNIYQVLMERHSQADIHDCTIKLRVNPQKQR

Query:  DKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNICIITNMGAMDPPAAQQNVIEVAGS
        DKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLAD YQVMLSGGDGYDSRIA+WMKLLLPL+MKRNICIITNMGAMDP AAQQ VIEVAGS
Subjt:  DKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNICIITNMGAMDPPAAQQNVIEVAGS

Query:  LGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFMHPGDKYRS
        LGLNV VAVAYEGSVKESGISTYMG APIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFMHPGDKYRS
Subjt:  LGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFMHPGDKYRS

Query:  MSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSSRVICSGAKPSIQGVPEKLLQLAPKDCG
        MSFQQLLNISLPYAEVECDGKLTVAKP+ESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCS SSSRV+CSGAKPSIQGVPEKLLQLAPKDCG
Subjt:  MSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSSRVICSGAKPSIQGVPEKLLQLAPKDCG

Query:  WKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQKEHALLFVKEFTALYTNGPAGGGGIS
        WKGWGEISYGGRECVLRAK AEYLVRSWMEELLIGIN HIVSYTIGLDSLKASSNSSNC+EDIRLRMDGLFEQKEHALLFVKEFTALYTNGPAGGGGIS
Subjt:  WKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQKEHALLFVKEFTALYTNGPAGGGGIS

XP_004134329.1 uncharacterized protein LOC101212841 isoform X2 [Cucumis sativus]0.0e+0094.79Show/hide
Query:  MERHSQADIHDCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNI
        ME H QADIHDCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLAD YQVMLSGGDGYD RIADWMKLLLPLAMKRNI
Subjt:  MERHSQADIHDCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNI

Query:  CIITNMGAMDPPAAQQNVIEVAGSLGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILA
        CIITNMGAMDPPAAQQNVIEVAGSLGLNV VAVAYEGSVKESGISTYMG APIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILA
Subjt:  CIITNMGAMDPPAAQQNVIEVAGSLGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILA

Query:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSSRVI
        GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKP+ESGGLLNFSTCAEQLLYEIG+PSAYITPDLVVDFSNVSFCS SSSRV+
Subjt:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSSRVI

Query:  CSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQKEH
        CSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAK AEYLVRSWMEELLIGIN HIVSYTIGLDSLKASSN SNCVEDIRLRMDGLFEQKEH
Subjt:  CSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQKEH

Query:  ALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGSFPPETVD
        ALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEV CTEAVKLDSQST LQKD A+ACSSP VTLPCPIS HA++ CTGS PPE   
Subjt:  ALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGSFPPETVD

Query:  SPIPSGQEIALYNVAHSRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSLNVVVRNI
        SPIPSGQEIALYNVAHSRAGDKGNDLNFSLIPH PSD ERLKMIITPEWVMRVLSVL NSTRFHSSNA+EKRNEWV+EDVKVEIYEVKGIHSLNVVVRNI
Subjt:  SPIPSGQEIALYNVAHSRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSLNVVVRNI

Query:  LDGGVNCSRRIDRHGKTISDLILNQPIVLPPGQ
        LDGGVNCSRRIDRHGKTISDLILNQ IVLPP Q
Subjt:  LDGGVNCSRRIDRHGKTISDLILNQPIVLPPGQ

XP_008438065.1 PREDICTED: uncharacterized protein LOC103483286 [Cucumis melo]0.0e+0095.1Show/hide
Query:  MERHSQADIHDCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNI
        MERHSQADIHDCTIKLRVNP+KQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLAD YQVMLSGGDGYDSRIA+WMKLLLPL+MKRNI
Subjt:  MERHSQADIHDCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNI

Query:  CIITNMGAMDPPAAQQNVIEVAGSLGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILA
        CIITNMGAMDP AAQQ VIEVAGSLGLNV VAVAYEGSVKESGISTYMG APIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILA
Subjt:  CIITNMGAMDPPAAQQNVIEVAGSLGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILA

Query:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSSRVI
        GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKP+ESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCS SSSRV+
Subjt:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSSRVI

Query:  CSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQKEH
        CSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAK AEYLVRSWMEELLIGIN HIVSYTIGLDSLKASSNSSNC+EDIRLRMDGLFEQKEH
Subjt:  CSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQKEH

Query:  ALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGSFPPETVD
        ALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKC+EAVKLDSQST LQKD A+ACSSP VTLPCPIS+HAEK CTGSFPPET  
Subjt:  ALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGSFPPETVD

Query:  SPIPSGQEIALYNVAHSRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSLNVVVRNI
        SPIPSGQEIALY+VAHSRAGDKGNDLNFSLIPHYPSD ERLKMIITPEWVMRVLS L N TRFHSSNA EKRNEWVNEDVKVEIYEVK IHSLNVVVRNI
Subjt:  SPIPSGQEIALYNVAHSRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSLNVVVRNI

Query:  LDGGVNCSRRIDRHGKTISDLILNQPIVLPPGQ
        LDGGVNCSRRIDRHGKTISDLILNQ IVLPPGQ
Subjt:  LDGGVNCSRRIDRHGKTISDLILNQPIVLPPGQ

XP_031738474.1 uncharacterized protein LOC101212841 isoform X1 [Cucumis sativus]0.0e+0094.22Show/hide
Query:  DCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNICIITNMGAMD
        +C    RVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLAD YQVMLSGGDGYD RIADWMKLLLPLAMKRNICIITNMGAMD
Subjt:  DCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNICIITNMGAMD

Query:  PPAAQQNVIEVAGSLGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQL
        PPAAQQNVIEVAGSLGLNV VAVAYEGSVKESGISTYMG APIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQL
Subjt:  PPAAQQNVIEVAGSLGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQL

Query:  TGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSSRVICSGAKPSIQG
        TGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKP+ESGGLLNFSTCAEQLLYEIG+PSAYITPDLVVDFSNVSFCS SSSRV+CSGAKPSIQG
Subjt:  TGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSSRVICSGAKPSIQG

Query:  VPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQKEHALLFVKEFTA
        VPEKLLQLAPKDCGWKGWGEISYGGRECVLRAK AEYLVRSWMEELLIGIN HIVSYTIGLDSLKASSN SNCVEDIRLRMDGLFEQKEHALLFVKEFTA
Subjt:  VPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQKEHALLFVKEFTA

Query:  LYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGSFPPETVDSPIPSGQEIA
        LYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEV CTEAVKLDSQST LQKD A+ACSSP VTLPCPIS HA++ CTGS PPE   SPIPSGQEIA
Subjt:  LYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGSFPPETVDSPIPSGQEIA

Query:  LYNVAHSRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSLNVVVRNILDGGVNCSRR
        LYNVAHSRAGDKGNDLNFSLIPH PSD ERLKMIITPEWVMRVLSVL NSTRFHSSNA+EKRNEWV+EDVKVEIYEVKGIHSLNVVVRNILDGGVNCSRR
Subjt:  LYNVAHSRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSLNVVVRNILDGGVNCSRR

Query:  IDRHGKTISDLILNQPIVLPPGQ
        IDRHGKTISDLILNQ IVLPP Q
Subjt:  IDRHGKTISDLILNQPIVLPPGQ

XP_031738475.1 uncharacterized protein LOC101212841 isoform X3 [Cucumis sativus]0.0e+0093.74Show/hide
Query:  DCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNICIITNMGAMD
        +C    RVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLAD YQVMLSGGDGYD R  +WMKLLLPLAMKRNICIITNMGAMD
Subjt:  DCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNICIITNMGAMD

Query:  PPAAQQNVIEVAGSLGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQL
        PPAAQQNVIEVAGSLGLNV VAVAYEGSVKESGISTYMG APIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQL
Subjt:  PPAAQQNVIEVAGSLGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQL

Query:  TGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSSRVICSGAKPSIQG
        TGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKP+ESGGLLNFSTCAEQLLYEIG+PSAYITPDLVVDFSNVSFCS SSSRV+CSGAKPSIQG
Subjt:  TGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSSRVICSGAKPSIQG

Query:  VPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQKEHALLFVKEFTA
        VPEKLLQLAPKDCGWKGWGEISYGGRECVLRAK AEYLVRSWMEELLIGIN HIVSYTIGLDSLKASSN SNCVEDIRLRMDGLFEQKEHALLFVKEFTA
Subjt:  VPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQKEHALLFVKEFTA

Query:  LYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGSFPPETVDSPIPSGQEIA
        LYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEV CTEAVKLDSQST LQKD A+ACSSP VTLPCPIS HA++ CTGS PPE   SPIPSGQEIA
Subjt:  LYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGSFPPETVDSPIPSGQEIA

Query:  LYNVAHSRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSLNVVVRNILDGGVNCSRR
        LYNVAHSRAGDKGNDLNFSLIPH PSD ERLKMIITPEWVMRVLSVL NSTRFHSSNA+EKRNEWV+EDVKVEIYEVKGIHSLNVVVRNILDGGVNCSRR
Subjt:  LYNVAHSRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSLNVVVRNILDGGVNCSRR

Query:  IDRHGKTISDLILNQPIVLPPGQ
        IDRHGKTISDLILNQ IVLPP Q
Subjt:  IDRHGKTISDLILNQPIVLPPGQ

TrEMBL top hitse value%identityAlignment
A0A0A0L7H7 Uncharacterized protein0.0e+0094Show/hide
Query:  MQICSVPIRTPSWFSTRKLLEQKLSDLHKCSDLNQVKQLHAQIVKSNLHVDLYVVPKLISAFSLCRHMPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPS
        MQ+CSVPIRTPSWFSTRKLLEQKLSDLHKC++LNQVKQLHAQI+KSNLHVDL+VVPKLISAFSLCR M LATNAFNQVQYPNVHLYNTMIRAHSHNSQPS
Subjt:  MQICSVPIRTPSWFSTRKLLEQKLSDLHKCSDLNQVKQLHAQIVKSNLHVDLYVVPKLISAFSLCRHMPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPS

Query:  QAFATFFAMQSDGFYPDNFTFPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKG
        QAFATFFAMQ DG Y DNFTFPFLLKVCTGNVWLPV+E+VHAQIE+FGFMSDVFVPNSLIDSYSKCGS GI AAKKLFVSMGA RDVVSWNSMISGLAKG
Subjt:  QAFATFFAMQSDGFYPDNFTFPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKG

Query:  GLYEEARKMFDEMPKRDCISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAI
        GLYEEARK+FDEMP++D ISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTII+SGFAEKGLAREAI
Subjt:  GLYEEARKMFDEMPKRDCISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAI

Query:  GLFDQMEKACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELNIAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKA
         LFDQMEKACLKLDNGT++ ILAACAESGLLGLGEKIHASIKNNNFKCT EISNAL+DMYAKCG LNIAY +FNDIKNKDVVSWNAMLQGLAMHGHGVKA
Subjt:  GLFDQMEKACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELNIAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKA

Query:  LELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAV
        LELFKRMKEEGFSP+KVTMIGVLCACTHAGLIDDGIRYFS MERDY LVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPM PNAIIWGTLLGACRMHNAV
Subjt:  LELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAV

Query:  ELAREVLDHLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFTVFDRSHPKSDNIYQVLMERHSQADIHDCTIKL
        ELAREVLDHLVEL+P DSGNFSMLSNIYAAAGDW+CVANTRLRMRSIGT+KPSGASSIE++NEVHEFTVFDRSHPKSDNIYQVLME H QADIHDCTIKL
Subjt:  ELAREVLDHLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFTVFDRSHPKSDNIYQVLMERHSQADIHDCTIKL

Query:  RVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNICIITNMGAMDPPAAQQ
        RVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLAD YQVMLSGGDGYD RIADWMKLLLPLAMKRNICIITNMGAMDPPAAQQ
Subjt:  RVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNICIITNMGAMDPPAAQQ

Query:  NVIEVAGSLGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFM
        NVIEVAGSLGLNV VAVAYEGSVKESGISTYMG APIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFM
Subjt:  NVIEVAGSLGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFM

Query:  HPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSSRVICSGAKPSIQGVPEKLL
        HPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKP+ESGGLLNFSTCAEQLLYEIG+PSAYITPDLVVDFSNVSFCS SSSRV+CSGAKPSIQGVPEKLL
Subjt:  HPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSSRVICSGAKPSIQGVPEKLL

Query:  QLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQKEHALLFVKEFTALYTNGP
        QLAPKDCGWKGWGEISYGGRECVLRAK AEYLVRSWMEELLIGIN HIVSYTIGLDSLKASSN SNCVEDIRLRMDGLFEQKEHALLFVKEFTALYTNGP
Subjt:  QLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQKEHALLFVKEFTALYTNGP

Query:  AGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGSFPPETVDSPIPSGQEIALYNVAH
        AGGGGISTGYKKEIVLEKQLVGRENIFWQTEV CTEAVKLDSQST LQKD A+ACSSP VTLPCPIS HA++ CTGS PPE   SPIPSGQEIALYNVAH
Subjt:  AGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGSFPPETVDSPIPSGQEIALYNVAH

Query:  SRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGK
        SRAGDKGNDLNFSLIPH PSD ERLKMIITPEWVMRVLSVL NSTRFHSSNA+EKRNEWV+EDVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGK
Subjt:  SRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGK

Query:  TISDLILNQPIVLPPGQ
        TISDLILNQ IVLPP Q
Subjt:  TISDLILNQPIVLPPGQ

A0A1S3AV50 uncharacterized protein LOC1034832860.0e+0095.1Show/hide
Query:  MERHSQADIHDCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNI
        MERHSQADIHDCTIKLRVNP+KQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLAD YQVMLSGGDGYDSRIA+WMKLLLPL+MKRNI
Subjt:  MERHSQADIHDCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNI

Query:  CIITNMGAMDPPAAQQNVIEVAGSLGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILA
        CIITNMGAMDP AAQQ VIEVAGSLGLNV VAVAYEGSVKESGISTYMG APIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILA
Subjt:  CIITNMGAMDPPAAQQNVIEVAGSLGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILA

Query:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSSRVI
        GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKP+ESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCS SSSRV+
Subjt:  GHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSSRVI

Query:  CSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQKEH
        CSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAK AEYLVRSWMEELLIGIN HIVSYTIGLDSLKASSNSSNC+EDIRLRMDGLFEQKEH
Subjt:  CSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQKEH

Query:  ALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGSFPPETVD
        ALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKC+EAVKLDSQST LQKD A+ACSSP VTLPCPIS+HAEK CTGSFPPET  
Subjt:  ALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGSFPPETVD

Query:  SPIPSGQEIALYNVAHSRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSLNVVVRNI
        SPIPSGQEIALY+VAHSRAGDKGNDLNFSLIPHYPSD ERLKMIITPEWVMRVLS L N TRFHSSNA EKRNEWVNEDVKVEIYEVK IHSLNVVVRNI
Subjt:  SPIPSGQEIALYNVAHSRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSLNVVVRNI

Query:  LDGGVNCSRRIDRHGKTISDLILNQPIVLPPGQ
        LDGGVNCSRRIDRHGKTISDLILNQ IVLPPGQ
Subjt:  LDGGVNCSRRIDRHGKTISDLILNQPIVLPPGQ

A0A5D3D3E7 Pentatricopeptide repeat-containing protein0.0e+0092.88Show/hide
Query:  MQSDGFYPDNFTFPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEARK
        MQ DGFYPDNFTFPFLLKVCTGNVWLPVVE VHAQIE+FGFMSDVFVPNSLIDSYSKCGSRGI AAKKLFVSMGA RDVVSWNSMISGLAKGGLYEEARK
Subjt:  MQSDGFYPDNFTFPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEARK

Query:  MFDEMPKRDCISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEK
        +FDEMPKRD ISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAG MEMARMLFDKMPVKNLVSWTII+SGFAEKGLAREAI LFDQMEK
Subjt:  MFDEMPKRDCISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEK

Query:  ACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELNIAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMK
        ACLKLDNGTII IL ACAESGLLGLGEKIHASIKNNNFKCT EISNAL+DMYAKCG LNIAY +F+DIKNKDVVSWNAMLQGLAMHGHG+KALELFK+MK
Subjt:  ACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELNIAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMK

Query:  EEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVELAREVLD
        EEGFSP++VTMIGVLCACTHAGLIDDGIRYFS MERDY LVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPM PNAIIWGTLLGACRMHNAVELAREVLD
Subjt:  EEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVELAREVLD

Query:  HLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFTVFDRSHPKSDNIYQVLMERHSQADIHDCTIKLRVNPQKQR
        HLVEL+P DSGN SMLSNIYAAAGDW+CVANTRLRMRSIGT+KPSGASSIE+DNE                   VLMERHSQADIHDCTIKLRVNP+KQR
Subjt:  HLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFTVFDRSHPKSDNIYQVLMERHSQADIHDCTIKLRVNPQKQR

Query:  DKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNICIITNMGAMDPPAAQQNVIEVAGS
        DKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLAD YQVMLSGGDGYDSRIA+WMKLLLPL+MKRNICIITNMGAMDP AAQQ VIEVAGS
Subjt:  DKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNICIITNMGAMDPPAAQQNVIEVAGS

Query:  LGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFMHPGDKYRS
        LGLNV VAVAYEGSVKESGISTYMG APIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFMHPGDKYRS
Subjt:  LGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFMHPGDKYRS

Query:  MSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSSRVICSGAKPSIQGVPEKLLQLAPKDCG
        MSFQQLLNISLPYAEVECDGKLTVAKP+ESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCS SSSRV+CSGAKPSIQGVPEKLLQLAPKDCG
Subjt:  MSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSSRVICSGAKPSIQGVPEKLLQLAPKDCG

Query:  WKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQKEHALLFVKEFTALYTNGPAGGGGIS
        WKGWGEISYGGRECVLRAK AEYLVRSWMEELLIGIN HIVSYTIGLDSLKASSNSSNC+EDIRLRMDGLFEQKEHALLFVKEFTALYTNGPAGGGGIS
Subjt:  WKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQKEHALLFVKEFTALYTNGPAGGGGIS

A0A6J1IGN9 uncharacterized protein LOC111474742 isoform X10.0e+0083.94Show/hide
Query:  YQVLMERHSQADIHDCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAM
        + +LMER  + D+HDCTIKLRVNP+K+RDKV IGCGAGFGGDRPTAALKLLQRVK+LNYLVLECLAERTLADR+Q M SGGDGYDSRIADWMKLLLPLA+
Subjt:  YQVLMERHSQADIHDCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAM

Query:  KRNICIITNMGAMDPPAAQQNVIEVAGSLGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQ
        KRNICIITNMGAMDPP AQQNVIE+A SLGL+V VAVAYE SVKESGISTY+GAAPIV+CLEKYHPNVIITSRVADAALF+APMVYELGWNWDDFP L+Q
Subjt:  KRNICIITNMGAMDPPAAQQNVIEVAGSLGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQ

Query:  GILAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSS
        G LAGHLLECGCQLTGGYFMHPGDK+RSM FQQLL+ISLPYAE++CDGK+ VAK +E+GGLLNFSTCAEQLLYE+GDPSAYITPDLVVD SNVSFCS SS
Subjt:  GILAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSS

Query:  SRVICSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFE
        S+V CSGAKPSIQ VPEKLLQLAPKDCGWKGWGEISYGGRECVLRAK AEYLVRSWMEE+L G+N HIVSY IGLDSLKAS NSS+ VEDIRLRMDGLFE
Subjt:  SRVICSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFE

Query:  QKEHALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGSFPP
         KEHALLFV+EFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRE++FW+  VKCT+AV+LDS+ T L++D AKA +SP VTLPC I A+A+ PC  S  P
Subjt:  QKEHALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGSFPP

Query:  ETVDSPIPSGQEIALYNVAHSRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSLNVV
        ET  SPIPSGQ++ALYNVAHSRAGDKGND+NFS++PHYPSD ERLKMIITPEWV RVLS LQNS+ FH  +A++KR+EWVNE VKVEIYEVKGIHSLNVV
Subjt:  ETVDSPIPSGQEIALYNVAHSRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSLNVV

Query:  VRNILDGGVNCSRRIDRHGKTISDLILNQPIVLPP
        VRNILDGGVNCSRRIDRHGKTISDL+LNQ +VLPP
Subjt:  VRNILDGGVNCSRRIDRHGKTISDLILNQPIVLPP

A0A6J1IJ63 uncharacterized protein LOC111474742 isoform X30.0e+0084.2Show/hide
Query:  VLMERHSQADIHDCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKR
        +LMER  + D+HDCTIKLRVNP+K+RDKV IGCGAGFGGDRPTAALKLLQRVK+LNYLVLECLAERTLADR+Q M SGGDGYDSRIADWMKLLLPLA+KR
Subjt:  VLMERHSQADIHDCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKR

Query:  NICIITNMGAMDPPAAQQNVIEVAGSLGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGI
        NICIITNMGAMDPP AQQNVIE+A SLGL+V VAVAYE SVKESGISTY+GAAPIV+CLEKYHPNVIITSRVADAALF+APMVYELGWNWDDFP L+QG 
Subjt:  NICIITNMGAMDPPAAQQNVIEVAGSLGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGI

Query:  LAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSSR
        LAGHLLECGCQLTGGYFMHPGDK+RSM FQQLL+ISLPYAE++CDGK+ VAK +E+GGLLNFSTCAEQLLYE+GDPSAYITPDLVVD SNVSFCS SSS+
Subjt:  LAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSSR

Query:  VICSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQK
        V CSGAKPSIQ VPEKLLQLAPKDCGWKGWGEISYGGRECVLRAK AEYLVRSWMEE+L G+N HIVSY IGLDSLKAS NSS+ VEDIRLRMDGLFE K
Subjt:  VICSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQK

Query:  EHALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGSFPPET
        EHALLFV+EFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRE++FW+  VKCT+AV+LDS+ T L++D AKA +SP VTLPC I A+A+ PC  S  PET
Subjt:  EHALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGSFPPET

Query:  VDSPIPSGQEIALYNVAHSRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSLNVVVR
          SPIPSGQ++ALYNVAHSRAGDKGND+NFS++PHYPSD ERLKMIITPEWV RVLS LQNS+ FH  +A++KR+EWVNE VKVEIYEVKGIHSLNVVVR
Subjt:  VDSPIPSGQEIALYNVAHSRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSLNVVVR

Query:  NILDGGVNCSRRIDRHGKTISDLILNQPIVLPP
        NILDGGVNCSRRIDRHGKTISDL+LNQ +VLPP
Subjt:  NILDGGVNCSRRIDRHGKTISDLILNQPIVLPP

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic1.2e-11636.51Show/hide
Query:  PSWFSTRKLLEQKLSDLHKCSDLNQVKQLHAQIVKSNLHVDLYVVPKL--ISAFSLCRHMPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFA
        P+  +T     + +S + +C  L Q+KQ H  ++++    D Y   KL  ++A S    +  A   F+++  PN   +NT+IRA++    P  +   F  
Subjt:  PSWFSTRKLLEQKLSDLHKCSDLNQVKQLHAQIVKSNLHVDLYVVPKL--ISAFSLCRHMPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFA

Query:  MQSDG-FYPDNFTFPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEAR
        M S+   YP+ +TFPFL+K       L + +++H    +    SDVFV NSLI  Y  CG   + +A K+F ++   +DVVSWNSMI+G  + G  ++A 
Subjt:  MQSDG-FYPDNFTFPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEAR

Query:  KMFDEMPKRD--------------CIS-------------------------WNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMA
        ++F +M   D              C                            N MLD Y K G ++DA +LFD M E++ V+W+TM+ GY  + D E A
Subjt:  KMFDEMPKRD--------------CIS-------------------------WNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMA

Query:  RMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQME-KACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELN
        R + + MP K++V+W  +IS + + G   EA+ +F +++ +  +KL+  T++  L+ACA+ G L LG  IH+ IK +  +    +++ALI MY+KCG+L 
Subjt:  RMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQME-KACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELN

Query:  IAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGR
         +  +FN ++ +DV  W+AM+ GLAMHG G +A+++F +M+E    P+ VT   V CAC+H GL+D+    F  ME +Y +VPE +HY C+VD+LGR G 
Subjt:  IAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGR

Query:  LEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVELAREVLDHLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEF
        LE+A++ I  MP+ P+  +WG LLGAC++H  + LA      L+EL+P + G   +LSNIYA  G W+ V+  R  MR  G +K  G SSIEID  +HEF
Subjt:  LEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVELAREVLDHLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEF

Query:  TVFDRSHPKSDNIYQVLME
           D +HP S+ +Y  L E
Subjt:  TVFDRSHPKSDNIYQVLME

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic2.0e-11938.06Show/hide
Query:  LSDLHKCSDLNQVKQLHAQIVKSNLHVDLYVVPKLISAFSLCRH---MPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQSDGFYPDNFT
        LS LH C  L  ++ +HAQ++K  LH   Y + KLI    L  H   +P A + F  +Q PN+ ++NTM R H+ +S P  A   +  M S G  P+++T
Subjt:  LSDLHKCSDLNQVKQLHAQIVKSNLHVDLYVVPKLISAFSLCRH---MPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQSDGFYPDNFT

Query:  FPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEARKMFDEMPKRDCIS
        FPF+LK C  +      + +H  + + G   D++V  SLI  Y + G   +  A K+F      RDVVS+ ++I G A  G  E A+K+FDE+P +D +S
Subjt:  FPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEARKMFDEMPKRDCIS

Query:  WNTMLDGYVKVGKMDDAFKLFDEMPERNV-VSWSTMV-------------LG-------------------------YCKAGDMEMARMLFDKMPVKNLV
        WN M+ GY + G   +A +LF +M + NV    STMV             LG                         Y K G++E A  LF+++P K+++
Subjt:  WNTMLDGYVKVGKMDDAFKLFDEMPERNV-VSWSTMV-------------LG-------------------------YCKAGDMEMARMLFDKMPVKNLV

Query:  SWTIIISGFAEKGLAREAIGLFDQMEKACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCT--IEISNALIDMYAKCGELNIAYGIFNDIKNK
        SW  +I G+    L +EA+ LF +M ++    ++ T++ IL ACA  G + +G  IH  I       T    +  +LIDMYAKCG++  A+ +FN I +K
Subjt:  SWTIIISGFAEKGLAREAIGLFDQMEKACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCT--IEISNALIDMYAKCGELNIAYGIFNDIKNK

Query:  DVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMP
         + SWNAM+ G AMHG    + +LF RM++ G  PD +T +G+L AC+H+G++D G   F  M +DY + P++EHYGCM+DLLG  G  +EA  +I  M 
Subjt:  DVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMP

Query:  MGPNAIIWGTLLGACRMHNAVELAREVLDHLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFTVFDRSHPKSDN
        M P+ +IW +LL AC+MH  VEL     ++L++++P + G++ +LSNIYA+AG W+ VA TR  +   G +K  G SSIEID+ VHEF + D+ HP++  
Subjt:  MGPNAIIWGTLLGACRMHNAVELAREVLDHLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFTVFDRSHPKSDN

Query:  IYQVLME
        IY +L E
Subjt:  IYQVLME

Q9LS72 Pentatricopeptide repeat-containing protein At3g292303.9e-22462.54Show/hide
Query:  SVPIRTPSWFSTRKLLEQKLSDLHKCSDLNQVKQLHAQIVKSNLHVDLYVVPKLISAFSLCRHMPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFA
        S+P+R PSW S+R++ E++L DL KC++LNQVKQLHAQI++ NLH DL++ PKLISA SLCR   LA   FNQVQ PNVHL N++IRAH+ NSQP QAF 
Subjt:  SVPIRTPSWFSTRKLLEQKLSDLHKCSDLNQVKQLHAQIVKSNLHVDLYVVPKLISAFSLCRHMPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFA

Query:  TFFAMQSDGFYPDNFTFPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYE
         F  MQ  G + DNFT+PFLLK C+G  WLPVV+ +H  IE+ G  SD++VPN+LID YS+CG  G+  A KLF  M   RD VSWNSM+ GL K G   
Subjt:  TFFAMQSDGFYPDNFTFPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYE

Query:  EARKMFDEMPKRDCISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPV--KNLVSWTIIISGFAEKGLAREAIGL
        +AR++FDEMP+RD ISWNTMLDGY +  +M  AF+LF++MPERN VSWSTMV+GY KAGDMEMAR++FDKMP+  KN+V+WTIII+G+AEKGL +EA  L
Subjt:  EARKMFDEMPKRDCISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPV--KNLVSWTIIISGFAEKGLAREAIGL

Query:  FDQMEKACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELNIAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALE
         DQM  + LK D   +I ILAAC ESGLL LG +IH+ +K +N      + NAL+DMYAKCG L  A+ +FNDI  KD+VSWN ML GL +HGHG +A+E
Subjt:  FDQMEKACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELNIAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALE

Query:  LFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVEL
        LF RM+ EG  PDKVT I VLC+C HAGLID+GI YF  ME+ Y LVP+VEHYGC+VDLLGR GRL+EAI++++ MPM PN +IWG LLGACRMHN V++
Subjt:  LFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVEL

Query:  AREVLDHLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFTVFDRSHPKSDNIYQVL
        A+EVLD+LV+L P D GN+S+LSNIYAAA DW+ VA+ R +M+S+G +KPSGASS+E+++ +HEFTVFD+SHPKSD IYQ+L
Subjt:  AREVLDHLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFTVFDRSHPKSDNIYQVL

Q9SR82 Putative pentatricopeptide repeat-containing protein At3g088206.4e-11039.46Show/hide
Query:  LNQVKQLHAQIVKSNLHVDLYVVPKLISAFSLCRHMPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQSDGFYPDNFTFPFLLKVCTGNV
        +N +KQ+H  ++  +LH D ++V  L+      R    +   F+  Q+PN+ LYN++I    +N    +    F +++  G Y   FTFP +LK CT   
Subjt:  LNQVKQLHAQIVKSNLHVDLYVVPKLISAFSLCRHMPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQSDGFYPDNFTFPFLLKVCTGNV

Query:  WLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEARKMFDEM----PKRDCISWNTMLDGY
           +   +H+ + + GF  DV    SL+  YS  GS  +  A KLF  +   R VV+W ++ SG    G + EA  +F +M     K D      +L   
Subjt:  WLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEARKMFDEM----PKRDCISWNTMLDGY

Query:  VKVGKMDDA---FKLFDEMP-ERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKACLKLDNGTIIGILAA
        V VG +D      K  +EM  ++N    +T+V  Y K G ME AR +FD M  K++V+W+ +I G+A     +E I LF QM +  LK D  +I+G L++
Subjt:  VKVGKMDDA---FKLFDEMP-ERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKACLKLDNGTIIGILAA

Query:  CAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELNIAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLC
        CA  G L LGE   + I  + F   + ++NALIDMYAKCG +   + +F ++K KD+V  NA + GLA +GH   +  +F + ++ G SPD  T +G+LC
Subjt:  CAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELNIAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLC

Query:  ACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVELAREVLDHLVELQPYDSGNFSML
         C HAGLI DG+R+F+ +   YAL   VEHYGCMVDL GR G L++A RLI +MPM PNAI+WG LL  CR+    +LA  VL  L+ L+P+++GN+  L
Subjt:  ACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVELAREVLDHLVELQPYDSGNFSML

Query:  SNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFTVFDRSHPKSDNIYQVL
        SNIY+  G WD  A  R  M   G +K  G S IE++ +VHEF   D+SHP SD IY  L
Subjt:  SNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFTVFDRSHPKSDNIYQVL

Q9SY02 Pentatricopeptide repeat-containing protein At4g027501.4e-10939.77Show/hide
Query:  ATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQSDGFYPDNFTFPFLLKVCTGNVWLPVVETVHAQIEQFGFMS--DVFVPNSLIDSYSKCGS
        A + F+++   N   +N ++ A+  NS+  +A   F + ++      N         C    ++   + V A+ + F  M+  DV   N++I  Y++ G 
Subjt:  ATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQSDGFYPDNFTFPFLLKVCTGNVWLPVVETVHAQIEQFGFMS--DVFVPNSLIDSYSKCGS

Query:  RGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEARKMFDEMPKRDCISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMA
          I  A++LF      +DV +W +M+SG  +  + EEAR++FD+MP+R+ +SWN ML GYV+  +M+ A +LFD MP RNV +W+TM+ GY + G +  A
Subjt:  RGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEARKMFDEMPKRDCISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMA

Query:  RMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELNI
        + LFDKMP ++ VSW  +I+G+++ G + EA+ LF QME+   +L+  +    L+ CA+   L LG+++H  +    ++    + NAL+ MY KCG +  
Subjt:  RMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELNI

Query:  AYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRL
        A  +F ++  KD+VSWN M+ G + HG G  AL  F+ MK EG  PD  TM+ VL AC+H GL+D G +YF  M +DY ++P  +HY CMVDLLGR G L
Subjt:  AYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRL

Query:  EEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVELAREVLDHLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFT
        E+A  L++NMP  P+A IWGTLLGA R+H   ELA    D +  ++P +SG + +LSN+YA++G W  V   R+RMR  G +K  G S IEI N+ H F+
Subjt:  EEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVELAREVLDHLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFT

Query:  VFDRSHPKSDNIYQVLME
        V D  HP+ D I+  L E
Subjt:  VFDRSHPKSDNIYQVLME

Arabidopsis top hitse value%identityAlignment
AT1G01770.1 unknown protein1.1e-22963.36Show/hide
Query:  DCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNICIITNMGAMD
        DC I LR NP+++R+ V +GCGAGFGGDRP AALKLLQRV+ LNYLVLECLAERTLADR+  M SGG GYD R+++WM+LLLPLA++R  CIITNMGA+D
Subjt:  DCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNICIITNMGAMD

Query:  PPAAQQNVIEVAGSLGLNVLVAVAYE-------GS------VKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQG
        P  AQ+ V+EVAG LGL + VAVA+E       GS          G STY+GAAPIVECLEKY PNVIITSRVADAALFLAPMVYELGWNW+D  LLAQG
Subjt:  PPAAQQNVIEVAGSLGLNVLVAVAYE-------GS------VKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQG

Query:  ILAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSS
         LAGHLLECGCQLTGGYFMHPGD+YR M+F  L ++SLPYAE+  DGK+ V+K + SGG+LN STCAEQLLYEI DPSAYITPD+V+D   VSF   S  
Subjt:  ILAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSSS

Query:  RVICSGAKPSIQ-GVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVE---DIRLRMDG
        +V CSGAKPS    VPEKLL+L PK+CGWKGWGEISYGG   + RAK +E+LVRSWMEE + G+N  I+SY IG+DSLKA+SN +   +   DIRLRMDG
Subjt:  RVICSGAKPSIQ-GVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVE---DIRLRMDG

Query:  LFEQKEHALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGS
        LF+ KEHA+   KEFTALYTNGPAGGGGISTG+K EIVLEK+LV RE++ W+T ++ T   + ++   +  + + K     P  L               
Subjt:  LFEQKEHALLFVKEFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGS

Query:  FPPETVDSPIPSGQEIALYNVAHSRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSL
        +      SP PSGQ+I LY+VAHSRAGDKGND+NFS+IPHY  D ERLK+IITP+WV  V+SVL +++ F   +A+      ++E+V VEIY+V+GIH++
Subjt:  FPPETVDSPIPSGQEIALYNVAHSRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSL

Query:  NVVVRNILDGGVNCSRRIDRHGKTISDLILNQPIVL
        NVVVRNILDGGVNCSRRIDRHGKTISDLIL Q +VL
Subjt:  NVVVRNILDGGVNCSRRIDRHGKTISDLILNQPIVL

AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-12038.06Show/hide
Query:  LSDLHKCSDLNQVKQLHAQIVKSNLHVDLYVVPKLISAFSLCRH---MPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQSDGFYPDNFT
        LS LH C  L  ++ +HAQ++K  LH   Y + KLI    L  H   +P A + F  +Q PN+ ++NTM R H+ +S P  A   +  M S G  P+++T
Subjt:  LSDLHKCSDLNQVKQLHAQIVKSNLHVDLYVVPKLISAFSLCRH---MPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQSDGFYPDNFT

Query:  FPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEARKMFDEMPKRDCIS
        FPF+LK C  +      + +H  + + G   D++V  SLI  Y + G   +  A K+F      RDVVS+ ++I G A  G  E A+K+FDE+P +D +S
Subjt:  FPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEARKMFDEMPKRDCIS

Query:  WNTMLDGYVKVGKMDDAFKLFDEMPERNV-VSWSTMV-------------LG-------------------------YCKAGDMEMARMLFDKMPVKNLV
        WN M+ GY + G   +A +LF +M + NV    STMV             LG                         Y K G++E A  LF+++P K+++
Subjt:  WNTMLDGYVKVGKMDDAFKLFDEMPERNV-VSWSTMV-------------LG-------------------------YCKAGDMEMARMLFDKMPVKNLV

Query:  SWTIIISGFAEKGLAREAIGLFDQMEKACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCT--IEISNALIDMYAKCGELNIAYGIFNDIKNK
        SW  +I G+    L +EA+ LF +M ++    ++ T++ IL ACA  G + +G  IH  I       T    +  +LIDMYAKCG++  A+ +FN I +K
Subjt:  SWTIIISGFAEKGLAREAIGLFDQMEKACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCT--IEISNALIDMYAKCGELNIAYGIFNDIKNK

Query:  DVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMP
         + SWNAM+ G AMHG    + +LF RM++ G  PD +T +G+L AC+H+G++D G   F  M +DY + P++EHYGCM+DLLG  G  +EA  +I  M 
Subjt:  DVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMP

Query:  MGPNAIIWGTLLGACRMHNAVELAREVLDHLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFTVFDRSHPKSDN
        M P+ +IW +LL AC+MH  VEL     ++L++++P + G++ +LSNIYA+AG W+ VA TR  +   G +K  G SSIEID+ VHEF + D+ HP++  
Subjt:  MGPNAIIWGTLLGACRMHNAVELAREVLDHLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFTVFDRSHPKSDN

Query:  IYQVLME
        IY +L E
Subjt:  IYQVLME

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.5e-11836.51Show/hide
Query:  PSWFSTRKLLEQKLSDLHKCSDLNQVKQLHAQIVKSNLHVDLYVVPKL--ISAFSLCRHMPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFA
        P+  +T     + +S + +C  L Q+KQ H  ++++    D Y   KL  ++A S    +  A   F+++  PN   +NT+IRA++    P  +   F  
Subjt:  PSWFSTRKLLEQKLSDLHKCSDLNQVKQLHAQIVKSNLHVDLYVVPKL--ISAFSLCRHMPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFA

Query:  MQSDG-FYPDNFTFPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEAR
        M S+   YP+ +TFPFL+K       L + +++H    +    SDVFV NSLI  Y  CG   + +A K+F ++   +DVVSWNSMI+G  + G  ++A 
Subjt:  MQSDG-FYPDNFTFPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEAR

Query:  KMFDEMPKRD--------------CIS-------------------------WNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMA
        ++F +M   D              C                            N MLD Y K G ++DA +LFD M E++ V+W+TM+ GY  + D E A
Subjt:  KMFDEMPKRD--------------CIS-------------------------WNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMA

Query:  RMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQME-KACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELN
        R + + MP K++V+W  +IS + + G   EA+ +F +++ +  +KL+  T++  L+ACA+ G L LG  IH+ IK +  +    +++ALI MY+KCG+L 
Subjt:  RMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQME-KACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELN

Query:  IAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGR
         +  +FN ++ +DV  W+AM+ GLAMHG G +A+++F +M+E    P+ VT   V CAC+H GL+D+    F  ME +Y +VPE +HY C+VD+LGR G 
Subjt:  IAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGR

Query:  LEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVELAREVLDHLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEF
        LE+A++ I  MP+ P+  +WG LLGAC++H  + LA      L+EL+P + G   +LSNIYA  G W+ V+  R  MR  G +K  G SSIEID  +HEF
Subjt:  LEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVELAREVLDHLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEF

Query:  TVFDRSHPKSDNIYQVLME
           D +HP S+ +Y  L E
Subjt:  TVFDRSHPKSDNIYQVLME

AT3G08820.1 Pentatricopeptide repeat (PPR) superfamily protein4.5e-11139.46Show/hide
Query:  LNQVKQLHAQIVKSNLHVDLYVVPKLISAFSLCRHMPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQSDGFYPDNFTFPFLLKVCTGNV
        +N +KQ+H  ++  +LH D ++V  L+      R    +   F+  Q+PN+ LYN++I    +N    +    F +++  G Y   FTFP +LK CT   
Subjt:  LNQVKQLHAQIVKSNLHVDLYVVPKLISAFSLCRHMPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQSDGFYPDNFTFPFLLKVCTGNV

Query:  WLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEARKMFDEM----PKRDCISWNTMLDGY
           +   +H+ + + GF  DV    SL+  YS  GS  +  A KLF  +   R VV+W ++ SG    G + EA  +F +M     K D      +L   
Subjt:  WLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEARKMFDEM----PKRDCISWNTMLDGY

Query:  VKVGKMDDA---FKLFDEMP-ERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKACLKLDNGTIIGILAA
        V VG +D      K  +EM  ++N    +T+V  Y K G ME AR +FD M  K++V+W+ +I G+A     +E I LF QM +  LK D  +I+G L++
Subjt:  VKVGKMDDA---FKLFDEMP-ERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKACLKLDNGTIIGILAA

Query:  CAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELNIAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLC
        CA  G L LGE   + I  + F   + ++NALIDMYAKCG +   + +F ++K KD+V  NA + GLA +GH   +  +F + ++ G SPD  T +G+LC
Subjt:  CAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELNIAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLC

Query:  ACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVELAREVLDHLVELQPYDSGNFSML
         C HAGLI DG+R+F+ +   YAL   VEHYGCMVDL GR G L++A RLI +MPM PNAI+WG LL  CR+    +LA  VL  L+ L+P+++GN+  L
Subjt:  ACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVELAREVLDHLVELQPYDSGNFSML

Query:  SNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFTVFDRSHPKSDNIYQVL
        SNIY+  G WD  A  R  M   G +K  G S IE++ +VHEF   D+SHP SD IY  L
Subjt:  SNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFTVFDRSHPKSDNIYQVL

AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.7e-22562.54Show/hide
Query:  SVPIRTPSWFSTRKLLEQKLSDLHKCSDLNQVKQLHAQIVKSNLHVDLYVVPKLISAFSLCRHMPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFA
        S+P+R PSW S+R++ E++L DL KC++LNQVKQLHAQI++ NLH DL++ PKLISA SLCR   LA   FNQVQ PNVHL N++IRAH+ NSQP QAF 
Subjt:  SVPIRTPSWFSTRKLLEQKLSDLHKCSDLNQVKQLHAQIVKSNLHVDLYVVPKLISAFSLCRHMPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFA

Query:  TFFAMQSDGFYPDNFTFPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYE
         F  MQ  G + DNFT+PFLLK C+G  WLPVV+ +H  IE+ G  SD++VPN+LID YS+CG  G+  A KLF  M   RD VSWNSM+ GL K G   
Subjt:  TFFAMQSDGFYPDNFTFPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYE

Query:  EARKMFDEMPKRDCISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPV--KNLVSWTIIISGFAEKGLAREAIGL
        +AR++FDEMP+RD ISWNTMLDGY +  +M  AF+LF++MPERN VSWSTMV+GY KAGDMEMAR++FDKMP+  KN+V+WTIII+G+AEKGL +EA  L
Subjt:  EARKMFDEMPKRDCISWNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPV--KNLVSWTIIISGFAEKGLAREAIGL

Query:  FDQMEKACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELNIAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALE
         DQM  + LK D   +I ILAAC ESGLL LG +IH+ +K +N      + NAL+DMYAKCG L  A+ +FNDI  KD+VSWN ML GL +HGHG +A+E
Subjt:  FDQMEKACLKLDNGTIIGILAACAESGLLGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELNIAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALE

Query:  LFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVEL
        LF RM+ EG  PDKVT I VLC+C HAGLID+GI YF  ME+ Y LVP+VEHYGC+VDLLGR GRL+EAI++++ MPM PN +IWG LLGACRMHN V++
Subjt:  LFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIRYFSMMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVEL

Query:  AREVLDHLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFTVFDRSHPKSDNIYQVL
        A+EVLD+LV+L P D GN+S+LSNIYAAA DW+ VA+ R +M+S+G +KPSGASS+E+++ +HEFTVFD+SHPKSD IYQ+L
Subjt:  AREVLDHLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQKPSGASSIEIDNEVHEFTVFDRSHPKSDNIYQVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAATATGCAGCGTCCCAATTCGAACCCCGTCTTGGTTTTCCACTCGAAAGCTCTTAGAGCAGAAGCTCTCCGATCTCCACAAGTGCTCAGACCTCAACCAAGTGAA
GCAACTCCACGCCCAAATCGTCAAATCCAATCTCCACGTCGACCTCTACGTTGTACCCAAACTCATATCTGCCTTCTCCCTCTGTCGCCACATGCCCCTCGCCACTAACG
CTTTCAATCAAGTACAATATCCAAACGTCCATTTGTACAACACTATGATTCGTGCCCACTCCCATAACTCACAACCCTCACAAGCCTTTGCCACTTTCTTTGCTATGCAA
AGTGATGGATTCTACCCTGATAATTTCACTTTCCCGTTTCTTTTGAAAGTTTGTACAGGGAATGTGTGGTTGCCTGTTGTTGAAACGGTACATGCGCAAATCGAGCAATT
TGGGTTCATGTCGGATGTATTTGTGCCAAATTCTCTAATTGATTCGTATTCCAAATGTGGGTCTCGTGGAATTTTGGCAGCAAAGAAGTTGTTCGTGTCAATGGGGGCTT
GTAGGGATGTTGTGTCATGGAATTCAATGATCTCTGGATTAGCGAAAGGTGGGTTGTATGAGGAAGCTCGAAAGATGTTTGATGAAATGCCTAAAAGGGATTGTATTAGT
TGGAACACAATGTTGGATGGGTATGTTAAAGTTGGCAAAATGGATGATGCGTTTAAGTTGTTTGATGAAATGCCTGAGAGGAATGTCGTCTCTTGGTCGACAATGGTGTT
AGGGTATTGCAAGGCGGGGGATATGGAGATGGCAAGAATGTTGTTTGATAAAATGCCTGTGAAGAATTTGGTTTCTTGGACTATAATTATATCTGGGTTTGCTGAGAAAG
GGCTAGCTAGAGAGGCCATTGGTTTGTTTGATCAAATGGAAAAGGCTTGCTTGAAGTTAGACAATGGGACGATAATAGGTATTTTGGCTGCTTGTGCTGAGTCTGGTTTG
CTTGGGCTTGGTGAGAAAATACATGCTTCCATTAAGAACAATAATTTCAAATGTACTATTGAAATCTCCAATGCTTTGATTGATATGTATGCAAAATGTGGTGAGTTGAA
TATTGCATACGGCATTTTTAATGACATTAAAAATAAAGATGTCGTGTCTTGGAATGCTATGCTTCAAGGGCTTGCAATGCATGGACATGGAGTGAAAGCACTTGAACTTT
TCAAAAGAATGAAAGAGGAGGGTTTCTCACCCGATAAAGTTACAATGATTGGAGTCTTGTGTGCTTGTACGCATGCAGGATTGATCGACGATGGCATTCGCTACTTCTCT
ATGATGGAAAGGGACTACGCCCTTGTTCCTGAAGTTGAGCACTATGGCTGCATGGTAGACCTTTTGGGTCGCAAGGGAAGGCTTGAGGAAGCCATAAGGCTCATTCGCAA
CATGCCAATGGGACCAAATGCCATCATTTGGGGAACCCTTTTGGGGGCATGTCGCATGCATAATGCTGTTGAACTTGCGAGGGAGGTTCTTGATCATTTGGTTGAGCTGC
AACCGTATGATTCGGGTAATTTTTCCATGTTGTCTAACATATATGCTGCGGCAGGGGACTGGGACTGTGTCGCCAACACGAGGTTGAGAATGCGGAGTATTGGAACTCAA
AAACCGTCGGGTGCTAGTTCCATTGAGATCGACAATGAGGTTCATGAATTTACAGTTTTTGATCGATCACATCCAAAATCTGATAATATATATCAGGTTCTAATGGAGAG
GCATAGTCAGGCTGACATACATGACTGCACAATTAAACTGAGAGTAAATCCTCAAAAACAGAGAGACAAGGTGTGCATTGGCTGTGGTGCTGGATTTGGAGGTGATAGGC
CAACTGCGGCTCTTAAATTGCTTCAGAGGGTCAAAAACCTAAACTATCTTGTACTTGAATGCCTAGCAGAACGCACTCTTGCAGATCGCTATCAAGTTATGTTGTCTGGT
GGTGATGGTTACGATTCAAGGATTGCAGATTGGATGAAATTGCTTCTTCCCTTGGCTATGAAGAGAAATATTTGCATAATTACCAACATGGGTGCAATGGACCCCCCTGC
GGCTCAGCAAAATGTCATAGAAGTAGCAGGCAGTCTGGGGCTGAATGTTTTAGTTGCAGTTGCTTATGAGGGTTCGGTAAAAGAATCAGGAATTAGCACGTATATGGGAG
CAGCACCTATTGTTGAGTGTCTGGAAAAGTACCATCCAAATGTCATAATTACTTCACGTGTTGCAGATGCTGCCCTATTCTTGGCTCCAATGGTCTATGAACTTGGTTGG
AACTGGGATGATTTTCCATTGCTAGCACAGGGAATACTGGCTGGTCATCTTCTCGAATGTGGCTGTCAACTTACAGGGGGATACTTTATGCATCCAGGAGACAAGTATAG
AAGCATGTCTTTCCAACAGCTTCTGAATATATCACTGCCTTATGCGGAAGTTGAATGTGATGGAAAGTTAACTGTAGCCAAGCCAGATGAGAGTGGAGGTCTTTTGAATT
TCAGTACATGTGCCGAACAACTTCTGTATGAGATTGGTGATCCATCAGCTTATATCACCCCTGACTTGGTGGTTGACTTCAGCAATGTTTCGTTTTGCTCTACATCCAGC
TCTAGGGTTATATGTTCCGGAGCAAAACCATCTATTCAAGGAGTGCCTGAGAAACTCTTGCAGTTGGCTCCAAAGGACTGTGGATGGAAAGGATGGGGAGAGATTTCCTA
TGGGGGACGTGAATGTGTTCTGCGTGCTAAAGGTGCAGAATATCTGGTTCGTTCGTGGATGGAAGAACTGTTGATTGGTATTAATCTGCATATAGTTTCTTACACAATTG
GACTCGACAGCCTTAAAGCATCCAGCAATAGTAGCAATTGTGTTGAAGATATTAGGTTGCGCATGGATGGACTCTTTGAGCAGAAGGAGCACGCTCTCCTGTTTGTTAAA
GAATTTACAGCTTTATACACAAATGGGCCAGCTGGTGGTGGTGGCATCAGCACTGGCTACAAGAAAGAAATTGTGCTTGAAAAACAACTGGTTGGGCGCGAAAATATTTT
CTGGCAAACAGAAGTGAAGTGCACTGAAGCAGTAAAGTTAGACAGTCAATCAACATATCTTCAAAAGGATCTGGCAAAGGCATGTTCTTCGCCCCCAGTAACGTTGCCAT
GTCCGATATCTGCTCATGCAGAGAAACCTTGTACAGGTTCCTTCCCACCAGAAACGGTTGATTCTCCTATTCCATCTGGCCAGGAGATTGCTCTTTACAATGTAGCCCAT
AGCAGAGCTGGAGACAAAGGGAATGACTTGAACTTCTCTCTCATTCCTCATTATCCTTCTGATTTTGAGCGATTGAAGATGATCATCACGCCTGAATGGGTGATGAGAGT
TCTCTCGGTTCTGCAGAATTCGACTCGGTTTCATTCTTCAAATGCTGAAGAGAAAAGAAACGAGTGGGTAAATGAAGATGTGAAGGTTGAAATATATGAAGTTAAAGGTA
TCCATTCTTTGAATGTCGTTGTTCGTAACATTCTTGATGGTGGCGTAAATTGCTCGCGGAGAATTGATCGCCATGGAAAGACCATATCGGATCTCATCTTGAACCAGCCA
ATTGTTTTGCCACCTGGACAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCAAATATGCAGCGTCCCAATTCGAACCCCGTCTTGGTTTTCCACTCGAAAGCTCTTAGAGCAGAAGCTCTCCGATCTCCACAAGTGCTCAGACCTCAACCAAGTGAA
GCAACTCCACGCCCAAATCGTCAAATCCAATCTCCACGTCGACCTCTACGTTGTACCCAAACTCATATCTGCCTTCTCCCTCTGTCGCCACATGCCCCTCGCCACTAACG
CTTTCAATCAAGTACAATATCCAAACGTCCATTTGTACAACACTATGATTCGTGCCCACTCCCATAACTCACAACCCTCACAAGCCTTTGCCACTTTCTTTGCTATGCAA
AGTGATGGATTCTACCCTGATAATTTCACTTTCCCGTTTCTTTTGAAAGTTTGTACAGGGAATGTGTGGTTGCCTGTTGTTGAAACGGTACATGCGCAAATCGAGCAATT
TGGGTTCATGTCGGATGTATTTGTGCCAAATTCTCTAATTGATTCGTATTCCAAATGTGGGTCTCGTGGAATTTTGGCAGCAAAGAAGTTGTTCGTGTCAATGGGGGCTT
GTAGGGATGTTGTGTCATGGAATTCAATGATCTCTGGATTAGCGAAAGGTGGGTTGTATGAGGAAGCTCGAAAGATGTTTGATGAAATGCCTAAAAGGGATTGTATTAGT
TGGAACACAATGTTGGATGGGTATGTTAAAGTTGGCAAAATGGATGATGCGTTTAAGTTGTTTGATGAAATGCCTGAGAGGAATGTCGTCTCTTGGTCGACAATGGTGTT
AGGGTATTGCAAGGCGGGGGATATGGAGATGGCAAGAATGTTGTTTGATAAAATGCCTGTGAAGAATTTGGTTTCTTGGACTATAATTATATCTGGGTTTGCTGAGAAAG
GGCTAGCTAGAGAGGCCATTGGTTTGTTTGATCAAATGGAAAAGGCTTGCTTGAAGTTAGACAATGGGACGATAATAGGTATTTTGGCTGCTTGTGCTGAGTCTGGTTTG
CTTGGGCTTGGTGAGAAAATACATGCTTCCATTAAGAACAATAATTTCAAATGTACTATTGAAATCTCCAATGCTTTGATTGATATGTATGCAAAATGTGGTGAGTTGAA
TATTGCATACGGCATTTTTAATGACATTAAAAATAAAGATGTCGTGTCTTGGAATGCTATGCTTCAAGGGCTTGCAATGCATGGACATGGAGTGAAAGCACTTGAACTTT
TCAAAAGAATGAAAGAGGAGGGTTTCTCACCCGATAAAGTTACAATGATTGGAGTCTTGTGTGCTTGTACGCATGCAGGATTGATCGACGATGGCATTCGCTACTTCTCT
ATGATGGAAAGGGACTACGCCCTTGTTCCTGAAGTTGAGCACTATGGCTGCATGGTAGACCTTTTGGGTCGCAAGGGAAGGCTTGAGGAAGCCATAAGGCTCATTCGCAA
CATGCCAATGGGACCAAATGCCATCATTTGGGGAACCCTTTTGGGGGCATGTCGCATGCATAATGCTGTTGAACTTGCGAGGGAGGTTCTTGATCATTTGGTTGAGCTGC
AACCGTATGATTCGGGTAATTTTTCCATGTTGTCTAACATATATGCTGCGGCAGGGGACTGGGACTGTGTCGCCAACACGAGGTTGAGAATGCGGAGTATTGGAACTCAA
AAACCGTCGGGTGCTAGTTCCATTGAGATCGACAATGAGGTTCATGAATTTACAGTTTTTGATCGATCACATCCAAAATCTGATAATATATATCAGGTTCTAATGGAGAG
GCATAGTCAGGCTGACATACATGACTGCACAATTAAACTGAGAGTAAATCCTCAAAAACAGAGAGACAAGGTGTGCATTGGCTGTGGTGCTGGATTTGGAGGTGATAGGC
CAACTGCGGCTCTTAAATTGCTTCAGAGGGTCAAAAACCTAAACTATCTTGTACTTGAATGCCTAGCAGAACGCACTCTTGCAGATCGCTATCAAGTTATGTTGTCTGGT
GGTGATGGTTACGATTCAAGGATTGCAGATTGGATGAAATTGCTTCTTCCCTTGGCTATGAAGAGAAATATTTGCATAATTACCAACATGGGTGCAATGGACCCCCCTGC
GGCTCAGCAAAATGTCATAGAAGTAGCAGGCAGTCTGGGGCTGAATGTTTTAGTTGCAGTTGCTTATGAGGGTTCGGTAAAAGAATCAGGAATTAGCACGTATATGGGAG
CAGCACCTATTGTTGAGTGTCTGGAAAAGTACCATCCAAATGTCATAATTACTTCACGTGTTGCAGATGCTGCCCTATTCTTGGCTCCAATGGTCTATGAACTTGGTTGG
AACTGGGATGATTTTCCATTGCTAGCACAGGGAATACTGGCTGGTCATCTTCTCGAATGTGGCTGTCAACTTACAGGGGGATACTTTATGCATCCAGGAGACAAGTATAG
AAGCATGTCTTTCCAACAGCTTCTGAATATATCACTGCCTTATGCGGAAGTTGAATGTGATGGAAAGTTAACTGTAGCCAAGCCAGATGAGAGTGGAGGTCTTTTGAATT
TCAGTACATGTGCCGAACAACTTCTGTATGAGATTGGTGATCCATCAGCTTATATCACCCCTGACTTGGTGGTTGACTTCAGCAATGTTTCGTTTTGCTCTACATCCAGC
TCTAGGGTTATATGTTCCGGAGCAAAACCATCTATTCAAGGAGTGCCTGAGAAACTCTTGCAGTTGGCTCCAAAGGACTGTGGATGGAAAGGATGGGGAGAGATTTCCTA
TGGGGGACGTGAATGTGTTCTGCGTGCTAAAGGTGCAGAATATCTGGTTCGTTCGTGGATGGAAGAACTGTTGATTGGTATTAATCTGCATATAGTTTCTTACACAATTG
GACTCGACAGCCTTAAAGCATCCAGCAATAGTAGCAATTGTGTTGAAGATATTAGGTTGCGCATGGATGGACTCTTTGAGCAGAAGGAGCACGCTCTCCTGTTTGTTAAA
GAATTTACAGCTTTATACACAAATGGGCCAGCTGGTGGTGGTGGCATCAGCACTGGCTACAAGAAAGAAATTGTGCTTGAAAAACAACTGGTTGGGCGCGAAAATATTTT
CTGGCAAACAGAAGTGAAGTGCACTGAAGCAGTAAAGTTAGACAGTCAATCAACATATCTTCAAAAGGATCTGGCAAAGGCATGTTCTTCGCCCCCAGTAACGTTGCCAT
GTCCGATATCTGCTCATGCAGAGAAACCTTGTACAGGTTCCTTCCCACCAGAAACGGTTGATTCTCCTATTCCATCTGGCCAGGAGATTGCTCTTTACAATGTAGCCCAT
AGCAGAGCTGGAGACAAAGGGAATGACTTGAACTTCTCTCTCATTCCTCATTATCCTTCTGATTTTGAGCGATTGAAGATGATCATCACGCCTGAATGGGTGATGAGAGT
TCTCTCGGTTCTGCAGAATTCGACTCGGTTTCATTCTTCAAATGCTGAAGAGAAAAGAAACGAGTGGGTAAATGAAGATGTGAAGGTTGAAATATATGAAGTTAAAGGTA
TCCATTCTTTGAATGTCGTTGTTCGTAACATTCTTGATGGTGGCGTAAATTGCTCGCGGAGAATTGATCGCCATGGAAAGACCATATCGGATCTCATCTTGAACCAGCCA
ATTGTTTTGCCACCTGGACAATAA
Protein sequenceShow/hide protein sequence
MQICSVPIRTPSWFSTRKLLEQKLSDLHKCSDLNQVKQLHAQIVKSNLHVDLYVVPKLISAFSLCRHMPLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQ
SDGFYPDNFTFPFLLKVCTGNVWLPVVETVHAQIEQFGFMSDVFVPNSLIDSYSKCGSRGILAAKKLFVSMGACRDVVSWNSMISGLAKGGLYEEARKMFDEMPKRDCIS
WNTMLDGYVKVGKMDDAFKLFDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIISGFAEKGLAREAIGLFDQMEKACLKLDNGTIIGILAACAESGL
LGLGEKIHASIKNNNFKCTIEISNALIDMYAKCGELNIAYGIFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPDKVTMIGVLCACTHAGLIDDGIRYFS
MMERDYALVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPMGPNAIIWGTLLGACRMHNAVELAREVLDHLVELQPYDSGNFSMLSNIYAAAGDWDCVANTRLRMRSIGTQ
KPSGASSIEIDNEVHEFTVFDRSHPKSDNIYQVLMERHSQADIHDCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSG
GDGYDSRIADWMKLLLPLAMKRNICIITNMGAMDPPAAQQNVIEVAGSLGLNVLVAVAYEGSVKESGISTYMGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGW
NWDDFPLLAQGILAGHLLECGCQLTGGYFMHPGDKYRSMSFQQLLNISLPYAEVECDGKLTVAKPDESGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDFSNVSFCSTSS
SRVICSGAKPSIQGVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKGAEYLVRSWMEELLIGINLHIVSYTIGLDSLKASSNSSNCVEDIRLRMDGLFEQKEHALLFVK
EFTALYTNGPAGGGGISTGYKKEIVLEKQLVGRENIFWQTEVKCTEAVKLDSQSTYLQKDLAKACSSPPVTLPCPISAHAEKPCTGSFPPETVDSPIPSGQEIALYNVAH
SRAGDKGNDLNFSLIPHYPSDFERLKMIITPEWVMRVLSVLQNSTRFHSSNAEEKRNEWVNEDVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTISDLILNQP
IVLPPGQ