; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC11G210880 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC11G210880
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCiama_Chr11:22962846..22972815
RNA-Seq ExpressionCaUC11G210880
SyntenyCaUC11G210880
Gene Ontology termsGO:0008380 - RNA splicing (biological process)
GO:1900865 - chloroplast RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0097573 - glutathione oxidoreductase activity (molecular function)
InterPro domainsIPR002109 - Glutaredoxin
IPR002885 - Pentatricopeptide repeat
IPR006869 - Domain of unknown function DUF547
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR036249 - Thioredoxin-like superfamily
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138809.1 uncharacterized protein LOC101216716 [Cucumis sativus]0.0e+0067.72Show/hide
Query:  MATSPTVKRDEEHSLSKQDDGGEPTVEIEQSLSVSEDSATQAKDKGLGEVAKDKNIDSKCDAQEEIVERETVQSEVQHKLETKQDCQSKSEGEKSDQTIR
        MATSPT KRDEE  LSKQ+DGGE TVEIEQ LSVS DSATQAKD+GL EV KDKN DSKC+AQEE VERE VQSEV H++E K D QSKSEGEKSDQ I 
Subjt:  MATSPTVKRDEEHSLSKQDDGGEPTVEIEQSLSVSEDSATQAKDKGLGEVAKDKNIDSKCDAQEEIVERETVQSEVQHKLETKQDCQSKSEGEKSDQTIR

Query:  NDDSNEKLDEDKNAASESSSDDSDNDGVDSKGQTRTNQRSGEVIGEEKSPEPVFDGTEVPGIEGSGNLSNRSMDSDSESQGVVDRALALKNFVREKGVVA
        N DSNEKLDEDKN  SESSSDDSDND V SK Q  +NQ +GEVIGEEKSPEPVFDGTEVPGIEGSG+LSNRSMDSD+ESQGVVDRALALKNFV+EKGVVA
Subjt:  NDDSNEKLDEDKNAASESSSDDSDNDGVDSKGQTRTNQRSGEVIGEEKSPEPVFDGTEVPGIEGSGNLSNRSMDSDSESQGVVDRALALKNFVREKGVVA

Query:  VSTVLRRFSGKKDEESLDTPNDETKDDSASNKENEAKEIPEKPLERSAWNPLNYIKISRDADAQIKTEQVEDVAGESNLDVVIKGRIVLYTRLGCQECKE
        VSTVLRRFSGKKDEES DTP++ETKDDS SNKENEAKEIPEKPL+RS WNPLNYIKI+RDADAQIKTEQ EDV G+S  D+VIKGRIVLYTRLGCQECKE
Subjt:  VSTVLRRFSGKKDEESLDTPNDETKDDSASNKENEAKEIPEKPLERSAWNPLNYIKISRDADAQIKTEQVEDVAGESNLDVVIKGRIVLYTRLGCQECKE

Query:  ARLFLFWKRLRYVEINIDVYPGRKLELEKIAGSSAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMI
        ARLFLFWKRL YVEINIDVYP RKLELEK+AGS AVP++FFNTVLIGGLNELKELDESGKL+EKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMI
Subjt:  ARLFLFWKRLRYVEINIDVYPGRKLELEKIAGSSAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMI

Query:  VRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFLSEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQETYSGCSKKITFSRSKQIIF
        VRKMKESIVVKDRYCKMRRFT+CFLGSEAVDFLSEDQYLEREE                                                         
Subjt:  VRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFLSEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQETYSGCSKKITFSRSKQIIF

Query:  ILKLDAFLCVLEILKRNWKENLPIFVLILSFFTINVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVPCVYPIFLFSAKIYLCQAI
                                                                                                          AI
Subjt:  ILKLDAFLCVLEILKRNWKENLPIFVLILSFFTINVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVPCVYPIFLFSAKIYLCQAI

Query:  EFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIV
        EFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDI+SRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIV
Subjt:  EFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIV

Query:  EELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSKVSLPYVEP
        EELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGA+ERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSK SLPYVEP
Subjt:  EELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSKVSLPYVEP

Query:  LIHFALVCGTRSGPALRCYSPGNVDQELMEAAQSFLREGGLVMDLNNNAASMKTVPQKPSVLFQTFISFLLLIVATLNFHLEPYFVILILLICLGRSTLL
        LIHFALVCGTRSGPALRCYSPGN+D EL+EAA+SFLREGGLVMDLNNNA S+  +                                             
Subjt:  LIHFALVCGTRSGPALRCYSPGNVDQELMEAAQSFLREGGLVMDLNNNAASMKTVPQKPSVLFQTFISFLLLIVATLNFHLEPYFVILILLICLGRSTLL

Query:  EICWKFWQYVDNWAWSSILLRPYMVQSPVYSTFLCIQDLGTNHRWRFSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQLKL
                                                   +W FSTDFGKNEQE +KHASNYLKPDDSQ LLELLASSQLK+
Subjt:  EICWKFWQYVDNWAWSSILLRPYMVQSPVYSTFLCIQDLGTNHRWRFSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQLKL

XP_008441247.1 PREDICTED: uncharacterized protein LOC103485435 [Cucumis melo]0.0e+0067.01Show/hide
Query:  MATSPTVKRDEEHSLSKQDDGGEPTVEIEQSLSVSEDSATQAKDKGLGEVAKDKNIDSKCDAQEEIVERETVQSEVQHKLETKQDCQSKSEGEKSDQTIR
        MA S T +RDEEH LSKQ+DGGE TVEIEQSLSVS DSATQA+D+GL EV KDKN +SKC+AQEEIV RE VQSEV H++ETK D QSKSEGE+SDQ I 
Subjt:  MATSPTVKRDEEHSLSKQDDGGEPTVEIEQSLSVSEDSATQAKDKGLGEVAKDKNIDSKCDAQEEIVERETVQSEVQHKLETKQDCQSKSEGEKSDQTIR

Query:  NDDSNEKLDEDKNAASESSSDDSDNDGVDSKGQTRTNQRSGEVIGEEKSPEPVFDGTEVPGIEGSGNLSNRSMDSDSESQGVVDRALALKNFVREKGVVA
        NDDSNEKLDEDKN  SE SSDDSDND VD K Q  +NQ +GEVIGEEKSPEPVFDGTEV GIEG+G+LSNRSMDSD+E QGVVDRALALKNFV+EKGVVA
Subjt:  NDDSNEKLDEDKNAASESSSDDSDNDGVDSKGQTRTNQRSGEVIGEEKSPEPVFDGTEVPGIEGSGNLSNRSMDSDSESQGVVDRALALKNFVREKGVVA

Query:  VSTVLRRFSGKKDEESLDTPNDETKDDSASNKENEAKEIPEKPLERSAWNPLNYIKISRDADAQIKTEQVEDVAGESNLDVVIKGRIVLYTRLGCQECKE
        VS VLRRFSGKKDEES DTP + TKDDS+SNKENEAKEIPEKPL+RS WNPLNYIKI+RDADAQIKTEQ EDV+G+S  D+VIKGRIVLYTRLGCQECKE
Subjt:  VSTVLRRFSGKKDEESLDTPNDETKDDSASNKENEAKEIPEKPLERSAWNPLNYIKISRDADAQIKTEQVEDVAGESNLDVVIKGRIVLYTRLGCQECKE

Query:  ARLFLFWKRLRYVEINIDVYPGRKLELEKIAGSSAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMI
        ARLFLF KRLRYVEINIDVYP RKLELEK+AGS AVP+VFFNT LIGGLNELKELDESGKL+EKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMI
Subjt:  ARLFLFWKRLRYVEINIDVYPGRKLELEKIAGSSAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMI

Query:  VRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFLSEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQETYSGCSKKITFSRSKQIIF
        VRKMKESIVVKDRYCKMRR+T+CFLGSEAVDFLSEDQYLEREE                                                         
Subjt:  VRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFLSEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQETYSGCSKKITFSRSKQIIF

Query:  ILKLDAFLCVLEILKRNWKENLPIFVLILSFFTINVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVPCVYPIFLFSAKIYLCQAI
                                                                                                          AI
Subjt:  ILKLDAFLCVLEILKRNWKENLPIFVLILSFFTINVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVPCVYPIFLFSAKIYLCQAI

Query:  EFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIV
        EFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHN+ARGIIEVKPKPITDI+SRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIV
Subjt:  EFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIV

Query:  EELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSKVSLPYVEP
        EELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGA+ERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSKVSLPYVEP
Subjt:  EELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSKVSLPYVEP

Query:  LIHFALVCGTRSGPALRCYSPGNVDQELMEAAQSFLREGGLVMDLNNNAASMKTVPQKPSVLFQTFISFLLLIVATLNFHLEPYFVILILLICLGRSTLL
        LIHFALVCGTRSGPALRCYSPGN+DQEL+EAA+SFLREGGLVMDLNNN AS+  +                                             
Subjt:  LIHFALVCGTRSGPALRCYSPGNVDQELMEAAQSFLREGGLVMDLNNNAASMKTVPQKPSVLFQTFISFLLLIVATLNFHLEPYFVILILLICLGRSTLL

Query:  EICWKFWQYVDNWAWSSILLRPYMVQSPVYSTFLCIQDLGTNHRWRFSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQLKL
                                                   +W FSTDFGKNEQEV+KHASNYLKPDDSQALLELLASS+LK+
Subjt:  EICWKFWQYVDNWAWSSILLRPYMVQSPVYSTFLCIQDLGTNHRWRFSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQLKL

XP_038884364.1 pentatricopeptide repeat-containing protein At4g18520, chloroplastic-like [Benincasa hispida]0.0e+0092.28Show/hide
Query:  MLSPAFISTATISHPPPCLSVQSPPTSLSARRTCSKWNLTSIDRSKSFTNFRLNFLKDPSSSSPVACTTETCTTEEYAEVESCSNQSASGCLSPYLIAVW
        MLSPAFISTATISHPPPC SVQ PPT L+ARRTCSKWN TSID  KS TNFRLNF+KD S +SPVAC  ET TT+EYA VESCS+QS SGCLSPYLIAVW
Subjt:  MLSPAFISTATISHPPPCLSVQSPPTSLSARRTCSKWNLTSIDRSKSFTNFRLNFLKDPSSSSPVACTTETCTTEEYAEVESCSNQSASGCLSPYLIAVW

Query:  LRSSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQANGKMFVCILN
        LRSSRS KELRAIHAFILRH TS+ IYVGNNL+SSYLRFGML DARK FDEMPVR+VVTWT IINGYI LD TEEALGLFS SVKNGVQANGKMFVCILN
Subjt:  LRSSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQANGKMFVCILN

Query:  LCAKRLDFELGRQIHGVIVKGNWGNMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNGFLPNEFSVCSVLK
        LCAKRLDFELGRQIHGVIVKGNWGN+I+DSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAIS+FSNML++GFLPNEFSVCSVLK
Subjt:  LCAKRLDFELGRQIHGVIVKGNWGNMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNGFLPNEFSVCSVLK

Query:  ACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNLTIVSIL
        ACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIAGYAREG GEEA+NLFRLMKRQRIPANNLTIVSIL
Subjt:  ACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNLTIVSIL

Query:  RACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSA
        RACGSIGASLTGREVHAQIVK+S QTNIHIGSTLVWFYCKCRN+LKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNM+EEGVEPNSFTYSSA
Subjt:  RACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSA

Query:  LKACAKMEAILQGKLIHSSANKTSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDYILGT
        LKACAKMEAILQGKLIHSSANKTSA SNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDYILGT
Subjt:  LKACAKMEAILQGKLIHSSANKTSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDYILGT

Query:  VYGACGDVKCDVESTLEYSPRT
        VYGACGDVKCDV+S+LEY  +T
Subjt:  VYGACGDVKCDVESTLEYSPRT

XP_038885671.1 uncharacterized protein LOC120075979 isoform X1 [Benincasa hispida]0.0e+0067.9Show/hide
Query:  MATSPTVKRDEEHSLSKQDDGGEPTVEIEQSLSVSEDSATQ------------------AKDKGLGEVAKDKNIDSKCDAQEEIVERETVQSEVQHKLET
        MATSP   RDEEHSLSKQ DGGEPTVEIEQSLSV+EDSATQ                  AKDKGL EVAKDKN+DSKC+AQEEIVERETVQSEV H++ET
Subjt:  MATSPTVKRDEEHSLSKQDDGGEPTVEIEQSLSVSEDSATQ------------------AKDKGLGEVAKDKNIDSKCDAQEEIVERETVQSEVQHKLET

Query:  KQDCQSKSEGEKSDQTIRNDDSNEKLDEDKNAASESSSDDSDNDGVDSKGQTRTNQRSGEVIGEEKSPEPVFDGTEVPGIEGSGNLSNRSMDSDSESQGV
        K D QSKSEGEK DQ  RNDDSNE L E KNA SE+SSDDSDNDGVDSK QT TNQ +GEVIGEEKSPEPVFDGTE+PGIEGSG+LSNRSMDSDSESQGV
Subjt:  KQDCQSKSEGEKSDQTIRNDDSNEKLDEDKNAASESSSDDSDNDGVDSKGQTRTNQRSGEVIGEEKSPEPVFDGTEVPGIEGSGNLSNRSMDSDSESQGV

Query:  VDRALALKNFVREKGVVAVSTVLRRFSGKKDEESLDTPNDETKDDSASNKENEAKEIPEKPLERSAWNPLNYIKISRDADAQIKTEQVEDVAGESNLDVV
        VDRALALKNFV+EKGVVAVS VLRRFSGKKDEE+ D PNDETKDDS SNKENEAKEIPEKPLERSAWNPLNYIKISRDADAQIKT+QVED AGE  LDVV
Subjt:  VDRALALKNFVREKGVVAVSTVLRRFSGKKDEESLDTPNDETKDDSASNKENEAKEIPEKPLERSAWNPLNYIKISRDADAQIKTEQVEDVAGESNLDVV

Query:  IKGRIVLYTRLGCQECKEARLFLFWKRLRYVEINIDVYPGRKLELEKIAGSSAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAPTLEAPLPPL
        +KGRIVLYTRLGCQECKEARLFLFWKRLRYVEINIDVYPGRKLELEKIAGS AVP+VFFNTVLIGG+NELKELDESGKL+EKIEYLKAEAPTLEAPLPPL
Subjt:  IKGRIVLYTRLGCQECKEARLFLFWKRLRYVEINIDVYPGRKLELEKIAGSSAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAPTLEAPLPPL

Query:  SGEDDVSSSGTVDELAMIVRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFLSEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQET
        SGEDDVSSSGTVDELAMIVRKMKESIVVKDRYCKMRRFT+CFLGSEAVDFLSEDQYLEREE                                       
Subjt:  SGEDDVSSSGTVDELAMIVRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFLSEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQET

Query:  YSGCSKKITFSRSKQIIFILKLDAFLCVLEILKRNWKENLPIFVLILSFFTINVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVP
                                                                                                            
Subjt:  YSGCSKKITFSRSKQIIFILKLDAFLCVLEILKRNWKENLPIFVLILSFFTINVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVP

Query:  CVYPIFLFSAKIYLCQAIEFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHV
                        AIEFGRKLASK FFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDI+SRLRFLSFAILEAYVSEDGKHV
Subjt:  CVYPIFLFSAKIYLCQAIEFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHV

Query:  DYRSIHGSEEFARYLRIVEELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMK
        DYRSIHGSEEFARYLRIVEELQRVEVHNLAREEK+AFFINLYNMMAIHAILVCGHP+GALERRKLFGDFKYVIGGATYSLSAIQNGILR NQRPPYNLMK
Subjt:  DYRSIHGSEEFARYLRIVEELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMK

Query:  PFGARDKRSKVSLPYVEPLIHFALVCGTRSGPALRCYSPGNVDQELMEAAQSFLREGGLVMDLNNNAASMKTVPQKPSVLFQTFISFLLLIVATLNFHLE
         FGARDKRSKVSLPYVEPLIHFALVCGTRSGPALRCYSPGNVD ELMEAA+SFLR+GGL+MDLNNN+AS               +S +L           
Subjt:  PFGARDKRSKVSLPYVEPLIHFALVCGTRSGPALRCYSPGNVDQELMEAAQSFLREGGLVMDLNNNAASMKTVPQKPSVLFQTFISFLLLIVATLNFHLE

Query:  PYFVILILLICLGRSTLLEICWKFWQYVDNWAWSSILLRPYMVQSPVYSTFLCIQDLGTNHRWRFSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQ
                                                                     +W FSTDFGKNEQEV+KHASNYLKP+DSQALLELLASSQ
Subjt:  PYFVILILLICLGRSTLLEICWKFWQYVDNWAWSSILLRPYMVQSPVYSTFLCIQDLGTNHRWRFSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQ

Query:  LKL
        LK+
Subjt:  LKL

XP_038885672.1 uncharacterized protein LOC120075979 isoform X2 [Benincasa hispida]0.0e+0069.14Show/hide
Query:  MATSPTVKRDEEHSLSKQDDGGEPTVEIEQSLSVSEDSATQAKDKGLGEVAKDKNIDSKCDAQEEIVERETVQSEVQHKLETKQDCQSKSEGEKSDQTIR
        MATSP   RDEEHSLSKQ DGGEPTVEIEQSLSV+EDSATQAKDKGL EVAKDKN+DSKC+AQEEIVERETVQSEV H++ETK D QSKSEGEK DQ  R
Subjt:  MATSPTVKRDEEHSLSKQDDGGEPTVEIEQSLSVSEDSATQAKDKGLGEVAKDKNIDSKCDAQEEIVERETVQSEVQHKLETKQDCQSKSEGEKSDQTIR

Query:  NDDSNEKLDEDKNAASESSSDDSDNDGVDSKGQTRTNQRSGEVIGEEKSPEPVFDGTEVPGIEGSGNLSNRSMDSDSESQGVVDRALALKNFVREKGVVA
        NDDSNE L E KNA SE+SSDDSDNDGVDSK QT TNQ +GEVIGEEKSPEPVFDGTE+PGIEGSG+LSNRSMDSDSESQGVVDRALALKNFV+EKGVVA
Subjt:  NDDSNEKLDEDKNAASESSSDDSDNDGVDSKGQTRTNQRSGEVIGEEKSPEPVFDGTEVPGIEGSGNLSNRSMDSDSESQGVVDRALALKNFVREKGVVA

Query:  VSTVLRRFSGKKDEESLDTPNDETKDDSASNKENEAKEIPEKPLERSAWNPLNYIKISRDADAQIKTEQVEDVAGESNLDVVIKGRIVLYTRLGCQECKE
        VS VLRRFSGKKDEE+ D PNDETKDDS SNKENEAKEIPEKPLERSAWNPLNYIKISRDADAQIKT+QVED AGE  LDVV+KGRIVLYTRLGCQECKE
Subjt:  VSTVLRRFSGKKDEESLDTPNDETKDDSASNKENEAKEIPEKPLERSAWNPLNYIKISRDADAQIKTEQVEDVAGESNLDVVIKGRIVLYTRLGCQECKE

Query:  ARLFLFWKRLRYVEINIDVYPGRKLELEKIAGSSAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMI
        ARLFLFWKRLRYVEINIDVYPGRKLELEKIAGS AVP+VFFNTVLIGG+NELKELDESGKL+EKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMI
Subjt:  ARLFLFWKRLRYVEINIDVYPGRKLELEKIAGSSAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMI

Query:  VRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFLSEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQETYSGCSKKITFSRSKQIIF
        VRKMKESIVVKDRYCKMRRFT+CFLGSEAVDFLSEDQYLEREE                                                         
Subjt:  VRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFLSEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQETYSGCSKKITFSRSKQIIF

Query:  ILKLDAFLCVLEILKRNWKENLPIFVLILSFFTINVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVPCVYPIFLFSAKIYLCQAI
                                                                                                          AI
Subjt:  ILKLDAFLCVLEILKRNWKENLPIFVLILSFFTINVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVPCVYPIFLFSAKIYLCQAI

Query:  EFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIV
        EFGRKLASK FFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDI+SRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIV
Subjt:  EFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIV

Query:  EELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSKVSLPYVEP
        EELQRVEVHNLAREEK+AFFINLYNMMAIHAILVCGHP+GALERRKLFGDFKYVIGGATYSLSAIQNGILR NQRPPYNLMK FGARDKRSKVSLPYVEP
Subjt:  EELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSKVSLPYVEP

Query:  LIHFALVCGTRSGPALRCYSPGNVDQELMEAAQSFLREGGLVMDLNNNAASMKTVPQKPSVLFQTFISFLLLIVATLNFHLEPYFVILILLICLGRSTLL
        LIHFALVCGTRSGPALRCYSPGNVD ELMEAA+SFLR+GGL+MDLNNN+AS               +S +L                             
Subjt:  LIHFALVCGTRSGPALRCYSPGNVDQELMEAAQSFLREGGLVMDLNNNAASMKTVPQKPSVLFQTFISFLLLIVATLNFHLEPYFVILILLICLGRSTLL

Query:  EICWKFWQYVDNWAWSSILLRPYMVQSPVYSTFLCIQDLGTNHRWRFSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQLKL
                                                   +W FSTDFGKNEQEV+KHASNYLKP+DSQALLELLASSQLK+
Subjt:  EICWKFWQYVDNWAWSSILLRPYMVQSPVYSTFLCIQDLGTNHRWRFSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQLKL

TrEMBL top hitse value%identityAlignment
A0A0A0LQQ2 Uncharacterized protein0.0e+0067.72Show/hide
Query:  MATSPTVKRDEEHSLSKQDDGGEPTVEIEQSLSVSEDSATQAKDKGLGEVAKDKNIDSKCDAQEEIVERETVQSEVQHKLETKQDCQSKSEGEKSDQTIR
        MATSPT KRDEE  LSKQ+DGGE TVEIEQ LSVS DSATQAKD+GL EV KDKN DSKC+AQEE VERE VQSEV H++E K D QSKSEGEKSDQ I 
Subjt:  MATSPTVKRDEEHSLSKQDDGGEPTVEIEQSLSVSEDSATQAKDKGLGEVAKDKNIDSKCDAQEEIVERETVQSEVQHKLETKQDCQSKSEGEKSDQTIR

Query:  NDDSNEKLDEDKNAASESSSDDSDNDGVDSKGQTRTNQRSGEVIGEEKSPEPVFDGTEVPGIEGSGNLSNRSMDSDSESQGVVDRALALKNFVREKGVVA
        N DSNEKLDEDKN  SESSSDDSDND V SK Q  +NQ +GEVIGEEKSPEPVFDGTEVPGIEGSG+LSNRSMDSD+ESQGVVDRALALKNFV+EKGVVA
Subjt:  NDDSNEKLDEDKNAASESSSDDSDNDGVDSKGQTRTNQRSGEVIGEEKSPEPVFDGTEVPGIEGSGNLSNRSMDSDSESQGVVDRALALKNFVREKGVVA

Query:  VSTVLRRFSGKKDEESLDTPNDETKDDSASNKENEAKEIPEKPLERSAWNPLNYIKISRDADAQIKTEQVEDVAGESNLDVVIKGRIVLYTRLGCQECKE
        VSTVLRRFSGKKDEES DTP++ETKDDS SNKENEAKEIPEKPL+RS WNPLNYIKI+RDADAQIKTEQ EDV G+S  D+VIKGRIVLYTRLGCQECKE
Subjt:  VSTVLRRFSGKKDEESLDTPNDETKDDSASNKENEAKEIPEKPLERSAWNPLNYIKISRDADAQIKTEQVEDVAGESNLDVVIKGRIVLYTRLGCQECKE

Query:  ARLFLFWKRLRYVEINIDVYPGRKLELEKIAGSSAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMI
        ARLFLFWKRL YVEINIDVYP RKLELEK+AGS AVP++FFNTVLIGGLNELKELDESGKL+EKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMI
Subjt:  ARLFLFWKRLRYVEINIDVYPGRKLELEKIAGSSAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMI

Query:  VRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFLSEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQETYSGCSKKITFSRSKQIIF
        VRKMKESIVVKDRYCKMRRFT+CFLGSEAVDFLSEDQYLEREE                                                         
Subjt:  VRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFLSEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQETYSGCSKKITFSRSKQIIF

Query:  ILKLDAFLCVLEILKRNWKENLPIFVLILSFFTINVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVPCVYPIFLFSAKIYLCQAI
                                                                                                          AI
Subjt:  ILKLDAFLCVLEILKRNWKENLPIFVLILSFFTINVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVPCVYPIFLFSAKIYLCQAI

Query:  EFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIV
        EFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDI+SRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIV
Subjt:  EFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIV

Query:  EELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSKVSLPYVEP
        EELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGA+ERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSK SLPYVEP
Subjt:  EELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSKVSLPYVEP

Query:  LIHFALVCGTRSGPALRCYSPGNVDQELMEAAQSFLREGGLVMDLNNNAASMKTVPQKPSVLFQTFISFLLLIVATLNFHLEPYFVILILLICLGRSTLL
        LIHFALVCGTRSGPALRCYSPGN+D EL+EAA+SFLREGGLVMDLNNNA S+  +                                             
Subjt:  LIHFALVCGTRSGPALRCYSPGNVDQELMEAAQSFLREGGLVMDLNNNAASMKTVPQKPSVLFQTFISFLLLIVATLNFHLEPYFVILILLICLGRSTLL

Query:  EICWKFWQYVDNWAWSSILLRPYMVQSPVYSTFLCIQDLGTNHRWRFSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQLKL
                                                   +W FSTDFGKNEQE +KHASNYLKPDDSQ LLELLASSQLK+
Subjt:  EICWKFWQYVDNWAWSSILLRPYMVQSPVYSTFLCIQDLGTNHRWRFSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQLKL

A0A1S3B2J4 uncharacterized protein LOC1034854350.0e+0067.01Show/hide
Query:  MATSPTVKRDEEHSLSKQDDGGEPTVEIEQSLSVSEDSATQAKDKGLGEVAKDKNIDSKCDAQEEIVERETVQSEVQHKLETKQDCQSKSEGEKSDQTIR
        MA S T +RDEEH LSKQ+DGGE TVEIEQSLSVS DSATQA+D+GL EV KDKN +SKC+AQEEIV RE VQSEV H++ETK D QSKSEGE+SDQ I 
Subjt:  MATSPTVKRDEEHSLSKQDDGGEPTVEIEQSLSVSEDSATQAKDKGLGEVAKDKNIDSKCDAQEEIVERETVQSEVQHKLETKQDCQSKSEGEKSDQTIR

Query:  NDDSNEKLDEDKNAASESSSDDSDNDGVDSKGQTRTNQRSGEVIGEEKSPEPVFDGTEVPGIEGSGNLSNRSMDSDSESQGVVDRALALKNFVREKGVVA
        NDDSNEKLDEDKN  SE SSDDSDND VD K Q  +NQ +GEVIGEEKSPEPVFDGTEV GIEG+G+LSNRSMDSD+E QGVVDRALALKNFV+EKGVVA
Subjt:  NDDSNEKLDEDKNAASESSSDDSDNDGVDSKGQTRTNQRSGEVIGEEKSPEPVFDGTEVPGIEGSGNLSNRSMDSDSESQGVVDRALALKNFVREKGVVA

Query:  VSTVLRRFSGKKDEESLDTPNDETKDDSASNKENEAKEIPEKPLERSAWNPLNYIKISRDADAQIKTEQVEDVAGESNLDVVIKGRIVLYTRLGCQECKE
        VS VLRRFSGKKDEES DTP + TKDDS+SNKENEAKEIPEKPL+RS WNPLNYIKI+RDADAQIKTEQ EDV+G+S  D+VIKGRIVLYTRLGCQECKE
Subjt:  VSTVLRRFSGKKDEESLDTPNDETKDDSASNKENEAKEIPEKPLERSAWNPLNYIKISRDADAQIKTEQVEDVAGESNLDVVIKGRIVLYTRLGCQECKE

Query:  ARLFLFWKRLRYVEINIDVYPGRKLELEKIAGSSAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMI
        ARLFLF KRLRYVEINIDVYP RKLELEK+AGS AVP+VFFNT LIGGLNELKELDESGKL+EKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMI
Subjt:  ARLFLFWKRLRYVEINIDVYPGRKLELEKIAGSSAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMI

Query:  VRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFLSEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQETYSGCSKKITFSRSKQIIF
        VRKMKESIVVKDRYCKMRR+T+CFLGSEAVDFLSEDQYLEREE                                                         
Subjt:  VRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFLSEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQETYSGCSKKITFSRSKQIIF

Query:  ILKLDAFLCVLEILKRNWKENLPIFVLILSFFTINVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVPCVYPIFLFSAKIYLCQAI
                                                                                                          AI
Subjt:  ILKLDAFLCVLEILKRNWKENLPIFVLILSFFTINVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVPCVYPIFLFSAKIYLCQAI

Query:  EFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIV
        EFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHN+ARGIIEVKPKPITDI+SRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIV
Subjt:  EFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIV

Query:  EELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSKVSLPYVEP
        EELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGA+ERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSKVSLPYVEP
Subjt:  EELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSKVSLPYVEP

Query:  LIHFALVCGTRSGPALRCYSPGNVDQELMEAAQSFLREGGLVMDLNNNAASMKTVPQKPSVLFQTFISFLLLIVATLNFHLEPYFVILILLICLGRSTLL
        LIHFALVCGTRSGPALRCYSPGN+DQEL+EAA+SFLREGGLVMDLNNN AS+  +                                             
Subjt:  LIHFALVCGTRSGPALRCYSPGNVDQELMEAAQSFLREGGLVMDLNNNAASMKTVPQKPSVLFQTFISFLLLIVATLNFHLEPYFVILILLICLGRSTLL

Query:  EICWKFWQYVDNWAWSSILLRPYMVQSPVYSTFLCIQDLGTNHRWRFSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQLKL
                                                   +W FSTDFGKNEQEV+KHASNYLKPDDSQALLELLASS+LK+
Subjt:  EICWKFWQYVDNWAWSSILLRPYMVQSPVYSTFLCIQDLGTNHRWRFSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQLKL

A0A5D3C8K1 Uncharacterized protein0.0e+0067.01Show/hide
Query:  MATSPTVKRDEEHSLSKQDDGGEPTVEIEQSLSVSEDSATQAKDKGLGEVAKDKNIDSKCDAQEEIVERETVQSEVQHKLETKQDCQSKSEGEKSDQTIR
        MA S T +RDEEH LSKQ+DGGE TVEIEQSLSVS DSATQA+D+GL EV KDKN +SKC+AQEEIV RE VQSEV H++ETK D QSKSEGE+SDQ I 
Subjt:  MATSPTVKRDEEHSLSKQDDGGEPTVEIEQSLSVSEDSATQAKDKGLGEVAKDKNIDSKCDAQEEIVERETVQSEVQHKLETKQDCQSKSEGEKSDQTIR

Query:  NDDSNEKLDEDKNAASESSSDDSDNDGVDSKGQTRTNQRSGEVIGEEKSPEPVFDGTEVPGIEGSGNLSNRSMDSDSESQGVVDRALALKNFVREKGVVA
        NDDSNEKLDEDKN  SE SSDDSDND VD K Q  +NQ +GEVIGEEKSPEPVFDGTEV GIEG+G+LSNRSMDSD+E QGVVDRALALKNFV+EKGVVA
Subjt:  NDDSNEKLDEDKNAASESSSDDSDNDGVDSKGQTRTNQRSGEVIGEEKSPEPVFDGTEVPGIEGSGNLSNRSMDSDSESQGVVDRALALKNFVREKGVVA

Query:  VSTVLRRFSGKKDEESLDTPNDETKDDSASNKENEAKEIPEKPLERSAWNPLNYIKISRDADAQIKTEQVEDVAGESNLDVVIKGRIVLYTRLGCQECKE
        VS VLRRFSGKKDEES DTP + TKDDS+SNKENEAKEIPEKPL+RS WNPLNYIKI+RDADAQIKTEQ EDV+G+S  D+VIKGRIVLYTRLGCQECKE
Subjt:  VSTVLRRFSGKKDEESLDTPNDETKDDSASNKENEAKEIPEKPLERSAWNPLNYIKISRDADAQIKTEQVEDVAGESNLDVVIKGRIVLYTRLGCQECKE

Query:  ARLFLFWKRLRYVEINIDVYPGRKLELEKIAGSSAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMI
        ARLFLF KRLRYVEINIDVYP RKLELEK+AGS AVP+VFFNT LIGGLNELKELDESGKL+EKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMI
Subjt:  ARLFLFWKRLRYVEINIDVYPGRKLELEKIAGSSAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMI

Query:  VRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFLSEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQETYSGCSKKITFSRSKQIIF
        VRKMKESIVVKDRYCKMRR+T+CFLGSEAVDFLSEDQYLEREE                                                         
Subjt:  VRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFLSEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQETYSGCSKKITFSRSKQIIF

Query:  ILKLDAFLCVLEILKRNWKENLPIFVLILSFFTINVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVPCVYPIFLFSAKIYLCQAI
                                                                                                          AI
Subjt:  ILKLDAFLCVLEILKRNWKENLPIFVLILSFFTINVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVPCVYPIFLFSAKIYLCQAI

Query:  EFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIV
        EFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHN+ARGIIEVKPKPITDI+SRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIV
Subjt:  EFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIV

Query:  EELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSKVSLPYVEP
        EELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGA+ERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSKVSLPYVEP
Subjt:  EELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSKVSLPYVEP

Query:  LIHFALVCGTRSGPALRCYSPGNVDQELMEAAQSFLREGGLVMDLNNNAASMKTVPQKPSVLFQTFISFLLLIVATLNFHLEPYFVILILLICLGRSTLL
        LIHFALVCGTRSGPALRCYSPGN+DQEL+EAA+SFLREGGLVMDLNNN AS+  +                                             
Subjt:  LIHFALVCGTRSGPALRCYSPGNVDQELMEAAQSFLREGGLVMDLNNNAASMKTVPQKPSVLFQTFISFLLLIVATLNFHLEPYFVILILLICLGRSTLL

Query:  EICWKFWQYVDNWAWSSILLRPYMVQSPVYSTFLCIQDLGTNHRWRFSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQLKL
                                                   +W FSTDFGKNEQEV+KHASNYLKPDDSQALLELLASS+LK+
Subjt:  EICWKFWQYVDNWAWSSILLRPYMVQSPVYSTFLCIQDLGTNHRWRFSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQLKL

A0A6J1DF02 uncharacterized protein LOC1110202090.0e+0063.21Show/hide
Query:  MATSPTVKRDEEHSLSKQDDGGEPTVEIEQSLSVSEDSATQAKDKGLGEVAKDKNIDSKCDAQEEIVERETV---------------QSEVQHKLETKQD
        MATSPT KRDEE  LSK+ D GE T+E E  LSVSEDS T A+DKG  E+AKDKN+DSK +AQ+E VERE V               Q E+ H++ETK D
Subjt:  MATSPTVKRDEEHSLSKQDDGGEPTVEIEQSLSVSEDSATQAKDKGLGEVAKDKNIDSKCDAQEEIVERETV---------------QSEVQHKLETKQD

Query:  CQSKSEGEKSDQTIRNDDSNEKLDEDKNAASESSSDDSDN--DGVDSKGQTRTNQRSGEVIGEEKSPEPVFDGTEVPGIEGSGNLSNRSMDSDSESQGVV
         QSKSE EKSDQ IRNDDSNEKLDEDKNA SESSSDDSDN  +G+DSK QTRTNQ   EV+ EEK+PEPVFDGTEVP IE + +LSNRS DSDSE+QGVV
Subjt:  CQSKSEGEKSDQTIRNDDSNEKLDEDKNAASESSSDDSDN--DGVDSKGQTRTNQRSGEVIGEEKSPEPVFDGTEVPGIEGSGNLSNRSMDSDSESQGVV

Query:  DRALALKNFVREKGVVAVSTVLRRFSGKKDEESLD-TPNDETKDDSASNKENEAKEIPEKPLERSAWNPLNYIKISRDADAQIKTEQVEDVAGESNLDVV
        D+ALALKNFV+EKGVVAVS+VLRRFSGK++EES D +PNDE+KDDS+SNKENEAKEIPEKP ERSAWNPLNYIKISRDADAQIKTEQVE+V+GE  L++V
Subjt:  DRALALKNFVREKGVVAVSTVLRRFSGKKDEESLD-TPNDETKDDSASNKENEAKEIPEKPLERSAWNPLNYIKISRDADAQIKTEQVEDVAGESNLDVV

Query:  IKGRIVLYTRLGCQECKEARLFLFWKRLRYVEINIDVYPGRKLELEKIAGSSAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAPTLEAPLPPL
        +KGRIVLYTRLGCQ+CKEARLFLFWKRLRYVEINIDVYPGRKLELEK+AGS AVPKVFFN  LIGGLNELKELDESGKL+EKIEYLKAEAP+ EAPLPPL
Subjt:  IKGRIVLYTRLGCQECKEARLFLFWKRLRYVEINIDVYPGRKLELEKIAGSSAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAPTLEAPLPPL

Query:  SGEDDVSSSGTVDELAMIVRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFLSEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQET
        SGEDDVSS+GTVDELAM+ RKMKESIVVKDR+ KMRRFT+CFLGSEAVDFLSEDQYLEREE                                       
Subjt:  SGEDDVSSSGTVDELAMIVRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFLSEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQET

Query:  YSGCSKKITFSRSKQIIFILKLDAFLCVLEILKRNWKENLPIFVLILSFFTINVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVP
                                                                                                            
Subjt:  YSGCSKKITFSRSKQIIFILKLDAFLCVLEILKRNWKENLPIFVLILSFFTINVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVP

Query:  CVYPIFLFSAKIYLCQAIEFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHV
                        AIEFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIE KPK I+DI+SRLRFLSFA+ EAYVSEDGKHV
Subjt:  CVYPIFLFSAKIYLCQAIEFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHV

Query:  DYRSIHGSEEFARYLRIVEELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMK
        DYRSIHGSEEFARYLRIVEELQRVEV NLAREEK+AFFINLYNMMAIHAILVCGHPVGALERRKLFG+FKYVIGGATYSLSAIQNGILR NQRPPYNLMK
Subjt:  DYRSIHGSEEFARYLRIVEELQRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMK

Query:  PFGARDKRSKVSLPYVEPLIHFALVCGTRSGPALRCYSPGNVDQELMEAAQSFLREGGLVMDLNNNAASMKTVPQKPSVLFQTFISFLLLIVATLNFHLE
        PFGA+DKR+KVSLPYVEPLIH ALVCGTRSGPALRCYSPGN+DQELMEAA+SFLR+GGLV+DLNNN AS+  +                           
Subjt:  PFGARDKRSKVSLPYVEPLIHFALVCGTRSGPALRCYSPGNVDQELMEAAQSFLREGGLVMDLNNNAASMKTVPQKPSVLFQTFISFLLLIVATLNFHLE

Query:  PYFVILILLICLGRSTLLEICWKFWQYVDNWAWSSILLRPYMVQSPVYSTFLCIQDLGTNHRWRFSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQ
                                                                     +W FSTDFGKNEQEV+KHASNYLKP+DSQALLELLAS+Q
Subjt:  PYFVILILLICLGRSTLLEICWKFWQYVDNWAWSSILLRPYMVQSPVYSTFLCIQDLGTNHRWRFSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQ

Query:  LKL
        LK+
Subjt:  LKL

A0A6J1FJ52 pentatricopeptide repeat-containing protein At4g18520, chloroplastic-like0.0e+0088.26Show/hide
Query:  MLSPAFISTATISHPPPCLSVQSPPTSLSARRTCSKWNLTSIDRSKSFTNFRLNFLKDPSSSSPVACTTETCTTEEYAEVESCSNQSASGCLSPYLIAVW
        MLSPAFISTATIS  PPC SVQ PP SLS RR CSKWNLTSIDR KS TN R NF+KDP  +S VACTTETCT +E  +VESCSN+  +GCLSPYLIA W
Subjt:  MLSPAFISTATISHPPPCLSVQSPPTSLSARRTCSKWNLTSIDRSKSFTNFRLNFLKDPSSSSPVACTTETCTTEEYAEVESCSNQSASGCLSPYLIAVW

Query:  LRSSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQANGKMFVCILN
        LRSSR VKELRAIHAFILRHF+S V YVGNNL+SSYLRFGML DAR+VFDEMP+RSVVTWTAIINGYID DLT+EALGLF  SV++GV+ANGKMFVCILN
Subjt:  LRSSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQANGKMFVCILN

Query:  LCAKRLDFELGRQIHGVIVKGNWGNMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNGFLPNEFSVCSVLK
        LCAKRLDFELGRQIHG IVKGNWGN+IVDSAIVYFYAQCKDISSAFVAFEHMPKRD+VCWTSMITSCSQQGLGREAISMFSNMLS+GFLPNEFSVCSVLK
Subjt:  LCAKRLDFELGRQIHGVIVKGNWGNMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNGFLPNEFSVCSVLK

Query:  ACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNLTIVSIL
        ACGEERELKIG+QLHGL+IKKIIKNDVFVG+SLVDMYAK G+LADSREVF+ MRNRNTVTWTSIIAGYAREGLGEEA+NLFRLM +QR+PAN+LTIVSIL
Subjt:  ACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNLTIVSIL

Query:  RACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSA
        RACGSIGASLTGREVHAQIVKNS QTN+HIGSTLVWFYCKCRNQ KASMVLQ MPLRDVVSWTAIISGCAHLGHESEALEFL+NMIEEGVEPNSFTYSSA
Subjt:  RACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSA

Query:  LKACAKMEAILQGKLIHSSANKTSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDYILGT
        LKACAKMEAILQG+LIHSSANKTSA SNVFVGSALIYMYAKCGYVTEASQVFDSMP RNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDD+I GT
Subjt:  LKACAKMEAILQGKLIHSSANKTSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDYILGT

Query:  VYGACGDVKCDVESTLEYSPRT
        VYG CGDVKC+V+S+LEYS +T
Subjt:  VYGACGDVKCDVESTLEYSPRT

SwissProt top hitse value%identityAlignment
Q0WNP3 Pentatricopeptide repeat-containing protein At4g18520, chloroplastic3.4e-19763.48Show/hide
Query:  LIAVWLRSSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQ-ANGKM
        L+A WL+SS  ++ ++ IHA  L+ F   VIY GNNL+SS +R G L  ARKVFD MP ++ VTWTA+I+GY+   L +EA  LF   VK+G++  N +M
Subjt:  LIAVWLRSSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQ-ANGKM

Query:  FVCILNLCAKRLDFELGRQIHGVIVKGNWGNMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNGFLPNEFS
        FVC+LNLC++R +FELGRQ+HG +VK   GN+IV+S++VYFYAQC +++SA  AF+ M ++DV+ WT++I++CS++G G +AI MF  ML++ FLPNEF+
Subjt:  FVCILNLCAKRLDFELGRQIHGVIVKGNWGNMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNGFLPNEFS

Query:  VCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNL
        VCS+LKAC EE+ L+ GRQ+H L++K++IK DVFVGTSL+DMYAKCG ++D R+VF+GM NRNTVTWTSIIA +AREG GEEAI+LFR+MKR+ + ANNL
Subjt:  VCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNL

Query:  TIVSILRACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNS
        T+VSILRACGS+GA L G+E+HAQI+KNS + N++IGSTLVW YCKC     A  VLQ +P RDVVSWTA+ISGC+ LGHESEAL+FLK MI+EGVEPN 
Subjt:  TIVSILRACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNS

Query:  FTYSSALKACAKMEAILQGKLIHSSANKTSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVD
        FTYSSALKACA  E++L G+ IHS A K  A SNVFVGSALI+MYAKCG+V+EA +VFDSMP +NLVSWKAMI+ YARNG CREALKLMYRM+AEG EVD
Subjt:  FTYSSALKACAKMEAILQGKLIHSSANKTSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVD

Query:  DYILGTVYGACGDVKCD--VEST
        DYI  T+   CGD++ D  VES+
Subjt:  DYILGTVYGACGDVKCD--VEST

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic2.3e-7630.26Show/hide
Query:  AVWLRSSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQANGKMFVC
        A+ L    S+KELR I   + ++      +    L+S + R+G + +A +VF+ +  +  V +  ++ G+  +   ++AL  F     + V+     F  
Subjt:  AVWLRSSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQANGKMFVC

Query:  ILNLCAKRLDFELGRQIHGVIVKGNWG-NMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNGFLPNEFSVC
        +L +C    +  +G++IHG++VK  +  ++   + +   YA+C+ ++ A   F+ MP+RD+V W +++   SQ G+ R A+ M  +M      P+  ++ 
Subjt:  ILNLCAKRLDFELGRQIHGVIVKGNWG-NMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNGFLPNEFSVC

Query:  SVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNLTI
        SVL A    R + +G+++HG  ++    + V + T+LVDMYAKCG+L  +R++F+GM  RN V+W S+I  Y +    +EA+ +F+ M  + +   ++++
Subjt:  SVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNLTI

Query:  VSILRACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFT
        +  L AC  +G    GR +H   V+     N+ + ++L+  YCKC+    A+ +   +  R +VSW A+I G A  G   +AL +   M    V+P++FT
Subjt:  VSILRACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFT

Query:  YSSALKACAKMEAILQGKLIHSSANKTSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDY
        Y S + A A++      K IH    ++    NVFV +AL+ MYAKCG +  A  +FD M  R++ +W AMI  Y  +G  + AL+L   MQ   I+ +  
Subjt:  YSSALKACAKMEAILQGKLIHSSANKTSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDY

Query:  ILGTVYGAC
           +V  AC
Subjt:  ILGTVYGAC

Q9LFI1 Pentatricopeptide repeat-containing protein At3g53360, mitochondrial1.9e-7533.2Show/hide
Query:  SSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQANGKMFVCILNLC
        SSRS+ + R IH  IL     Y   + N++LS Y + G L DAR+VFD MP R++V++T++I GY       EA+ L+   ++  +  +   F  I+  C
Subjt:  SSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQANGKMFVCILNLC

Query:  AKRLDFELGRQIHGVIVK-GNWGNMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNG-FLPNEFSVCSVLK
        A   D  LG+Q+H  ++K  +  ++I  +A++  Y +   +S A   F  +P +D++ W+S+I   SQ G   EA+S    MLS G F PNE+   S LK
Subjt:  AKRLDFELGRQIHGVIVK-GNWGNMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNG-FLPNEFSVCSVLK

Query:  ACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNLTIVSIL
        AC        G Q+HGL IK  +  +   G SL DMYA+CG L  +R VF+ +   +T +W  IIAG A  G  +EA+++F  M+      + +++ S+L
Subjt:  ACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNLTIVSIL

Query:  RACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKA-SMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSS
         A     A   G ++H+ I+K     ++ + ++L+  Y  C +     ++        D VSW  I++ C       E L   K M+    EP+  T  +
Subjt:  RACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKA-SMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSS

Query:  ALKACAKMEAILQGKLIHSSANKTSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDYILG
         L+ C ++ ++  G  +H  + KT  +   F+ + LI MYAKCG + +A ++FDSM  R++VSW  +I+ YA++G   EAL L   M++ GIE +     
Subjt:  ALKACAKMEAILQGKLIHSSANKTSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDYILG

Query:  TVYGACGDV
         V  AC  V
Subjt:  TVYGACGDV

Q9ZQ74 Pentatricopeptide repeat-containing protein At2g03380, mitochondrial6.8e-7331.56Show/hide
Query:  PVACTTETCTTEEYAEVESCSNQSASGCLSPYLIAVWLRSSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAI
        P  C + T   E     E+  +       SP  +   L    ++  LR  H  +  +     I +   L+S Y  FG   DAR VFD++P      W  +
Subjt:  PVACTTETCTTEEYAEVESCSNQSASGCLSPYLIAVWLRSSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAI

Query:  INGYIDLDLTEEALGLFSASVKNGVQANGKMFVCILNLCAKRLDFELGRQIHGVIVKGNWGNMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSM
        +  Y     + E + L+   +K+G + +  +F   L  C +  D + G++IH  +VK    + +V + ++  YA+C +I SA   F  +  R+VVCWTSM
Subjt:  INGYIDLDLTEEALGLFSASVKNGVQANGKMFVCILNLCAKRLDFELGRQIHGVIVKGNWGNMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSM

Query:  ITSCSQQGLGREAISMFSNMLSNGFLPNEFSVCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTS
        I    +  L  E + +F+ M  N  L NE++  +++ AC +   L  G+  HG ++K  I+    + TSL+DMY KCG+++++R VFN   + + V WT+
Subjt:  ITSCSQQGLGREAISMFSNMLSNGFLPNEFSVCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTS

Query:  IIAGYAREGLGEEAINLFRLMKRQRIPANNLTIVSILRACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWT
        +I GY   G   EA++LF+ MK   I  N +TI S+L  CG I     GR VH   +K     + ++ + LV  Y KC     A  V ++   +D+V+W 
Subjt:  IIAGYAREGLGEEAINLFRLMKRQRIPANNLTIVSILRACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWT

Query:  AIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSALKACAKMEAILQGKLIHSSANKTS--ASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLV
        +IISG +  G   EAL     M  E V PN  T +S   ACA + ++  G  +H+ + K    ASS+V VG+AL+  YAKCG    A  +FD++  +N +
Subjt:  AIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSALKACAKMEAILQGKLIHSSANKTS--ASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLV

Query:  SWKAMILCYARNGLCREALKLMYRMQAEGIEVDDYILGTVYGACG
        +W AMI  Y + G    +L+L   M  +  + ++    ++  ACG
Subjt:  SWKAMILCYARNGLCREALKLMYRMQAEGIEVDDYILGTVYGACG

Q9ZUW3 Pentatricopeptide repeat-containing protein At2g276104.7e-7434.16Show/hide
Query:  IYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQANGKMFVCILNLCAKRLDFELGRQIHGVIVKGNWGN
        + VG +L+ +Y++     D RKVFDEM  R+VVTWT +I+GY    + +E L LF      G Q N   F   L + A+      G Q+H V+VK     
Subjt:  IYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQANGKMFVCILNLCAKRLDFELGRQIHGVIVKGNWGN

Query:  MI-VDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNGFLPNEFSVCSVLKACGEERELKIGRQLHGLIIKKIIK
         I V ++++  Y +C ++  A + F+    + VV W SMI+  +  GL  EA+ MF +M  N    +E S  SV+K C   +EL+   QLH  ++K    
Subjt:  MI-VDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNGFLPNEFSVCSVLKACGEERELKIGRQLHGLIIKKIIK

Query:  NDVFVGTSLVDMYAKCGNLADSREVFNGMR-NRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNLTIVSILRACGSIGASLTGREVHAQIVKNS
         D  + T+L+  Y+KC  + D+  +F  +    N V+WT++I+G+ +    EEA++LF  MKR+ +  N  T   IL A   I  S    EVHAQ+VK +
Subjt:  NDVFVGTSLVDMYAKCGNLADSREVFNGMR-NRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNLTIVSILRACGSIGASLTGREVHAQIVKNS

Query:  CQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSALKACAKMEAIL-QGKLIHSSANK
         + +  +G+ L+  Y K     +A+ V   +  +D+V+W+A+++G A  G    A++    + + G++PN FT+SS L  CA   A + QGK  H  A K
Subjt:  CQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSALKACAKMEAIL-QGKLIHSSANK

Query:  TSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDYILGTVYGAC
        +   S++ V SAL+ MYAK G +  A +VF     ++LVSW +MI  YA++G   +AL +   M+   +++D      V+ AC
Subjt:  TSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDYILGTVYGAC

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein1.6e-7730.26Show/hide
Query:  AVWLRSSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQANGKMFVC
        A+ L    S+KELR I   + ++      +    L+S + R+G + +A +VF+ +  +  V +  ++ G+  +   ++AL  F     + V+     F  
Subjt:  AVWLRSSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQANGKMFVC

Query:  ILNLCAKRLDFELGRQIHGVIVKGNWG-NMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNGFLPNEFSVC
        +L +C    +  +G++IHG++VK  +  ++   + +   YA+C+ ++ A   F+ MP+RD+V W +++   SQ G+ R A+ M  +M      P+  ++ 
Subjt:  ILNLCAKRLDFELGRQIHGVIVKGNWG-NMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNGFLPNEFSVC

Query:  SVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNLTI
        SVL A    R + +G+++HG  ++    + V + T+LVDMYAKCG+L  +R++F+GM  RN V+W S+I  Y +    +EA+ +F+ M  + +   ++++
Subjt:  SVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNLTI

Query:  VSILRACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFT
        +  L AC  +G    GR +H   V+     N+ + ++L+  YCKC+    A+ +   +  R +VSW A+I G A  G   +AL +   M    V+P++FT
Subjt:  VSILRACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFT

Query:  YSSALKACAKMEAILQGKLIHSSANKTSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDY
        Y S + A A++      K IH    ++    NVFV +AL+ MYAKCG +  A  +FD M  R++ +W AMI  Y  +G  + AL+L   MQ   I+ +  
Subjt:  YSSALKACAKMEAILQGKLIHSSANKTSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDY

Query:  ILGTVYGAC
           +V  AC
Subjt:  ILGTVYGAC

AT2G27610.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.3e-7534.16Show/hide
Query:  IYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQANGKMFVCILNLCAKRLDFELGRQIHGVIVKGNWGN
        + VG +L+ +Y++     D RKVFDEM  R+VVTWT +I+GY    + +E L LF      G Q N   F   L + A+      G Q+H V+VK     
Subjt:  IYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQANGKMFVCILNLCAKRLDFELGRQIHGVIVKGNWGN

Query:  MI-VDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNGFLPNEFSVCSVLKACGEERELKIGRQLHGLIIKKIIK
         I V ++++  Y +C ++  A + F+    + VV W SMI+  +  GL  EA+ MF +M  N    +E S  SV+K C   +EL+   QLH  ++K    
Subjt:  MI-VDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNGFLPNEFSVCSVLKACGEERELKIGRQLHGLIIKKIIK

Query:  NDVFVGTSLVDMYAKCGNLADSREVFNGMR-NRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNLTIVSILRACGSIGASLTGREVHAQIVKNS
         D  + T+L+  Y+KC  + D+  +F  +    N V+WT++I+G+ +    EEA++LF  MKR+ +  N  T   IL A   I  S    EVHAQ+VK +
Subjt:  NDVFVGTSLVDMYAKCGNLADSREVFNGMR-NRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNLTIVSILRACGSIGASLTGREVHAQIVKNS

Query:  CQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSALKACAKMEAIL-QGKLIHSSANK
         + +  +G+ L+  Y K     +A+ V   +  +D+V+W+A+++G A  G    A++    + + G++PN FT+SS L  CA   A + QGK  H  A K
Subjt:  CQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSALKACAKMEAIL-QGKLIHSSANK

Query:  TSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDYILGTVYGAC
        +   S++ V SAL+ MYAK G +  A +VF     ++LVSW +MI  YA++G   +AL +   M+   +++D      V+ AC
Subjt:  TSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDYILGTVYGAC

AT3G53360.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-7633.2Show/hide
Query:  SSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQANGKMFVCILNLC
        SSRS+ + R IH  IL     Y   + N++LS Y + G L DAR+VFD MP R++V++T++I GY       EA+ L+   ++  +  +   F  I+  C
Subjt:  SSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQANGKMFVCILNLC

Query:  AKRLDFELGRQIHGVIVK-GNWGNMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNG-FLPNEFSVCSVLK
        A   D  LG+Q+H  ++K  +  ++I  +A++  Y +   +S A   F  +P +D++ W+S+I   SQ G   EA+S    MLS G F PNE+   S LK
Subjt:  AKRLDFELGRQIHGVIVK-GNWGNMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNG-FLPNEFSVCSVLK

Query:  ACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNLTIVSIL
        AC        G Q+HGL IK  +  +   G SL DMYA+CG L  +R VF+ +   +T +W  IIAG A  G  +EA+++F  M+      + +++ S+L
Subjt:  ACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNLTIVSIL

Query:  RACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKA-SMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSS
         A     A   G ++H+ I+K     ++ + ++L+  Y  C +     ++        D VSW  I++ C       E L   K M+    EP+  T  +
Subjt:  RACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKA-SMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSS

Query:  ALKACAKMEAILQGKLIHSSANKTSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDYILG
         L+ C ++ ++  G  +H  + KT  +   F+ + LI MYAKCG + +A ++FDSM  R++VSW  +I+ YA++G   EAL L   M++ GIE +     
Subjt:  ALKACAKMEAILQGKLIHSSANKTSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDYILG

Query:  TVYGACGDV
         V  AC  V
Subjt:  TVYGACGDV

AT4G08550.1 electron carriers;protein disulfide oxidoreductases3.5e-13340.02Show/hide
Query:  TQAKDKGLGEVAKDKNIDSKCDAQEEIVERETVQSEV--QHKLETKQDCQSKS---EGEKSDQTIRNDDSNEKLDEDKNAA-SESSSDDSDNDGVDSKGQ
        T+ + K    +AK+ +   + + +EE    E +  EV  +H +E K++  ++    E E+ ++ + + D +E+ + D N +  E  S DS +D  DS+  
Subjt:  TQAKDKGLGEVAKDKNIDSKCDAQEEIVERETVQSEV--QHKLETKQDCQSKS---EGEKSDQTIRNDDSNEKLDEDKNAA-SESSSDDSDNDGVDSKGQ

Query:  TRTNQRSGEVIGEEKSPEPVFDGTEVPGIEGSGNLSNRSMDSDSESQGVVDRALALKNFVREKGVVAVSTVLRRFSGKKDEESLDTPNDETKDDSASNKE
            +   E + +E+S           GI    N         S S    ++A A+KNFVR K  VAV T++RR SGK    S+D     TKDD   + +
Subjt:  TRTNQRSGEVIGEEKSPEPVFDGTEVPGIEGSGNLSNRSMDSDSESQGVVDRALALKNFVREKGVVAVSTVLRRFSGKKDEESLDTPNDETKDDSASNKE

Query:  NEAKEIPEKPLERSAWNPLNYIKISRDADAQIKTEQVEDVAGESNLDVVIKGRIVLYTRLGCQECKEARLFLFWKRLRYVEINIDVYPGRKLELEKIAGS
         E K         S WNPL+Y+K+ ++ +  +  E  E+   E    VV+KGRI+LYTRLGC+EC+  RLFL  KRLRYVEINID+YP RK+ELEKI+G 
Subjt:  NEAKEIPEKPLERSAWNPLNYIKISRDADAQIKTEQVEDVAGESNLDVVIKGRIVLYTRLGCQECKEARLFLFWKRLRYVEINIDVYPGRKLELEKIAGS

Query:  SAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMIVRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFL
          VP VFFN  L+G   ELK L+ESG+L EKI++L  E P  EAPLPP SGEDD SS G VDELA+IV KMK   VVKDR+ KMRRF +CFLGSEAVDFL
Subjt:  SAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAPTLEAPLPPLSGEDDVSSSGTVDELAMIVRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFL

Query:  SEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQETYSGCSKKITFSRSKQIIFILKLDAFLCVLEILKRNWKENLPIFVLILSFFT
        S DQ LER                                                                                            
Subjt:  SEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQETYSGCSKKITFSRSKQIIFILKLDAFLCVLEILKRNWKENLPIFVLILSFFT

Query:  INVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVPCVYPIFLFSAKIYLCQAIEFGRKLASKLFFQHVLEENLFEDGSHLYRFLDD
                                                                                                 DG         
Subjt:  INVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVPCVYPIFLFSAKIYLCQAIEFGRKLASKLFFQHVLEENLFEDGSHLYRFLDD

Query:  DPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIVEELQRVEVHNLAREEKIAFFINLYNMMAIHAIL
                           P+PI +I+SRLR +  AILEAY S DGKHVDYRSIHGSEEFARYLRI++EL RVE+ ++ REEK+AFFINLYNMMAIH+IL
Subjt:  DPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIVEELQRVEVHNLAREEKIAFFINLYNMMAIHAIL

Query:  VCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSKVSLPYVEPLIHFALVCGTRSGPALRCYSPGNVDQELMEAAQ
        V GHP G  +R K+F DFKYVIGG TYSLSAIQNGILRGNQRP +N MKPFG +DKRSKV+LPY EPL HF LVCGTRSGP LRC++PG +D+ELMEAA+
Subjt:  VCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSKVSLPYVEPLIHFALVCGTRSGPALRCYSPGNVDQELMEAAQ

Query:  SFLREGGLVMDLNNNAASMKTV
         FLR GGL +DLN   A +  +
Subjt:  SFLREGGLVMDLNNNAASMKTV

AT4G08550.1 electron carriers;protein disulfide oxidoreductases7.6e-0343.59Show/hide
Query:  FSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQLKL
        +  DFG  ++E+LKHAS +L+P  S+ALL+ L  +Q ++
Subjt:  FSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQLKL

AT4G18520.1 Pentatricopeptide repeat (PPR) superfamily protein2.4e-19863.48Show/hide
Query:  LIAVWLRSSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQ-ANGKM
        L+A WL+SS  ++ ++ IHA  L+ F   VIY GNNL+SS +R G L  ARKVFD MP ++ VTWTA+I+GY+   L +EA  LF   VK+G++  N +M
Subjt:  LIAVWLRSSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLDLTEEALGLFSASVKNGVQ-ANGKM

Query:  FVCILNLCAKRLDFELGRQIHGVIVKGNWGNMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNGFLPNEFS
        FVC+LNLC++R +FELGRQ+HG +VK   GN+IV+S++VYFYAQC +++SA  AF+ M ++DV+ WT++I++CS++G G +AI MF  ML++ FLPNEF+
Subjt:  FVCILNLCAKRLDFELGRQIHGVIVKGNWGNMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFSNMLSNGFLPNEFS

Query:  VCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNL
        VCS+LKAC EE+ L+ GRQ+H L++K++IK DVFVGTSL+DMYAKCG ++D R+VF+GM NRNTVTWTSIIA +AREG GEEAI+LFR+MKR+ + ANNL
Subjt:  VCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPANNL

Query:  TIVSILRACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNS
        T+VSILRACGS+GA L G+E+HAQI+KNS + N++IGSTLVW YCKC     A  VLQ +P RDVVSWTA+ISGC+ LGHESEAL+FLK MI+EGVEPN 
Subjt:  TIVSILRACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNS

Query:  FTYSSALKACAKMEAILQGKLIHSSANKTSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVD
        FTYSSALKACA  E++L G+ IHS A K  A SNVFVGSALI+MYAKCG+V+EA +VFDSMP +NLVSWKAMI+ YARNG CREALKLMYRM+AEG EVD
Subjt:  FTYSSALKACAKMEAILQGKLIHSSANKTSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVD

Query:  DYILGTVYGACGDVKCD--VEST
        DYI  T+   CGD++ D  VES+
Subjt:  DYILGTVYGACGDVKCD--VEST


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TTACCAATTTTGCCCCCGGCCCCCACGAGGGAAGTCCAATTCCAATTTCCAATCCAACCGACCAACGGCTCCGCTTCAAATTATCCGGTGAAATTAGGACATACTAATTT
TCAACCTGAATTGTGGTCGTGGCGGGAATTTCCGGCTATGCTCTCGCCAGCGTTCATCTCAACGGCCACCATTTCTCACCCGCCGCCGTGTTTATCCGTTCAATCACCGC
CTACATCACTATCCGCACGACGGACTTGTTCCAAATGGAACCTCACTAGTATTGACCGTTCTAAAAGCTTCACAAATTTTCGGCTTAATTTTCTCAAAGACCCATCTAGT
TCTTCGCCGGTCGCCTGCACAACGGAAACATGTACTACCGAAGAATACGCCGAGGTCGAATCTTGCAGCAATCAGTCGGCGAGTGGATGTTTAAGTCCCTATTTGATTGC
CGTTTGGCTTCGTTCTAGTCGTAGCGTCAAGGAATTAAGGGCGATACATGCTTTCATTTTGCGACATTTTACAAGTTATGTGATCTATGTTGGAAATAATTTGCTTAGTT
CTTACTTGAGATTCGGAATGTTGGCTGATGCTAGAAAGGTGTTCGATGAAATGCCAGTGAGGAGTGTTGTGACCTGGACGGCTATTATTAATGGATATATTGATTTAGAT
TTGACTGAAGAAGCTTTAGGGTTGTTCAGTGCTTCGGTCAAGAATGGGGTGCAAGCGAATGGAAAGATGTTTGTTTGCATCTTGAACTTGTGTGCTAAGAGGTTGGATTT
TGAGCTTGGGAGGCAAATTCATGGCGTCATTGTGAAAGGTAATTGGGGGAATATGATTGTTGACAGTGCCATTGTTTACTTTTATGCACAATGTAAAGATATTTCCAGTG
CTTTTGTTGCATTTGAACATATGCCGAAGCGGGATGTTGTTTGTTGGACTTCCATGATTACTTCTTGTTCCCAACAAGGGCTTGGACGAGAAGCAATTTCAATGTTTTCA
AATATGCTAAGCAATGGATTTCTTCCAAATGAGTTTTCTGTATGCAGTGTTCTCAAGGCTTGTGGAGAAGAGAGGGAGTTGAAGATTGGGAGACAGTTACATGGTTTGAT
AATTAAGAAAATAATCAAGAATGATGTTTTTGTTGGGACTTCATTGGTTGATATGTATGCCAAGTGCGGGAATTTGGCAGATTCTAGAGAAGTATTCAATGGGATGAGGA
ATAGAAACACGGTTACATGGACATCGATCATAGCTGGCTATGCACGGGAGGGGCTCGGTGAGGAGGCCATTAACCTCTTTAGGTTGATGAAGAGGCAAAGGATTCCTGCC
AATAACTTGACCATTGTAAGTATTCTCCGTGCTTGTGGCTCAATTGGGGCGTCATTGACTGGGAGGGAAGTCCATGCCCAGATTGTAAAAAATTCATGTCAGACTAATAT
ACACATAGGAAGCACTCTAGTTTGGTTCTACTGTAAATGCAGGAATCAACTTAAGGCCTCCATGGTTCTTCAGCTGATGCCGCTTAGAGATGTGGTTTCGTGGACGGCTA
TCATTTCTGGATGTGCTCATCTTGGGCACGAGTCCGAGGCTCTTGAGTTTCTAAAAAACATGATAGAGGAAGGAGTTGAACCAAATTCCTTTACTTATTCATCGGCTTTA
AAAGCGTGCGCGAAGATGGAAGCGATCCTCCAAGGGAAATTGATCCACTCCTCTGCTAATAAAACATCTGCATCGTCCAATGTTTTTGTGGGAAGTGCACTTATTTACAT
GTATGCAAAATGTGGATATGTAACCGAAGCTTCTCAAGTTTTCGACAGCATGCCGGTGAGGAATTTGGTTTCTTGGAAGGCTATGATTTTATGCTATGCTAGGAATGGTC
TCTGCCGAGAGGCGTTGAAGCTCATGTACCGGATGCAGGCTGAAGGTATCGAAGTGGATGACTACATTCTTGGAACAGTTTATGGAGCTTGTGGAGATGTGAAGTGTGAT
GTGGAATCAACACTTGAATACAGCCCACGGACTTCCCGCTTCTCACCATTTCTTCTCAATTCAAACGCCTTTAGATCCTCCAACCTTCTTCATCACTCCGTTTTCGTTAA
GAAAGATCTGTTGCTGGATTCTGGATTCTGGTTTCGTTTTTGTCACCCCAGACGGTGGAAGATTTTTCGAAATGGAAGTGGACGAGAAGAGATGGCGACTTCACCGACCG
TTAAAAGAGATGAGGAACATAGCCTTTCTAAACAGGATGATGGAGGAGAGCCTACAGTGGAGATTGAACAGTCCTTGTCAGTTAGTGAGGATAGTGCCACTCAAGCAAAG
GATAAAGGGCTTGGTGAAGTAGCAAAGGACAAGAACATCGATTCGAAATGTGATGCTCAAGAGGAAATAGTGGAGAGGGAGACCGTCCAAAGTGAGGTACAACACAAGTT
GGAAACCAAACAGGATTGCCAGTCTAAATCTGAAGGTGAAAAGTCGGACCAGACTATTAGAAATGACGACTCTAATGAAAAGCTAGATGAGGATAAGAATGCAGCATCCG
AATCATCATCAGATGATTCAGACAATGATGGTGTAGATTCTAAGGGTCAAACCAGGACAAACCAACGAAGTGGAGAAGTTATTGGCGAGGAGAAATCTCCAGAACCAGTC
TTTGATGGGACTGAGGTTCCTGGGATTGAAGGTAGTGGGAATTTGTCAAACCGTTCCATGGATAGTGATTCAGAGAGCCAAGGTGTCGTTGACAGGGCTTTGGCACTAAA
AAACTTTGTTAGAGAAAAGGGAGTTGTTGCAGTATCTACCGTTTTGCGTCGTTTTTCAGGAAAAAAAGATGAAGAAAGTCTGGATACACCTAATGACGAGACTAAGGATG
ATTCTGCTTCCAATAAAGAAAATGAAGCCAAGGAAATCCCTGAAAAACCATTGGAAAGATCTGCTTGGAATCCCCTCAACTATATTAAGATATCACGTGATGCTGATGCA
CAAATCAAAACTGAGCAGGTGGAGGATGTTGCTGGAGAATCAAACTTAGATGTTGTTATTAAAGGCAGAATTGTTCTGTACACACGGTTAGGTTGTCAAGAATGCAAAGA
GGCTAGACTATTTTTGTTTTGGAAGAGACTTAGATACGTTGAGATCAATATTGATGTTTATCCGGGTAGAAAGTTGGAGCTTGAGAAAATTGCCGGGTCTTCTGCCGTTC
CAAAGGTATTTTTCAATACAGTCCTTATTGGAGGTTTGAACGAACTAAAAGAGTTGGACGAGTCTGGCAAGCTTAATGAGAAAATTGAGTATCTGAAAGCTGAAGCACCA
ACCCTTGAAGCCCCTTTACCACCTCTTTCTGGTGAAGATGATGTCTCCAGTAGTGGGACTGTTGATGAATTGGCTATGATTGTTCGAAAAATGAAAGAATCTATTGTTGT
TAAGGACCGGTACTGCAAAATGCGACGATTCACCAGCTGTTTCCTAGGCTCAGAAGCTGTGGATTTCTTGTCAGAGGATCAATACTTGGAAAGAGAAGAGAAAGAAGAGA
CTTCTCATTCAAATAACAATATTTCACTATTTATCTCTGTCAAGTTCACAATGCAGTTAGGTTACATGGTTGTTTCATTCTATAATGCTTTGCAAAACCAAGAAACATAC
TCGGGTTGTTCAAAGAAAATTACTTTTTCTCGGTCTAAACAAATAATTTTTATACTAAAATTGGATGCTTTTCTCTGCGTCCTCGAAATACTTAAAAGGAATTGGAAAGA
AAACCTTCCAATTTTCGTTCTCATTCTTAGTTTCTTCACTATCAATGTTAGGTACTCATTATTAAAGCAATTTTTATGTAAAACACTTTCTGTAGACACTGTAGTTAGGA
CGTTTTCCCCTCTCCCACATATTAGGAGGCATAATAAACTGAGGATATTGTATGTTTTCCTAGTAGCTGTAGTACCGTGTGTATACCCGATCTTTTTGTTTTCTGCAAAA
ATATATTTGTGTCAAGCTATTGAATTTGGACGAAAGCTCGCAAGCAAACTTTTTTTCCAACATGTTCTCGAGGAAAATCTATTTGAAGATGGTAGCCACTTATACCGATT
CTTGGATGATGATCCCGTTGTGGCAACTCAATGTCACAACGTCGCAAGAGGTATAATTGAGGTGAAGCCAAAGCCGATAACAGATATTTCATCAAGGCTGAGATTTTTAT
CCTTCGCCATTTTAGAAGCCTATGTATCAGAGGATGGAAAGCATGTAGATTACAGAAGTATTCATGGGAGTGAGGAATTTGCTAGGTATTTGAGAATTGTTGAAGAGCTC
CAAAGAGTTGAAGTGCATAATTTGGCACGGGAAGAGAAAATCGCTTTCTTTATTAATCTCTACAATATGATGGCCATCCATGCAATATTAGTATGTGGACATCCTGTTGG
AGCGCTGGAAAGAAGGAAACTGTTTGGAGACTTCAAATATGTTATTGGAGGGGCCACCTACTCACTTTCAGCTATTCAAAACGGAATTTTGAGGGGCAATCAGAGACCAC
CTTACAATCTCATGAAACCATTTGGTGCAAGAGATAAACGTTCTAAAGTTTCTCTTCCGTATGTAGAGCCTTTAATCCATTTTGCACTAGTATGTGGTACTCGATCTGGG
CCTGCCCTTCGATGCTATTCTCCAGGCAATGTGGATCAAGAGTTGATGGAAGCAGCACAGAGTTTCTTAAGAGAGGGAGGACTTGTCATGGATTTAAATAACAATGCTGC
ATCAATGAAAACAGTTCCTCAGAAACCTTCTGTTTTGTTTCAAACATTTATTAGTTTTTTACTTTTGATTGTCGCCACATTAAATTTTCATTTAGAGCCCTACTTTGTGA
TCTTAATTTTGCTAATTTGTCTTGGAAGAAGCACTCTCTTGGAGATTTGTTGGAAATTCTGGCAGTACGTAGATAATTGGGCGTGGAGCAGTATTCTCCTGAGGCCTTAT
ATGGTTCAAAGTCCAGTTTATAGTACTTTCTTGTGTATCCAGGATTTAGGAACTAATCATAGGTGGAGGTTTAGCACGGACTTCGGTAAGAACGAGCAAGAAGTCCTGAA
GCATGCTTCAAACTACTTGAAACCAGATGATTCTCAAGCTCTGCTTGAATTGCTTGCCAGTTCCCAGTTGAAGTTGGGGAAATTACTTCTCAGGCTGAAGCCTTCATGGG
AAGGAAGGCAGTGGGATGCTATAAGCTTGTGGCTGTGA
mRNA sequenceShow/hide mRNA sequence
TTACCAATTTTGCCCCCGGCCCCCACGAGGGAAGTCCAATTCCAATTTCCAATCCAACCGACCAACGGCTCCGCTTCAAATTATCCGGTGAAATTAGGACATACTAATTT
TCAACCTGAATTGTGGTCGTGGCGGGAATTTCCGGCTATGCTCTCGCCAGCGTTCATCTCAACGGCCACCATTTCTCACCCGCCGCCGTGTTTATCCGTTCAATCACCGC
CTACATCACTATCCGCACGACGGACTTGTTCCAAATGGAACCTCACTAGTATTGACCGTTCTAAAAGCTTCACAAATTTTCGGCTTAATTTTCTCAAAGACCCATCTAGT
TCTTCGCCGGTCGCCTGCACAACGGAAACATGTACTACCGAAGAATACGCCGAGGTCGAATCTTGCAGCAATCAGTCGGCGAGTGGATGTTTAAGTCCCTATTTGATTGC
CGTTTGGCTTCGTTCTAGTCGTAGCGTCAAGGAATTAAGGGCGATACATGCTTTCATTTTGCGACATTTTACAAGTTATGTGATCTATGTTGGAAATAATTTGCTTAGTT
CTTACTTGAGATTCGGAATGTTGGCTGATGCTAGAAAGGTGTTCGATGAAATGCCAGTGAGGAGTGTTGTGACCTGGACGGCTATTATTAATGGATATATTGATTTAGAT
TTGACTGAAGAAGCTTTAGGGTTGTTCAGTGCTTCGGTCAAGAATGGGGTGCAAGCGAATGGAAAGATGTTTGTTTGCATCTTGAACTTGTGTGCTAAGAGGTTGGATTT
TGAGCTTGGGAGGCAAATTCATGGCGTCATTGTGAAAGGTAATTGGGGGAATATGATTGTTGACAGTGCCATTGTTTACTTTTATGCACAATGTAAAGATATTTCCAGTG
CTTTTGTTGCATTTGAACATATGCCGAAGCGGGATGTTGTTTGTTGGACTTCCATGATTACTTCTTGTTCCCAACAAGGGCTTGGACGAGAAGCAATTTCAATGTTTTCA
AATATGCTAAGCAATGGATTTCTTCCAAATGAGTTTTCTGTATGCAGTGTTCTCAAGGCTTGTGGAGAAGAGAGGGAGTTGAAGATTGGGAGACAGTTACATGGTTTGAT
AATTAAGAAAATAATCAAGAATGATGTTTTTGTTGGGACTTCATTGGTTGATATGTATGCCAAGTGCGGGAATTTGGCAGATTCTAGAGAAGTATTCAATGGGATGAGGA
ATAGAAACACGGTTACATGGACATCGATCATAGCTGGCTATGCACGGGAGGGGCTCGGTGAGGAGGCCATTAACCTCTTTAGGTTGATGAAGAGGCAAAGGATTCCTGCC
AATAACTTGACCATTGTAAGTATTCTCCGTGCTTGTGGCTCAATTGGGGCGTCATTGACTGGGAGGGAAGTCCATGCCCAGATTGTAAAAAATTCATGTCAGACTAATAT
ACACATAGGAAGCACTCTAGTTTGGTTCTACTGTAAATGCAGGAATCAACTTAAGGCCTCCATGGTTCTTCAGCTGATGCCGCTTAGAGATGTGGTTTCGTGGACGGCTA
TCATTTCTGGATGTGCTCATCTTGGGCACGAGTCCGAGGCTCTTGAGTTTCTAAAAAACATGATAGAGGAAGGAGTTGAACCAAATTCCTTTACTTATTCATCGGCTTTA
AAAGCGTGCGCGAAGATGGAAGCGATCCTCCAAGGGAAATTGATCCACTCCTCTGCTAATAAAACATCTGCATCGTCCAATGTTTTTGTGGGAAGTGCACTTATTTACAT
GTATGCAAAATGTGGATATGTAACCGAAGCTTCTCAAGTTTTCGACAGCATGCCGGTGAGGAATTTGGTTTCTTGGAAGGCTATGATTTTATGCTATGCTAGGAATGGTC
TCTGCCGAGAGGCGTTGAAGCTCATGTACCGGATGCAGGCTGAAGGTATCGAAGTGGATGACTACATTCTTGGAACAGTTTATGGAGCTTGTGGAGATGTGAAGTGTGAT
GTGGAATCAACACTTGAATACAGCCCACGGACTTCCCGCTTCTCACCATTTCTTCTCAATTCAAACGCCTTTAGATCCTCCAACCTTCTTCATCACTCCGTTTTCGTTAA
GAAAGATCTGTTGCTGGATTCTGGATTCTGGTTTCGTTTTTGTCACCCCAGACGGTGGAAGATTTTTCGAAATGGAAGTGGACGAGAAGAGATGGCGACTTCACCGACCG
TTAAAAGAGATGAGGAACATAGCCTTTCTAAACAGGATGATGGAGGAGAGCCTACAGTGGAGATTGAACAGTCCTTGTCAGTTAGTGAGGATAGTGCCACTCAAGCAAAG
GATAAAGGGCTTGGTGAAGTAGCAAAGGACAAGAACATCGATTCGAAATGTGATGCTCAAGAGGAAATAGTGGAGAGGGAGACCGTCCAAAGTGAGGTACAACACAAGTT
GGAAACCAAACAGGATTGCCAGTCTAAATCTGAAGGTGAAAAGTCGGACCAGACTATTAGAAATGACGACTCTAATGAAAAGCTAGATGAGGATAAGAATGCAGCATCCG
AATCATCATCAGATGATTCAGACAATGATGGTGTAGATTCTAAGGGTCAAACCAGGACAAACCAACGAAGTGGAGAAGTTATTGGCGAGGAGAAATCTCCAGAACCAGTC
TTTGATGGGACTGAGGTTCCTGGGATTGAAGGTAGTGGGAATTTGTCAAACCGTTCCATGGATAGTGATTCAGAGAGCCAAGGTGTCGTTGACAGGGCTTTGGCACTAAA
AAACTTTGTTAGAGAAAAGGGAGTTGTTGCAGTATCTACCGTTTTGCGTCGTTTTTCAGGAAAAAAAGATGAAGAAAGTCTGGATACACCTAATGACGAGACTAAGGATG
ATTCTGCTTCCAATAAAGAAAATGAAGCCAAGGAAATCCCTGAAAAACCATTGGAAAGATCTGCTTGGAATCCCCTCAACTATATTAAGATATCACGTGATGCTGATGCA
CAAATCAAAACTGAGCAGGTGGAGGATGTTGCTGGAGAATCAAACTTAGATGTTGTTATTAAAGGCAGAATTGTTCTGTACACACGGTTAGGTTGTCAAGAATGCAAAGA
GGCTAGACTATTTTTGTTTTGGAAGAGACTTAGATACGTTGAGATCAATATTGATGTTTATCCGGGTAGAAAGTTGGAGCTTGAGAAAATTGCCGGGTCTTCTGCCGTTC
CAAAGGTATTTTTCAATACAGTCCTTATTGGAGGTTTGAACGAACTAAAAGAGTTGGACGAGTCTGGCAAGCTTAATGAGAAAATTGAGTATCTGAAAGCTGAAGCACCA
ACCCTTGAAGCCCCTTTACCACCTCTTTCTGGTGAAGATGATGTCTCCAGTAGTGGGACTGTTGATGAATTGGCTATGATTGTTCGAAAAATGAAAGAATCTATTGTTGT
TAAGGACCGGTACTGCAAAATGCGACGATTCACCAGCTGTTTCCTAGGCTCAGAAGCTGTGGATTTCTTGTCAGAGGATCAATACTTGGAAAGAGAAGAGAAAGAAGAGA
CTTCTCATTCAAATAACAATATTTCACTATTTATCTCTGTCAAGTTCACAATGCAGTTAGGTTACATGGTTGTTTCATTCTATAATGCTTTGCAAAACCAAGAAACATAC
TCGGGTTGTTCAAAGAAAATTACTTTTTCTCGGTCTAAACAAATAATTTTTATACTAAAATTGGATGCTTTTCTCTGCGTCCTCGAAATACTTAAAAGGAATTGGAAAGA
AAACCTTCCAATTTTCGTTCTCATTCTTAGTTTCTTCACTATCAATGTTAGGTACTCATTATTAAAGCAATTTTTATGTAAAACACTTTCTGTAGACACTGTAGTTAGGA
CGTTTTCCCCTCTCCCACATATTAGGAGGCATAATAAACTGAGGATATTGTATGTTTTCCTAGTAGCTGTAGTACCGTGTGTATACCCGATCTTTTTGTTTTCTGCAAAA
ATATATTTGTGTCAAGCTATTGAATTTGGACGAAAGCTCGCAAGCAAACTTTTTTTCCAACATGTTCTCGAGGAAAATCTATTTGAAGATGGTAGCCACTTATACCGATT
CTTGGATGATGATCCCGTTGTGGCAACTCAATGTCACAACGTCGCAAGAGGTATAATTGAGGTGAAGCCAAAGCCGATAACAGATATTTCATCAAGGCTGAGATTTTTAT
CCTTCGCCATTTTAGAAGCCTATGTATCAGAGGATGGAAAGCATGTAGATTACAGAAGTATTCATGGGAGTGAGGAATTTGCTAGGTATTTGAGAATTGTTGAAGAGCTC
CAAAGAGTTGAAGTGCATAATTTGGCACGGGAAGAGAAAATCGCTTTCTTTATTAATCTCTACAATATGATGGCCATCCATGCAATATTAGTATGTGGACATCCTGTTGG
AGCGCTGGAAAGAAGGAAACTGTTTGGAGACTTCAAATATGTTATTGGAGGGGCCACCTACTCACTTTCAGCTATTCAAAACGGAATTTTGAGGGGCAATCAGAGACCAC
CTTACAATCTCATGAAACCATTTGGTGCAAGAGATAAACGTTCTAAAGTTTCTCTTCCGTATGTAGAGCCTTTAATCCATTTTGCACTAGTATGTGGTACTCGATCTGGG
CCTGCCCTTCGATGCTATTCTCCAGGCAATGTGGATCAAGAGTTGATGGAAGCAGCACAGAGTTTCTTAAGAGAGGGAGGACTTGTCATGGATTTAAATAACAATGCTGC
ATCAATGAAAACAGTTCCTCAGAAACCTTCTGTTTTGTTTCAAACATTTATTAGTTTTTTACTTTTGATTGTCGCCACATTAAATTTTCATTTAGAGCCCTACTTTGTGA
TCTTAATTTTGCTAATTTGTCTTGGAAGAAGCACTCTCTTGGAGATTTGTTGGAAATTCTGGCAGTACGTAGATAATTGGGCGTGGAGCAGTATTCTCCTGAGGCCTTAT
ATGGTTCAAAGTCCAGTTTATAGTACTTTCTTGTGTATCCAGGATTTAGGAACTAATCATAGGTGGAGGTTTAGCACGGACTTCGGTAAGAACGAGCAAGAAGTCCTGAA
GCATGCTTCAAACTACTTGAAACCAGATGATTCTCAAGCTCTGCTTGAATTGCTTGCCAGTTCCCAGTTGAAGTTGGGGAAATTACTTCTCAGGCTGAAGCCTTCATGGG
AAGGAAGGCAGTGGGATGCTATAAGCTTGTGGCTGTGA
Protein sequenceShow/hide protein sequence
LPILPPAPTREVQFQFPIQPTNGSASNYPVKLGHTNFQPELWSWREFPAMLSPAFISTATISHPPPCLSVQSPPTSLSARRTCSKWNLTSIDRSKSFTNFRLNFLKDPSS
SSPVACTTETCTTEEYAEVESCSNQSASGCLSPYLIAVWLRSSRSVKELRAIHAFILRHFTSYVIYVGNNLLSSYLRFGMLADARKVFDEMPVRSVVTWTAIINGYIDLD
LTEEALGLFSASVKNGVQANGKMFVCILNLCAKRLDFELGRQIHGVIVKGNWGNMIVDSAIVYFYAQCKDISSAFVAFEHMPKRDVVCWTSMITSCSQQGLGREAISMFS
NMLSNGFLPNEFSVCSVLKACGEERELKIGRQLHGLIIKKIIKNDVFVGTSLVDMYAKCGNLADSREVFNGMRNRNTVTWTSIIAGYAREGLGEEAINLFRLMKRQRIPA
NNLTIVSILRACGSIGASLTGREVHAQIVKNSCQTNIHIGSTLVWFYCKCRNQLKASMVLQLMPLRDVVSWTAIISGCAHLGHESEALEFLKNMIEEGVEPNSFTYSSAL
KACAKMEAILQGKLIHSSANKTSASSNVFVGSALIYMYAKCGYVTEASQVFDSMPVRNLVSWKAMILCYARNGLCREALKLMYRMQAEGIEVDDYILGTVYGACGDVKCD
VESTLEYSPRTSRFSPFLLNSNAFRSSNLLHHSVFVKKDLLLDSGFWFRFCHPRRWKIFRNGSGREEMATSPTVKRDEEHSLSKQDDGGEPTVEIEQSLSVSEDSATQAK
DKGLGEVAKDKNIDSKCDAQEEIVERETVQSEVQHKLETKQDCQSKSEGEKSDQTIRNDDSNEKLDEDKNAASESSSDDSDNDGVDSKGQTRTNQRSGEVIGEEKSPEPV
FDGTEVPGIEGSGNLSNRSMDSDSESQGVVDRALALKNFVREKGVVAVSTVLRRFSGKKDEESLDTPNDETKDDSASNKENEAKEIPEKPLERSAWNPLNYIKISRDADA
QIKTEQVEDVAGESNLDVVIKGRIVLYTRLGCQECKEARLFLFWKRLRYVEINIDVYPGRKLELEKIAGSSAVPKVFFNTVLIGGLNELKELDESGKLNEKIEYLKAEAP
TLEAPLPPLSGEDDVSSSGTVDELAMIVRKMKESIVVKDRYCKMRRFTSCFLGSEAVDFLSEDQYLEREEKEETSHSNNNISLFISVKFTMQLGYMVVSFYNALQNQETY
SGCSKKITFSRSKQIIFILKLDAFLCVLEILKRNWKENLPIFVLILSFFTINVRYSLLKQFLCKTLSVDTVVRTFSPLPHIRRHNKLRILYVFLVAVVPCVYPIFLFSAK
IYLCQAIEFGRKLASKLFFQHVLEENLFEDGSHLYRFLDDDPVVATQCHNVARGIIEVKPKPITDISSRLRFLSFAILEAYVSEDGKHVDYRSIHGSEEFARYLRIVEEL
QRVEVHNLAREEKIAFFINLYNMMAIHAILVCGHPVGALERRKLFGDFKYVIGGATYSLSAIQNGILRGNQRPPYNLMKPFGARDKRSKVSLPYVEPLIHFALVCGTRSG
PALRCYSPGNVDQELMEAAQSFLREGGLVMDLNNNAASMKTVPQKPSVLFQTFISFLLLIVATLNFHLEPYFVILILLICLGRSTLLEICWKFWQYVDNWAWSSILLRPY
MVQSPVYSTFLCIQDLGTNHRWRFSTDFGKNEQEVLKHASNYLKPDDSQALLELLASSQLKLGKLLLRLKPSWEGRQWDAISLWL