CuGenDBv2

CsGy6G020037 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy6G020037
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Gy14Chr6:20746704..20751292
RNA-Seq Expression: CsGy6G020037
Synteny: CsGy6G020037
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
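
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch of how such a string could be split and the gene span computed, the Python below may help. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into chromosome, start and end."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate exceeds end coordinate")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Gy14Chr6:20746704..20751292")
gene_span = span_length(start, end)
print(chrom, start, end, gene_span)
```

If the coordinates turn out to be 0-based or half-open, only `span_length` would need to change.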


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0064811.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] (e value: 0.0; % identity: 93.21)
Query:  MATPVYGFASSNNASLRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVD
        MATP+ GF SSNNAS RLPSFPKFHFDLYPNSSFSRNSMNVACRMHF+AV A NRPNCQFSPIAIRTD  CEGVNVPIP SF LF+H++QVVKLN CRVD
Subjt:  MATPVYGFASSNNASLRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVD

Query:  NLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMV
        NLFGKKLTKFYVKDVKCVD DSKVFDEIPER LP YAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPT+LKACSRRQMVKTGKM HGYAIRKRMV
Subjt:  NLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMV

Query:  SDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL
        SDIVI NALMDFYGNC DL SSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAM+VFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL
Subjt:  SDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL

Query:  RPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKA
        RPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLR+LGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDD AEE+FAKA
Subjt:  RPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKA

Query:  EKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQT
        EKKN+TLWNEIIATY+NQGKNS ALE FRSMQHHGLKPDVVTYNTLLAGYAKNG+KVEAYELLSDML+ENLVPNVISLNVLVSGFQ SGL+YEALELCQT
Subjt:  EKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQT

Query:  MLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTM
        MLCTGSLLNK IAFPVIP+TVT+TAALAACASLNLLHKGKEIHGYMLRNYF NN+FISSALINMYAKC +IDSAIQVFSRIKNRNVVCWNALIAGLLR M
Subjt:  MLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTM

Query:  QHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHGI
        QH++AVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDN DVGVLLHGI
Subjt:  QHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHGI
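
In the alignment blocks, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how identity could be tallied from one such block, the Python below counts those symbols. The function name `alignment_stats` is hypothetical, and the result is not necessarily how the database derives the % identity it reports, which is computed over the full alignment.

```python
def alignment_stats(query: str, midline: str) -> dict[str, float]:
    """Percent identical and percent positive positions for one alignment block.

    Assumes BLAST-style midline symbols: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query line")
    # Trailing spaces are often stripped from the midline, so pad it back.
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline if ch.isalpha())
    positive = identical + midline.count("+")
    length = len(query)
    return {
        "identity_pct": 100.0 * identical / length,
        "positive_pct": 100.0 * positive / length,
    }


# First ten columns of the KAA0064811.1 alignment shown above.
stats = alignment_stats("MATPVYGFAS", "MATP+ GF S")
print(stats)
```

To score a whole hit, the counts from every block would be summed before dividing by the total aligned length, rather than averaging the per-block percentages.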

KAG6598662.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] (e value: 0.0; % identity: 79.17)
Query:  MATPVYGFASSNNAS-LRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRV
        MAT V GF SSNN S   LPS  K + DLYP+  FSRNSMNVACRMH  A+SAHNRP C+F+P+A   D N  G NVPI RSFALF+ + Q VKLN  RV
Subjt:  MATPVYGFASSNNAS-LRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRV

Query:  DNLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRM
        D+L G  L KF  K   CVDSD KVFDE+PER LPAY ALIRAYCRSEKWNELFAAF SMV+EGILPDKYLVPTILKACS RQ VKTGKM HGYAIRKR+
Subjt:  DNLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRM

Query:  VSDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEG
        VSDI I NALMDFYGNCGDL  SINVFDSMSEKDVVSWTALVSAY+EEGLL+EAME FHSMQSSGLKPDLISWNALVSGFAR+G+  TAL YLEAMQE+G
Subjt:  VSDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEG

Query:  LRPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAK
        L PRVNSWNGVISGCV NG+FKDAL VFINMLLFPENPNSVTVAS+LPACAGLR LGLGRAVHAYALKCELCTNIYVEGSLV+MYSKCGQDD AEEIFAK
Subjt:  LRPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAK

Query:  AEKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQ
        AEKKNITLWNEIIATY+NQG+ S ALE FRSMQHHGL+PDVVTYNTLLAGYAKNGQKVEAY LL++MLQ++L PNV+SLNVLVSGFQQSGL+YEALEL Q
Subjt:  AEKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQ

Query:  TMLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRT
        TML T  L++K I  P+ PN VT+TA LAACASLNLLHKGKEIHGYMLRN F +++ +SSALI+MY+KC  IDS I+VF  IKNRN VCWNALIAG  R 
Subjt:  TMLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRT

Query:  MQHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHG
        MQ KMAVELFCQMLVEGIKPSS TFSIL PAL+ R DL +RRQLHSYIIKSQ +ES +DLANVLSS+  D GVLLHG
Subjt:  MQHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHG

XP_004146805.1 pentatricopeptide repeat-containing protein At1g19720 [Cucumis sativus] (e value: 0.0; % identity: 100)
Query:  MATPVYGFASSNNASLRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVD
        MATPVYGFASSNNASLRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVD
Subjt:  MATPVYGFASSNNASLRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVD

Query:  NLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMV
        NLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMV
Subjt:  NLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMV

Query:  SDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL
        SDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL
Subjt:  SDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL

Query:  RPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKA
        RPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKA
Subjt:  RPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKA

Query:  EKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQT
        EKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQT
Subjt:  EKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQT

Query:  MLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTM
        MLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTM
Subjt:  MLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTM

Query:  QHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHGI
        QHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHGI
Subjt:  QHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHGI

XP_008445371.1 PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like [Cucumis melo] (e value: 0.0; % identity: 93.5)
Query:  MATPVYGFASSNNASLRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVD
        MATP+ GF SSNNAS RLPSFPKFHFDLYPNSSFSRNSMNVACRMHF+AV A NRPNCQFSPIAIRTD  CEGVNVPIP SF LFDH++QVVKLN CRVD
Subjt:  MATPVYGFASSNNASLRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVD

Query:  NLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMV
        NLFGKKLTKFYVKDVKCVD DSKVFDEIPERTLP YAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPT+LKACSRRQMVKTGKM HGYAIRKRMV
Subjt:  NLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMV

Query:  SDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL
        SDIVI NALMDFYGNC DL SSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAM+VFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL
Subjt:  SDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL

Query:  RPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKA
        RPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLR+LGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDD AEE+FAKA
Subjt:  RPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKA

Query:  EKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQT
        EKKN+TLWNEIIATY+NQGKNS ALE FRSMQHHGLKPDVVTYNTLLAGYAKNG+KVEAYELLSDML+ENLVPNVISLNVLVSGFQ SGL+YEALELCQT
Subjt:  EKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQT

Query:  MLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTM
        MLCTGSLLNK IAFPVIP+TVT+TAALAACASLNLLHKGKEIHGYMLRNYF NN+FISSALINMYAKC +IDSAIQVFSRIKNRNVVCWNALIAGLLR M
Subjt:  MLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTM

Query:  QHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHGI
        QH++AVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDN DVGVLLHGI
Subjt:  QHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHGI

XP_038884429.1 pentatricopeptide repeat-containing protein At1g19720-like [Benincasa hispida] (e value: 0.0; % identity: 84.44)
Query:  TPVYGFASSNNASLRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVDNL
        T + GF SS+NAS  LPS  KF+FDL P+   SRNSM VACRMHF A+SAH+RP  QFSPIA  TDRN  G  VPI RSF LF+H+AQVVKLN CRVDNL
Subjt:  TPVYGFASSNNASLRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVDNL

Query:  FGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSD
        FGKKL  FY KDV CVDSDSK+FDEIPERTL AY+ALIRAYCRSEKWNELFAAFRSMVDEGILP KYLVPTILKACSRRQMVKTGKM HGYAIRKR+VSD
Subjt:  FGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSD

Query:  IVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGLRP
        I I NAL+D YGNCGDL  SINVFDSMSEKDVVSWTALVSAYIEEGLL+E MEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL P
Subjt:  IVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGLRP

Query:  RVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEK
        RVNSWNGVISG VQNGYFKDALDVFINMLLF ENPNSVTVASILPACAGLRDLGLGRA+HAYALKCELCTNIYVEGSLVDMYSKCGQDD AEE+FAKAEK
Subjt:  RVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEK

Query:  KNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQTML
        KNITLWNEIIATY+NQ K S ALE FRS+QHHGLKPDVVTYNTLLAG+AKNGQKVEAY+LLS+MLQ++L PNV+SLNVLVSGFQQSGL+YEALEL QTML
Subjt:  KNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQTML

Query:  CTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTMQH
        C G L NK I FP+ P+TVT+TAAL ACASLNLLHKGKEIHGYM RN F +N+FISSALI+MYAKC +ID AIQVF  IKNRNVVCWNALIAGL+R MQ 
Subjt:  CTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTMQH

Query:  KMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHGI
        KMAVELFCQMLVEG+KPSS TFSILLPAL+E+ADLK RRQLHSYIIKS++LES NDLANVLSSDN D GVLLHGI
Subjt:  KMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHGI

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0KFW8 Uncharacterized protein (e value: 0.0; % identity: 100)
Query:  MATPVYGFASSNNASLRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVD
        MATPVYGFASSNNASLRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVD
Subjt:  MATPVYGFASSNNASLRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVD

Query:  NLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMV
        NLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMV
Subjt:  NLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMV

Query:  SDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL
        SDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL
Subjt:  SDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL

Query:  RPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKA
        RPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKA
Subjt:  RPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKA

Query:  EKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQT
        EKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQT
Subjt:  EKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQT

Query:  MLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTM
        MLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTM
Subjt:  MLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTM

Query:  QHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHGI
        QHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHGI
Subjt:  QHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHGI

A0A1S3BDB0 pentatricopeptide repeat-containing protein At1g19720-like (e value: 0.0; % identity: 93.5)
Query:  MATPVYGFASSNNASLRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVD
        MATP+ GF SSNNAS RLPSFPKFHFDLYPNSSFSRNSMNVACRMHF+AV A NRPNCQFSPIAIRTD  CEGVNVPIP SF LFDH++QVVKLN CRVD
Subjt:  MATPVYGFASSNNASLRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVD

Query:  NLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMV
        NLFGKKLTKFYVKDVKCVD DSKVFDEIPERTLP YAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPT+LKACSRRQMVKTGKM HGYAIRKRMV
Subjt:  NLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMV

Query:  SDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL
        SDIVI NALMDFYGNC DL SSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAM+VFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL
Subjt:  SDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL

Query:  RPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKA
        RPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLR+LGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDD AEE+FAKA
Subjt:  RPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKA

Query:  EKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQT
        EKKN+TLWNEIIATY+NQGKNS ALE FRSMQHHGLKPDVVTYNTLLAGYAKNG+KVEAYELLSDML+ENLVPNVISLNVLVSGFQ SGL+YEALELCQT
Subjt:  EKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQT

Query:  MLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTM
        MLCTGSLLNK IAFPVIP+TVT+TAALAACASLNLLHKGKEIHGYMLRNYF NN+FISSALINMYAKC +IDSAIQVFSRIKNRNVVCWNALIAGLLR M
Subjt:  MLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTM

Query:  QHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHGI
        QH++AVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDN DVGVLLHGI
Subjt:  QHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHGI

A0A5A7VGH4 Pentatricopeptide repeat-containing protein (e value: 0.0; % identity: 93.21)
Query:  MATPVYGFASSNNASLRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVD
        MATP+ GF SSNNAS RLPSFPKFHFDLYPNSSFSRNSMNVACRMHF+AV A NRPNCQFSPIAIRTD  CEGVNVPIP SF LF+H++QVVKLN CRVD
Subjt:  MATPVYGFASSNNASLRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVD

Query:  NLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMV
        NLFGKKLTKFYVKDVKCVD DSKVFDEIPER LP YAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPT+LKACSRRQMVKTGKM HGYAIRKRMV
Subjt:  NLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMV

Query:  SDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL
        SDIVI NALMDFYGNC DL SSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAM+VFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL
Subjt:  SDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGL

Query:  RPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKA
        RPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLR+LGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDD AEE+FAKA
Subjt:  RPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKA

Query:  EKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQT
        EKKN+TLWNEIIATY+NQGKNS ALE FRSMQHHGLKPDVVTYNTLLAGYAKNG+KVEAYELLSDML+ENLVPNVISLNVLVSGFQ SGL+YEALELCQT
Subjt:  EKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQT

Query:  MLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTM
        MLCTGSLLNK IAFPVIP+TVT+TAALAACASLNLLHKGKEIHGYMLRNYF NN+FISSALINMYAKC +IDSAIQVFSRIKNRNVVCWNALIAGLLR M
Subjt:  MLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTM

Query:  QHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHGI
        QH++AVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDN DVGVLLHGI
Subjt:  QHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHGI

A0A6J1BQ73 pentatricopeptide repeat-containing protein At1g19720-like (e value: 0.0; % identity: 76.74)
Query:  MATPVYGFASSNNAS--LRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHS---------A
        MAT    F S NNAS  L  PS  K +FDL+P+  FSRNSMN+ CRMHF AVSAHN P  QF P A   DRN  G N+PI RS  L + +         A
Subjt:  MATPVYGFASSNNAS--LRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHS---------A

Query:  QVVKLNDCRVDNLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKM
         VVK N  RVD+LFG KL KF  +DVKCVDSD K+FDEIPERTLPAYAALIRAYCRS+KWNELFAAFRSMVDEGI PDKYLVPTILKACS RQ+VKTGKM
Subjt:  QVVKLNDCRVDNLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKM

Query:  AHGYAIRKRMVSDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTAL
         HG+ IRK  VSDI + NALM+FYGNCGDL SSI VFDSMSEKDVVSWTALVSAY+EEGLL+EAMEVFH+MQSSGLKPDLISWNALVSGFARYGE + AL
Subjt:  AHGYAIRKRMVSDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTAL

Query:  TYLEAMQEEGLRPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQ
         YLE MQE+GL PRVNSWNG+ISGCVQNGYF+DALDVFINML FPENPNSVTVASILPACAGLRD+GLGRA+HAYALK ELC N+YVEGSLVDMYSKCGQ
Subjt:  TYLEAMQEEGLRPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQ

Query:  DDRAEEIFAKAEKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSG
        D  AE++FA+AEKKNITLWNEIIA Y+NQGK S ALE FRSMQHHGLKPDVVTYNTLLAG+AKNGQKVEAY+LLS+MLQ++L PNV+SLNVLVSGFQQ G
Subjt:  DDRAEEIFAKAEKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSG

Query:  LNYEALELCQTMLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCW
        L+YEAL+L +TMLCTG LLNK I  P+ PNTVT+TAALAACA LNL H+GKEIHGYMLRN F +N+FISSALI+ Y KC DIDSAI+VF RIKNRNVVCW
Subjt:  LNYEALELCQTMLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCW

Query:  NALIAGLLRTMQHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLS
        NALIAG ++  Q K+A+ELFC+MLVEGIKPSS T SIL PAL    DLKVRRQLHSYI KSQ LE  NDLANV S
Subjt:  NALIAGLLRTMQHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLANVLS

A0A6J1K3Z4 pentatricopeptide repeat-containing protein At1g19720-like (e value: 0.0; % identity: 79.81)
Query:  MNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVDNLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAA
        MNVACRMH  A+SAHNR  C+F+P+A   D N  G NVPI RSFALF+ + Q VKLN  RVD+L G KL K   K   CVDSD KVFDE+PER LPAY A
Subjt:  MNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVDNLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAA

Query:  LIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWT
        LIRAYCRSEKWNELFAAF SMV+EGILPDKYLVPTILKACS+ Q VKTGKM HGYAIRKR+VSDI I NALMDFYGNCGDL  SINVFDSMSEKDVVSWT
Subjt:  LIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWT

Query:  ALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGLRPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPN
        ALVSAY+EEGLL+EAME FHSMQSSGLKPDLISWNALVSGFAR+G+  TAL YLEAMQE+GL PRVNSWNGVISGCV NGYFKDAL VFINMLLFPENPN
Subjt:  ALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGLRPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPN

Query:  SVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKP
        SVTVAS+LPACAGLR LGLGRAVHAYALKCELCTNIYVEGSLV+MYSKCGQDD AEEIFAKAEKKNITLWNEIIATY+NQG+ S ALE FRSMQHHGL+P
Subjt:  SVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKP

Query:  DVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQTMLCTGSLLNKTIAFPVIPNTV-TLTAALAACASLNLLH
        DVVTYNTLLAGYAKNGQKVEAY LL++MLQ++L PNV+SLN LVSGFQQSGL+YEALEL QTML T  L++K I  P+ PN V T+TAALAACASLNLLH
Subjt:  DVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQTMLCTGSLLNKTIAFPVIPNTV-TLTAALAACASLNLLH

Query:  KGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTMQHKMAVELFCQMLVEGIKPSSATFSILLPALSERADL
        KGKEIHGYMLRN F +N+ +SSALI+MY+KC  IDS IQVF  IKNRN VCWNALIAG  R MQ KMAVELFCQMLVEGIKPSS +FSILLPAL+ R DL
Subjt:  KGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTMQHKMAVELFCQMLVEGIKPSSATFSILLPALSERADL

Query:  KVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHG
         +RRQLHSYIIKSQ +ES +DL+ VLSS+  D GV+LHG
Subjt:  KVRRQLHSYIIKSQHLESRNDLANVLSSDNVDVGVLLHG

SwissProt top hits (columns: e value, % identity, alignment)
O80647 Pentatricopeptide repeat-containing protein At2g39620 (e value: 9.8e-60; % identity: 27.49)
Query:  DSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDE-GILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSDIVIENALMDFYGNCG
        D    +FD + +  +  + ++IR Y R+    E    F  M +E GI PDKY     LKAC+     K G   H       + SD+ I  AL++ Y    
Subjt:  DSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDE-GILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSDIVIENALMDFYGNCG

Query:  DLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTA-----------------------------
        DL S+  VFD M  KDVV+W  +VS   + G  + A+ +FH M+S  +  D +S   L+   ++  +++                               
Subjt:  DLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTA-----------------------------

Query:  LTYLEAMQEEGLRPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCG
        L   E++ EE  R   +SW  +++    NG+F++ L++F  M  +    N V  AS L A A + DL  G A+H YA++  L  ++ V  SL+ MYSKCG
Subjt:  LTYLEAMQEEGLRPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCG

Query:  QDDRAEEIFAKAEKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYA-----KNGQKVEAYELLSDM-----------------
        + + AE++F   E +++  W+ +IA+Y   G++  A+  FR M    +KP+ VT  ++L G A     + G+ +  Y + +D+                 
Subjt:  QDDRAEEIFAKAEKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYA-----KNGQKVEAYELLSDM-----------------

Query:  ---------LQENLVPNVISLNVLVSGFQQSGLNYEALELCQTMLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFI
                  +   + + ++ N L  G+ Q G   +A ++ + M   G          V P++ T+   L  CA  +   +G  ++G ++++ F +   +
Subjt:  ---------LQENLVPNVISLNVLVSGFQQSGLNYEALELCQTMLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFI

Query:  SSALINMYAKCGDIDSAIQVFSRIK-NRNVVCWNALIAGLLRTMQHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIK
        + ALINM+ KC  + +AI +F +    ++ V WN ++ G L   Q + AV  F QM VE  +P++ TF  ++ A +E + L+V   +HS +I+
Subjt:  SSALINMYAKCGDIDSAIQVFSRIK-NRNVVCWNALIAGLLRTMQHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIK

Q9FM64 Pentatricopeptide repeat-containing protein At5g55740, chloroplastic (e value: 3.5e-73; % identity: 28.05)
Query:  AQVVKLNDCRVDNLF-GKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTG
        A+++K  D    N +   KL  FY K    ++    +F ++  R + ++AA+I   CR          F  M++  I PD ++VP + KAC   +  + G
Subjt:  AQVVKLNDCRVDNLF-GKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTG

Query:  KMAHGYAIRKRMVSDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNT
        +  HGY ++  +   + + ++L D YG CG L  +  VFD + +++ V+W AL+  Y++ G   EA+ +F  M+  G++P  ++ +  +S  A  G    
Subjt:  KMAHGYAIRKRMVSDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNT

Query:  A-------------------------------LTYLEAMQEEGLRPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLG
                                        + Y E + +      V +WN +ISG VQ G  +DA+ +   M L     + VT+A+++ A A   +L 
Subjt:  A-------------------------------LTYLEAMQEEGLRPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLG

Query:  LGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQK
        LG+ V  Y ++    ++I +  +++DMY+KCG    A+++F    +K++ LWN ++A Y   G +  AL  F  MQ  G+ P+V+T+N ++    +NGQ 
Subjt:  LGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQK

Query:  VEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQTMLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYF
         EA ++   M    ++PN+IS   +++G  Q+G + EA+   + M  +G          + PN  ++T AL+ACA L  LH G+ IHGY++RN   ++  
Subjt:  VEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQTMLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYF

Query:  -ISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTMQHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLE
         I ++L++MYAKCGDI+ A +VF       +   NA+I+        K A+ L+  +   G+KP + T + +L A +   D+    ++ + I+  + ++
Subjt:  -ISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTMQHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLE

Q9FXH1 Pentatricopeptide repeat-containing protein At1g19720 (e value: 5.1e-93; % identity: 33.7)
Query:  KLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSDIVI
        KL   Y K   C+    KVFD + ER L  ++A+I AY R  +W E+   FR M+ +G+LPD +L P IL+ C+    V+ GK+ H   I+  M S + +
Subjt:  KLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSDIVI

Query:  ENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGLRPRVN
         N+++  Y  CG+L  +   F  M E+DV++W +++ AY + G   EA+E+   M+  G+ P L++WN L+ G+ + G+ + A+  ++ M+  G+   V 
Subjt:  ENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGLRPRVN

Query:  SWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKKNI
        +W  +ISG + NG    ALD+F  M L    PN+VT+ S + AC+ L+ +  G  VH+ A+K     ++ V  SLVDMYSKCG+ + A ++F   + K++
Subjt:  SWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKKNI

Query:  TLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVP-NVISLNVLVSGFQQSGLNYEALELCQTMLCT
          WN +I  Y   G    A E F  MQ   L+P+++T+NT+++GY KNG + EA +L   M ++  V  N  + N++++G+ Q+G   EALEL + M  +
Subjt:  TLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVP-NVISLNVLVSGFQQSGLNYEALELCQTMLCT

Query:  GSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTMQHKM
                    +PN+VT+ + L ACA+L      +EIHG +LR      + + +AL + YAK GDI+ +  +F  ++ ++++ WN+LI G +    +  
Subjt:  GSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTMQHKM

Query:  AVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHL
        A+ LF QM  +GI P+  T S ++ A     ++   +++   I    H+
Subjt:  AVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHL

Q9SS83 Pentatricopeptide repeat-containing protein At3g09040, mitochondrial (e value: 3.3e-63; % identity: 29.45)
Query:  GKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSDI
        G  +   Y K  + V    K FD + E+ + A+ +++  Y    K  ++  +F S+ +  I P+K+    +L  C+R   V+ G+  H   I+  +  + 
Subjt:  GKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSDI

Query:  VIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGLRPR
            AL+D Y  C  +S +  VF+ + + + V WT L S Y++ GL  EA+ VF  M+  G +PD +++  +++ + R G+   A      M      P 
Subjt:  VIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGLRPR

Query:  VNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKK
        V +WN +ISG  + G    A++ F NM          T+ S+L A   + +L LG  VHA A+K  L +NIYV  SLV MYSKC + + A ++F   E+K
Subjt:  VNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKK

Query:  NITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQTM--
        N   WN +I  Y + G++   +E F  M+  G   D  T+ +LL+  A +       +  S ++++ L  N+   N LV  + + G   +A ++ + M  
Subjt:  NITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQTM--

Query:  -------LCTGSLL---NKTIAFP---------VIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIK
                  GS +   N++ AF          ++ +   L + L AC  ++ L++GK++H   ++     +    S+LI+MY+KCG I  A +VFS + 
Subjt:  -------LCTGSLL---NKTIAFP---------VIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIK

Query:  NRNVVCWNALIAGLLRTMQHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIK
          +VV  NALIAG  +    + AV LF +ML  G+ PS  TF+ ++ A  +   L +  Q H  I K
Subjt:  NRNVVCWNALIAGLLRTMQHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIK

Q9SV26 Pentatricopeptide repeat-containing protein At4g01030, mitochondrial (e value: 5.9e-73; % identity: 31.04)
Query:  SKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSDIVIENALMDFYGNCGDLSS
        +K+FDE+P+R   A+  ++    RS  W +    FR M   G       +  +L+ CS ++    G+  HGY +R  + S++ + N+L+  Y   G L  
Subjt:  SKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSDIVIENALMDFYGNCGDLSS

Query:  SINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGLRPRVNSWNGVISGCVQNGYFK
        S  VF+SM ++++ SW +++S+Y + G +++A+ +   M+  GLKPD+++WN+L+SG+A  G +  A+  L+ MQ  GL+P  +S               
Subjt:  SINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGLRPRVNSWNGVISGCVQNGYFK

Query:  DALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKKNITLWNEIIA--TYMNQG
                            ++S+L A A    L LG+A+H Y L+ +L  ++YVE +L+DMY K G    A  +F   + KNI  WN +++  +Y    
Subjt:  DALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKKNITLWNEIIA--TYMNQG

Query:  KNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQTMLCTGSLLNKTIAFPVIPN
        K++ AL     M+  G+KPD +T+N+L +GYA  G+  +A +++  M ++ + PNV+S   + SG  ++G    AL++   M   G          V PN
Subjt:  KNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQTMLCTGSLLNKTIAFPVIPN

Query:  TVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTMQHKMAVELFCQMLVEGIKP
          T++  L     L+LLH GKE+HG+ LR   + + ++++AL++MY K GD+ SAI++F  IKN+++  WN ++ G     + +  +  F  ML  G++P
Subjt:  TVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTMQHKMAVELFCQMLVEGIKP

Query:  SSATFSILL
         + TF+ +L
Subjt:  SSATFSILL

Arabidopsis top hits    e value    % identity    Alignment
AT1G19720.1 Pentatricopeptide repeat (PPR-like) superfamily protein    3.7e-94    33.7
Query:  KLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSDIVI
        KL   Y K   C+    KVFD + ER L  ++A+I AY R  +W E+   FR M+ +G+LPD +L P IL+ C+    V+ GK+ H   I+  M S + +
Subjt:  KLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSDIVI

Query:  ENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGLRPRVN
         N+++  Y  CG+L  +   F  M E+DV++W +++ AY + G   EA+E+   M+  G+ P L++WN L+ G+ + G+ + A+  ++ M+  G+   V 
Subjt:  ENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGLRPRVN

Query:  SWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKKNI
        +W  +ISG + NG    ALD+F  M L    PN+VT+ S + AC+ L+ +  G  VH+ A+K     ++ V  SLVDMYSKCG+ + A ++F   + K++
Subjt:  SWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKKNI

Query:  TLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVP-NVISLNVLVSGFQQSGLNYEALELCQTMLCT
          WN +I  Y   G    A E F  MQ   L+P+++T+NT+++GY KNG + EA +L   M ++  V  N  + N++++G+ Q+G   EALEL + M  +
Subjt:  TLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVP-NVISLNVLVSGFQQSGLNYEALELCQTMLCT

Query:  GSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTMQHKM
                    +PN+VT+ + L ACA+L      +EIHG +LR      + + +AL + YAK GDI+ +  +F  ++ ++++ WN+LI G +    +  
Subjt:  GSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTMQHKM

Query:  AVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHL
        A+ LF QM  +GI P+  T S ++ A     ++   +++   I    H+
Subjt:  AVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHL

AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein    7.0e-61    27.15
Query:  DNLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRM
        D     KL   Y  +  C +    V   IP+ T+ ++++LI A  +++ + +    F  M   G++PD +++P + K C+     K GK  H  +    +
Subjt:  DNLFGKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRM

Query:  VSDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEG
          D  ++ ++   Y  CG +  +  VFD MS+KDVV+ +AL+ AY  +G L E + +   M+SSG++ +++SWN ++SGF R                  
Subjt:  VSDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEG

Query:  LRPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAK
                         +GY K+A+ +F  +      P+ VTV+S+LP+      L +GR +H Y +K  L  +  V  +++DMY K G       +F +
Subjt:  LRPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAK

Query:  AEKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQ
         E     + N  I      G    ALE F   +   ++ +VV++ +++AG A+NG+ +EA EL  +M                   Q +G          
Subjt:  AEKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQ

Query:  TMLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRT
                        V PN VT+ + L AC ++  L  G+  HG+ +R + ++N  + SALI+MYAKCG I+ +  VF+ +  +N+VCWN+L+ G    
Subjt:  TMLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRT

Query:  MQHKMAVELFCQMLVEGIKPSSATFSILLPALSE
         + K  + +F  ++   +KP   +F+ LL A  +
Subjt:  MQHKMAVELFCQMLVEGIKPSSATFSILLPALSE

AT3G09040.1 Pentatricopeptide repeat (PPR) superfamily protein    2.3e-64    29.45
Query:  GKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSDI
        G  +   Y K  + V    K FD + E+ + A+ +++  Y    K  ++  +F S+ +  I P+K+    +L  C+R   V+ G+  H   I+  +  + 
Subjt:  GKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSDI

Query:  VIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGLRPR
            AL+D Y  C  +S +  VF+ + + + V WT L S Y++ GL  EA+ VF  M+  G +PD +++  +++ + R G+   A      M      P 
Subjt:  VIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGLRPR

Query:  VNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKK
        V +WN +ISG  + G    A++ F NM          T+ S+L A   + +L LG  VHA A+K  L +NIYV  SLV MYSKC + + A ++F   E+K
Subjt:  VNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKK

Query:  NITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQTM--
        N   WN +I  Y + G++   +E F  M+  G   D  T+ +LL+  A +       +  S ++++ L  N+   N LV  + + G   +A ++ + M  
Subjt:  NITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQTM--

Query:  -------LCTGSLL---NKTIAFP---------VIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIK
                  GS +   N++ AF          ++ +   L + L AC  ++ L++GK++H   ++     +    S+LI+MY+KCG I  A +VFS + 
Subjt:  -------LCTGSLL---NKTIAFP---------VIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIK

Query:  NRNVVCWNALIAGLLRTMQHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIK
          +VV  NALIAG  +    + AV LF +ML  G+ PS  TF+ ++ A  +   L +  Q H  I K
Subjt:  NRNVVCWNALIAGLLRTMQHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIK

AT4G01030.1 pentatricopeptide (PPR) repeat-containing protein    4.2e-74    31.04
Query:  SKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSDIVIENALMDFYGNCGDLSS
        +K+FDE+P+R   A+  ++    RS  W +    FR M   G       +  +L+ CS ++    G+  HGY +R  + S++ + N+L+  Y   G L  
Subjt:  SKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSDIVIENALMDFYGNCGDLSS

Query:  SINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGLRPRVNSWNGVISGCVQNGYFK
        S  VF+SM ++++ SW +++S+Y + G +++A+ +   M+  GLKPD+++WN+L+SG+A  G +  A+  L+ MQ  GL+P  +S               
Subjt:  SINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGLRPRVNSWNGVISGCVQNGYFK

Query:  DALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKKNITLWNEIIA--TYMNQG
                            ++S+L A A    L LG+A+H Y L+ +L  ++YVE +L+DMY K G    A  +F   + KNI  WN +++  +Y    
Subjt:  DALDVFINMLLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKKNITLWNEIIA--TYMNQG

Query:  KNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQTMLCTGSLLNKTIAFPVIPN
        K++ AL     M+  G+KPD +T+N+L +GYA  G+  +A +++  M ++ + PNV+S   + SG  ++G    AL++   M   G          V PN
Subjt:  KNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQTMLCTGSLLNKTIAFPVIPN

Query:  TVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTMQHKMAVELFCQMLVEGIKP
          T++  L     L+LLH GKE+HG+ LR   + + ++++AL++MY K GD+ SAI++F  IKN+++  WN ++ G     + +  +  F  ML  G++P
Subjt:  TVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTMQHKMAVELFCQMLVEGIKP

Query:  SSATFSILL
         + TF+ +L
Subjt:  SSATFSILL

AT5G55740.1 Tetratricopeptide repeat (TPR)-like superfamily protein    2.5e-74    28.05
Query:  AQVVKLNDCRVDNLF-GKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTG
        A+++K  D    N +   KL  FY K    ++    +F ++  R + ++AA+I   CR          F  M++  I PD ++VP + KAC   +  + G
Subjt:  AQVVKLNDCRVDNLF-GKKLTKFYVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTG

Query:  KMAHGYAIRKRMVSDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNT
        +  HGY ++  +   + + ++L D YG CG L  +  VFD + +++ V+W AL+  Y++ G   EA+ +F  M+  G++P  ++ +  +S  A  G    
Subjt:  KMAHGYAIRKRMVSDIVIENALMDFYGNCGDLSSSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNT

Query:  A-------------------------------LTYLEAMQEEGLRPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLG
                                        + Y E + +      V +WN +ISG VQ G  +DA+ +   M L     + VT+A+++ A A   +L 
Subjt:  A-------------------------------LTYLEAMQEEGLRPRVNSWNGVISGCVQNGYFKDALDVFINMLLFPENPNSVTVASILPACAGLRDLG

Query:  LGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQK
        LG+ V  Y ++    ++I +  +++DMY+KCG    A+++F    +K++ LWN ++A Y   G +  AL  F  MQ  G+ P+V+T+N ++    +NGQ 
Subjt:  LGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDVVTYNTLLAGYAKNGQK

Query:  VEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQTMLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYF
         EA ++   M    ++PN+IS   +++G  Q+G + EA+   + M  +G          + PN  ++T AL+ACA L  LH G+ IHGY++RN   ++  
Subjt:  VEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQTMLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNYFVNNYF

Query:  -ISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTMQHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLE
         I ++L++MYAKCGDI+ A +VF       +   NA+I+        K A+ L+  +   G+KP + T + +L A +   D+    ++ + I+  + ++
Subjt:  -ISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTMQHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLE


Sequences
CDS sequence
ATGGCAACTCCGGTATATGGTTTTGCTTCCTCAAATAATGCTTCTCTTCGCCTTCCATCATTCCCCAAGTTTCACTTCGACCTCTATCCCAATTCTAGCTTTTCTCGAAA
TTCCATGAATGTAGCTTGTAGAATGCATTTCCATGCGGTATCGGCCCATAATAGACCCAATTGTCAATTTTCTCCAATTGCTATACGTACGGATCGTAATTGCGAAGGTG
TTAATGTCCCAATCCCTCGTAGTTTTGCTTTGTTTGATCATAGTGCCCAGGTTGTTAAATTAAATGATTGTCGAGTTGATAATTTGTTTGGAAAGAAGTTGACTAAGTTT
TATGTCAAGGATGTTAAGTGCGTGGACAGTGACAGTAAGGTGTTCGATGAAATTCCTGAGAGAACGCTGCCAGCCTATGCAGCTTTGATTAGAGCGTATTGTCGATCAGA
GAAGTGGAATGAGCTCTTTGCGGCATTCAGATCGATGGTTGATGAGGGTATACTACCCGATAAATACTTAGTGCCCACGATTCTTAAAGCATGTTCCAGAAGACAAATGG
TGAAGACTGGTAAAATGGCTCATGGGTATGCCATTAGGAAGAGGATGGTCTCTGATATTGTTATTGAGAATGCTCTTATGGATTTCTATGGCAATTGTGGGGATTTGAGT
TCTTCGATCAATGTTTTTGATTCGATGAGTGAAAAAGATGTGGTTTCGTGGACCGCGCTTGTCTCTGCCTACATTGAAGAAGGTCTTTTGAATGAGGCGATGGAAGTATT
TCACTCCATGCAGTCTAGTGGGTTGAAGCCTGATTTGATATCTTGGAATGCACTGGTCTCAGGGTTTGCTCGATATGGAGAGACTAATACTGCGCTCACATACTTGGAAG
CCATGCAAGAAGAAGGATTGAGACCAAGGGTTAATTCATGGAACGGAGTCATATCAGGCTGTGTTCAAAATGGATATTTCAAAGATGCTTTGGATGTATTTATTAATATG
CTGTTGTTTCCTGAGAATCCAAATTCTGTTACTGTTGCGAGTATATTACCAGCTTGTGCAGGGTTGAGAGATCTAGGTTTAGGCAGGGCTGTTCATGCATATGCTCTTAA
GTGCGAGCTGTGTACAAACATTTACGTCGAAGGATCATTAGTTGATATGTATTCAAAATGCGGACAAGATGATCGTGCTGAAGAAATTTTTGCCAAAGCAGAGAAGAAAA
ACATTACATTGTGGAATGAAATTATTGCAACTTACATGAACCAGGGAAAGAATAGCTGGGCGTTAGAACATTTTAGATCAATGCAGCATCATGGACTAAAACCTGATGTT
GTAACCTACAACACACTGCTAGCTGGATATGCAAAAAATGGGCAGAAAGTTGAAGCATATGAGTTGCTATCTGATATGTTACAAGAAAATTTGGTGCCAAATGTTATATC
TTTAAATGTTTTAGTGTCTGGATTTCAACAATCTGGGTTAAATTATGAAGCTCTAGAATTATGCCAGACCATGCTATGCACGGGCTCCCTTCTTAACAAGACGATTGCTT
TTCCGGTCATACCAAATACCGTCACTCTAACTGCTGCTCTGGCAGCTTGTGCTAGCTTGAATTTGTTGCACAAAGGGAAGGAAATCCATGGATATATGTTGAGGAATTAT
TTTGTAAACAACTACTTCATTTCAAGTGCTCTAATTAACATGTACGCAAAGTGTGGGGATATTGATTCGGCAATTCAAGTATTTAGCAGAATAAAGAACCGGAATGTAGT
TTGTTGGAATGCTTTGATTGCAGGTCTTTTGAGAACAATGCAGCACAAAATGGCAGTTGAACTATTCTGTCAAATGCTAGTAGAAGGCATAAAACCAAGTTCAGCCACTT
TTTCAATACTTCTCCCTGCCTTATCTGAAAGGGCAGATTTGAAAGTGAGAAGACAGCTGCATTCCTATATCATCAAAAGTCAGCACCTTGAATCACGCAACGACCTTGCA
AATGTCTTAAGTTCAGACAATGTTGATGTTGGAGTTTTGCTCCATGGAATATAA
mRNA sequence
GTACATCTTTCAATTCTGAGCTCACAACGCTTTGTCTTCAACATTTTTTTTCCTTTAACTATGGCAACTCCGGTATATGGTTTTGCTTCCTCAAATAATGCTTCTCTTCG
CCTTCCATCATTCCCCAAGTTTCACTTCGACCTCTATCCCAATTCTAGCTTTTCTCGAAATTCCATGAATGTAGCTTGTAGAATGCATTTCCATGCGGTATCGGCCCATA
ATAGACCCAATTGTCAATTTTCTCCAATTGCTATACGTACGGATCGTAATTGCGAAGGTGTTAATGTCCCAATCCCTCGTAGTTTTGCTTTGTTTGATCATAGTGCCCAG
GTTGTTAAATTAAATGATTGTCGAGTTGATAATTTGTTTGGAAAGAAGTTGACTAAGTTTTATGTCAAGGATGTTAAGTGCGTGGACAGTGACAGTAAGGTGTTCGATGA
AATTCCTGAGAGAACGCTGCCAGCCTATGCAGCTTTGATTAGAGCGTATTGTCGATCAGAGAAGTGGAATGAGCTCTTTGCGGCATTCAGATCGATGGTTGATGAGGGTA
TACTACCCGATAAATACTTAGTGCCCACGATTCTTAAAGCATGTTCCAGAAGACAAATGGTGAAGACTGGTAAAATGGCTCATGGGTATGCCATTAGGAAGAGGATGGTC
TCTGATATTGTTATTGAGAATGCTCTTATGGATTTCTATGGCAATTGTGGGGATTTGAGTTCTTCGATCAATGTTTTTGATTCGATGAGTGAAAAAGATGTGGTTTCGTG
GACCGCGCTTGTCTCTGCCTACATTGAAGAAGGTCTTTTGAATGAGGCGATGGAAGTATTTCACTCCATGCAGTCTAGTGGGTTGAAGCCTGATTTGATATCTTGGAATG
CACTGGTCTCAGGGTTTGCTCGATATGGAGAGACTAATACTGCGCTCACATACTTGGAAGCCATGCAAGAAGAAGGATTGAGACCAAGGGTTAATTCATGGAACGGAGTC
ATATCAGGCTGTGTTCAAAATGGATATTTCAAAGATGCTTTGGATGTATTTATTAATATGCTGTTGTTTCCTGAGAATCCAAATTCTGTTACTGTTGCGAGTATATTACC
AGCTTGTGCAGGGTTGAGAGATCTAGGTTTAGGCAGGGCTGTTCATGCATATGCTCTTAAGTGCGAGCTGTGTACAAACATTTACGTCGAAGGATCATTAGTTGATATGT
ATTCAAAATGCGGACAAGATGATCGTGCTGAAGAAATTTTTGCCAAAGCAGAGAAGAAAAACATTACATTGTGGAATGAAATTATTGCAACTTACATGAACCAGGGAAAG
AATAGCTGGGCGTTAGAACATTTTAGATCAATGCAGCATCATGGACTAAAACCTGATGTTGTAACCTACAACACACTGCTAGCTGGATATGCAAAAAATGGGCAGAAAGT
TGAAGCATATGAGTTGCTATCTGATATGTTACAAGAAAATTTGGTGCCAAATGTTATATCTTTAAATGTTTTAGTGTCTGGATTTCAACAATCTGGGTTAAATTATGAAG
CTCTAGAATTATGCCAGACCATGCTATGCACGGGCTCCCTTCTTAACAAGACGATTGCTTTTCCGGTCATACCAAATACCGTCACTCTAACTGCTGCTCTGGCAGCTTGT
GCTAGCTTGAATTTGTTGCACAAAGGGAAGGAAATCCATGGATATATGTTGAGGAATTATTTTGTAAACAACTACTTCATTTCAAGTGCTCTAATTAACATGTACGCAAA
GTGTGGGGATATTGATTCGGCAATTCAAGTATTTAGCAGAATAAAGAACCGGAATGTAGTTTGTTGGAATGCTTTGATTGCAGGTCTTTTGAGAACAATGCAGCACAAAA
TGGCAGTTGAACTATTCTGTCAAATGCTAGTAGAAGGCATAAAACCAAGTTCAGCCACTTTTTCAATACTTCTCCCTGCCTTATCTGAAAGGGCAGATTTGAAAGTGAGA
AGACAGCTGCATTCCTATATCATCAAAAGTCAGCACCTTGAATCACGCAACGACCTTGCAAATGTCTTAAGTTCAGACAATGTTGATGTTGGAGTTTTGCTCCATGGAAT
ATAATGCATAGTATTGTGTGATCATATAAAGGCCGTTCTCTCTCTGAAAATGGTCGGGATTCCCTTTTTTCTAGTACCTGGATGAAAATATTAGATGAAGATTAAGGCAG
TGAAAGATAATTCTGTATTATGAAAGAGTCCAGTTCAGCATGATACACCTGACCCGCGTTAGTGGCAATCGGCCAAACTTTGGTTTCAATACATGGGACTTCCTTCAGAT
GATGAGGCACGACATCTGATCACCACCACGGTTCAAGGGGAACTTTCTTGCCGGATGGGATGATGTTTAAAAAATGCATCTGGTGCTAAGGTGCGCGCCATGCTATTTCA
TTGCAAGTAAGAGTTTAATATATACCTCCCAGCCTCTTATCCCTAAGACGACAGACTGATCAAGCATTACTATACGTATTATCCTCCTAAGAGTACTCGTCTAGACTTGG
CTCTCTGACCCATTGTAACTAAGACTCAGATTTTCCTATTTTTGGGATTACATCCAATATGCAAATCCATGCCATGGTTAAGTTTAGTGAGAAGGCATGTGCTTATTGAA
AATGAGTGAATCCTAAGTTTCTACGTTTTTAAGTTTTCACAATGGAAAAAAGTCAAGTGGGAATGTGGATGAAGATGAAAATCCGAGGATGATACGCATACAAGACAAGG
TTTATATATAATTAGTCTTGGGCATCTTTTCTCAATGCTCAAGAATTGATGTTGCATCTCACGTTTTGCACTGACATCAATCAATTGAGAGATGGAGAGAGTTGACGTCA
GATCGTTATATAAATCTGTACTTGAGGGAATTTTTCTCCTTAGCAGAGGTAGACTGACACTAATTATTTGAGCCATGGTCTAAAATTTTTCGGTATCAGCTGATTCACTT
TTTAAATGCTTAAAAATAATTACGAAATGCTACACTTCACAACCTGCATCTGCATAGCATCATTTTCTATTCCAACAAAATCAAGACATGAGAACGCTACCAGCTACCAT
CACCATCATGTATTATGGGAATGAATGAATATGTTCAGTAAAAGATCTATTGTTTACAGATTTAATCCCTAGATCTATCGACATCTTCCCATTCATGTTGTAATTTGAAC
AATGAGATTCTAAACGTCTAATTCTTCTAAATGTCGGGATTGAAGTTGTATCTCAATTGTTTGTCATGAAGTTCATAGACATTGAATTCGACTACGTATTCAGTTTGACC
ATTTTCACTAGTTGGGTTGGGTTCGTTTGATAATTTTCCTCTCAAAGTTTCAACTGTTTAAATGGTCAAACCCATATTTAAGAATCACAATTAAGGATAAAAATGGCAGG
AAGAAAGAGAAAACACGATAGAGAAAGCAGTGAGAACTGGACAAGTCTTTAAATAGACAGGCAGATAATGTAGGTACGAAAATGAAAATAGGGAATCGTATTATCAGTTT
GAAAAAGATTACCTTCAAATTCAAGCGGACGGCAGGGAGTACAATCGGTAACAACATCAAATCGAGCGTTTATTGACCAACAACAGTTAGCAAGATGCATTTCGATTATT
GATAAAGATTCGAATTGGGGAAAAACCCTAAGATTTGAACTTCTGAAGCAATTTTTTACTGACTTCATACTGTGGTTTTAGATAGTGATTCTCTTGGAAGACCTTTCCAA
ATGGATGTAAACAGACTGCAGAAAGCGCAAGAGGCCCTTGATGCCGAAATTAAATCATTTTTTGATTCAGCTCCCCCTTTGAGGAATATTGAGGATATCGGCAAAGATTT
AAGAAAGTTTGTTGAATTTAATCCACCCCAAGCCGGTAAGTTCTCTCTTCGTTAATCTCTTAAGCATGTCTTTTTGTTTTGATTAGTTGTTACTCCCTAATATTCTACTG
GGGATTACTCAATCTGCTATCCTGCAAGTGCCTGTATTGTTTAACACTTACTTGACATTCTGGAAAGTTACAGATCTATTAAGAACTATTTTTTGAAGTAAATGGAATAA
ACAAACCATTCGAATAAATTGAGGA
Protein sequence
MATPVYGFASSNNASLRLPSFPKFHFDLYPNSSFSRNSMNVACRMHFHAVSAHNRPNCQFSPIAIRTDRNCEGVNVPIPRSFALFDHSAQVVKLNDCRVDNLFGKKLTKF
YVKDVKCVDSDSKVFDEIPERTLPAYAALIRAYCRSEKWNELFAAFRSMVDEGILPDKYLVPTILKACSRRQMVKTGKMAHGYAIRKRMVSDIVIENALMDFYGNCGDLS
SSINVFDSMSEKDVVSWTALVSAYIEEGLLNEAMEVFHSMQSSGLKPDLISWNALVSGFARYGETNTALTYLEAMQEEGLRPRVNSWNGVISGCVQNGYFKDALDVFINM
LLFPENPNSVTVASILPACAGLRDLGLGRAVHAYALKCELCTNIYVEGSLVDMYSKCGQDDRAEEIFAKAEKKNITLWNEIIATYMNQGKNSWALEHFRSMQHHGLKPDV
VTYNTLLAGYAKNGQKVEAYELLSDMLQENLVPNVISLNVLVSGFQQSGLNYEALELCQTMLCTGSLLNKTIAFPVIPNTVTLTAALAACASLNLLHKGKEIHGYMLRNY
FVNNYFISSALINMYAKCGDIDSAIQVFSRIKNRNVVCWNALIAGLLRTMQHKMAVELFCQMLVEGIKPSSATFSILLPALSERADLKVRRQLHSYIIKSQHLESRNDLA
NVLSSDNVDVGVLLHGI
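
As a quick consistency check between the CDS and protein sequences listed above, the following minimal sketch translates the first ten and last four codons of the CDS with the standard genetic code (NCBI translation table 1). The function name `translate` and the two short excerpts are illustrative choices, not part of the database record; the sketch assumes the nuclear standard code applies and that the input contains only A, C, G and T.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon ('*' marks a stop codon).

    Hypothetical helper for illustration; raises ValueError on any base
    other than A, C, G or T. Trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in cds[i:i + 3])
        protein.append(AMINO_ACIDS[16 * b1 + 4 * b2 + b3])
    return "".join(protein)


# Excerpts copied from the start and end of the CDS sequence above.
first_codons = "ATGGCAACTCCGGTATATGGTTTTGCTTCC"
last_codons = "CATGGAATATAA"

print(translate(first_codons))  # MATPVYGFAS
print(translate(last_codons))   # HGI*
```

The two outputs match the first ten residues (MATPVYGFAS) and the last three residues (HGI) of the protein sequence, followed by the stop codon. Passing the full CDS to the same function would extend the check to the whole protein.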