; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS012475 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS012475
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold63:978442..980532
RNA-Seq ExpressionMS012475
SyntenyMS012475
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047827.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0079.31Show/hide
Query:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSA
        E L+LY++IRISGA L+D  V PSILK+CSN+SF LGTAMHGCLIKQG +SSTSI NSTI  YMK+G+L SAQRAFDS KNKDSVSWNVMVHGNFSN S 
Subjt:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSA

Query:  EAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGE
         AGLWWF KGRF+HFQPN+SSL+LV+ AFR+LK Y +GFA+HGY +R GF AI+SVQNSLLSLYAEV++  AHK+F EMS RNDVVSWSVMIGGFVQIGE
Subjt:  EAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGE

Query:  DERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLN
        DE+GL MFRNMVTEAGI+ DGVTVVSVLKACTNLRDISLG MVHG  I RG EDDLFVGNSL+DMYSKCC+  SAFK FKE+PE+NI+SWN MLS Y+LN
Subjt:  DERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLN

Query:  EKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVH
        + HLEA+ALLGTMVEEG EKDEVT VNVLQI +HFLD L+C+SVHG+IIR+G+ESN  L+NSVIDAYAKCNLVELAG +FDGM KKDVVAWSTM+AGF  
Subjt:  EKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVH

Query:  CGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGI
         G+PD+AISVFKQMN+EV PN VSIMNLMEACA+SAELR+SKWAHGIA+RRGLAG+VA+GT+I+DMYSKCGDI+ASIRAFNQIP+KN+VCWSAMISA+ I
Subjt:  CGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGI

Query:  NGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLL
        NGLAHE+L+  EK+KQNG KPNAVT LSLLSACSHGGL+EEG+SFFTSM ++HG+EPGLEHYSC +DML+RAGKFN+ALELIEKMP +MEAGASIWGTLL
Subjt:  NGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLL

Query:  SSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL
        SSCRSYGN++LGSGAAS +L+LEP SS GYMLASNLYANCG M +SA+MRRLAKE+GVKVVAGYSLVH NS+TWRFVAGD L+PRA EIYLMV QL
Subjt:  SSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL

KAG6570977.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0081.32Show/hide
Query:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSA
        E LQLY +IR+SG+ L D SV PSILKACSN+SFKLGTAMHGCLIKQG +SSTS+ANSTIDLYMKWG+L SA RAF SLKNKDSVSWNVMVHGNFSN   
Subjt:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSA

Query:  EAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGE
         AGLWWF   RF++FQPNVSSLVLV+ AFR+ KSY EGFA HGY IR GF AI+SVQNSLLSLY EV+M  AHK+FDEMS RND+VSWSVM GGFVQIGE
Subjt:  EAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGE

Query:  DERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLN
        DE GL MFR+MVTEAGI+PDGVT+VSVLKACTNLRDISLG MVHG  +CRG EDDLFVGNSLIDMYSKC    S+FK F  MPE+NIVSWNSMLS Y LN
Subjt:  DERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLN

Query:  EKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVH
        EK LEAVALL TMVEEGVEKDEVTFVNVLQIV+HFLD LQC+SVH  IIRRG+ESN  ++NSVIDAYAKCNL+ELAG LF GMKKKDVV WSTM+AGF +
Subjt:  EKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVH

Query:  CGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGI
         G PD+AIS+FK+MN+EVKPN+VSIMNLMEACAVSAE RRSKWAHGIAVRRGLA +VAVGTAIIDMYSKCGDI ASIRAFNQIPEKNVVCWSAMISA+GI
Subjt:  CGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGI

Query:  NGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLL
        NGLAH++L+  EKMKQN MKPNAVT LSLLSACSHGGL+EEG+SFFTSM K+H + PGLEHYSC IDMLARAGKF DALELIEKMP +MEAGASIWGTLL
Subjt:  NGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLL

Query:  SSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL
        SSCRSYGN+VLGSGAAS +L LEP +S GYMLASNLYANCGLM+ SA+MRRLAKERGVKVVAGYSLVHINS++WRFVAGDE +PRA EIYLMV+QL
Subjt:  SSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL

XP_022149161.1 pentatricopeptide repeat-containing protein At2g17210 [Momordica charantia]0.0e+0099.14Show/hide
Query:  LEILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRS
        LEILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRS
Subjt:  LEILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRS

Query:  AEAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIG
        AEAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIG
Subjt:  AEAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIG

Query:  EDERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVL
        EDERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVL
Subjt:  EDERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVL

Query:  NEKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFV
        NEKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMK+KDVVAWSTM+AGFV
Subjt:  NEKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFV

Query:  HCGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYG
        HCGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDI+ASIRAFNQIPEKNVVCWSAMISAYG
Subjt:  HCGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYG

Query:  INGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTL
        INGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEG+SFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIE+MPGQMEAGASIWGTL
Subjt:  INGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTL

Query:  LSSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL
        LSSCRSYGNVVLGSGAASHILRLEPSSS GYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL
Subjt:  LSSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL

XP_023512125.1 pentatricopeptide repeat-containing protein At2g17210 [Cucurbita pepo subsp. pepo]0.0e+0080.75Show/hide
Query:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSA
        E LQLY +IR+SG+ L D SV PSILKACSN+SFKLGTAMHGCLIKQG +SSTS+ANS IDLYMKWG+L SA RAF SLKNKDSVSWNVMVHGNFSN   
Subjt:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSA

Query:  EAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGE
         AGLWWF   RF+ FQPNVSSLV+V+ AFR+ KSY EGFA HGY IR GF AI+SVQNSLLSLY EV+M  AHK+FDEM  RND+VSWSVM GGFVQIGE
Subjt:  EAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGE

Query:  DERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLN
        DE GL MFR+MVTEAGI+PDGVT+VSVLKACTNLRDISLG MVHG  +CRG EDDLFVGNSLIDMYSKC    S+FK FK MPE+NIVSWNSMLS Y LN
Subjt:  DERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLN

Query:  EKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVH
        EK LEAVALL TMVEEGVEKDEVTFVNVLQI +HFLD LQC+SVHG IIRRG+ESN  ++NSVIDAYAKCNL+ELAG LFDGMKKKDVV WSTM+AGF +
Subjt:  EKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVH

Query:  CGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGI
         G PD+AIS+FK+MN+EVKPN+VSIMNLMEACAVSAE RRSKWAHGIAVRRGLA +VAVGTAIIDMYSKCGDI ASIRAFNQIPEKNVVCWSAMISA+GI
Subjt:  CGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGI

Query:  NGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLL
        NGLAHE+L+  EKMKQ  MKPNAVT LSLLSACSHGGL+EEG+S F SM K+H + PGLEHYSC +DMLARAGKF DALELIEKMP +MEAGASIWGTLL
Subjt:  NGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLL

Query:  SSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL
        SSCRSYGN+VLGSGAAS +L LEP +S GYMLASNLYANCGLM++SA+MRRLAKERGVKVVAGYSLVHINS++WRFVAGDE +PRA EIYLMV+QL
Subjt:  SSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL

XP_038878587.1 pentatricopeptide repeat-containing protein At2g17210 isoform X1 [Benincasa hispida]0.0e+0081.03Show/hide
Query:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSA
        E LQ+YH+IR SG HL +  V P ILKACSN+SFKLGTAMHGCLIKQG ESSTSIANSTIDLYMKWG+L SA RAFDS  NKDSVSWNVMVHGNFSN   
Subjt:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSA

Query:  EAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGE
         AG WWF KGRF+HFQPNVSSLVLV+ AFR+LK Y +GFA+HGY IR GF AI+SVQNSLLSLYAEVNM  AHK+FDEMS RNDVVSWSVM GGFVQIGE
Subjt:  EAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGE

Query:  DERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLN
         E GL MFRNMVTEAGI+PDGV VVSVLKACT+LRDISLG +VHG  I RG EDDLFVGNSLIDMYSKC D  SAFK FKE+PE+NI+SWN MLS Y+LN
Subjt:  DERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLN

Query:  EKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVH
        EK LEAVAL+GTMVEEG EKDEVTFVNVLQ+V+HFLD LQC+SVHGMIIR+G+ESN  +++S+ID+YAKCNLVELAGTLFDGMKKKDVVAWSTM+AG   
Subjt:  EKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVH

Query:  CGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGI
         G+PD+AISVFKQMN+EV PN+VSIMNLMEACAVSAELR+++WAHGIAVRRGLAG+VAVGTAIIDMYSKCGDI+AS+RAFNQIPEKNVVCWSAMISA+GI
Subjt:  CGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGI

Query:  NGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLL
        NGLAHE+L+  EK+KQN  KPNAVT LSLLSACSHGGL+EEG+SFFTSM K+HG+EPGLEHYSC +DML+RAGKFN+ALELIEKMP +MEAGASIWGTLL
Subjt:  NGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLL

Query:  SSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL
        SSCRSYGN+VLG  AAS +L+LEP SS GY+LASNLYANCGLM +SA+MRRLAK+RGVKVVAGYSLVHINS+TWRFVAGDEL+PRA EIYLMV+QL
Subjt:  SSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL

TrEMBL top hitse value%identityAlignment
A0A1S3BJ38 pentatricopeptide repeat-containing protein At2g172100.0e+0079.31Show/hide
Query:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSA
        E L+LY++IRISGA L+D  V PSILK+CSN+SF LGTAMHGCLIKQG +SSTSIANSTI  YMK+G+L SAQRAFDS KNKDSVSWNVMVHGNFSN S 
Subjt:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSA

Query:  EAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGE
         AGLWWF KGRF+HFQPN+SSL+LV+ AFR+LK Y +GFA+HGY +R GF AI+SVQNSLLSLYAEV++  AHK+F EMS RNDVVSWSVMIGGFVQIGE
Subjt:  EAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGE

Query:  DERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLN
        DE+GL MFRNMVTEAGI+ DGVTVVSVLKACTNLRDISLG MVHG  I RG EDDLFVGNSL+DMYSKCC+  SAFK FKE+PE+NI+SWN MLS Y+LN
Subjt:  DERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLN

Query:  EKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVH
        + HLEA+ALLGTMVEEG EKDEVT VNVLQI +HFLD L+C+SVHG+IIR+G+ESN  L+NSVIDAYAKCNLVELAG +F GM KKDVVAWSTM+AGF  
Subjt:  EKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVH

Query:  CGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGI
         G+PD+AISVFKQMN+EV PN VSIMNLMEACA+SAELR+SKWAHGIA+RRGLAG+VA+GT+IIDMYSKCGDI+ASIRAFNQIP+KN+VCWSAMISA+ I
Subjt:  CGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGI

Query:  NGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLL
        NGLAHE+L+  EK+KQNG KPNAVT LSLLSACSHGGL+EEG+SFFTSM ++HG+EPGLEHYSC +DML+RAGKFN+ALELIEKMP +MEAGASIWGTLL
Subjt:  NGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLL

Query:  SSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL
        SSCRSYGN++LGSGAAS +L+LEP SS GYMLASNLYA CG M +SA+MRRLAKE+GVKVVAGYSLVH NS+TWRFVAGD L+PRA EIYLMV QL
Subjt:  SSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL

A0A5A7U0T7 Pentatricopeptide repeat-containing protein0.0e+0079.31Show/hide
Query:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSA
        E L+LY++IRISGA L+D  V PSILK+CSN+SF LGTAMHGCLIKQG +SSTSI NSTI  YMK+G+L SAQRAFDS KNKDSVSWNVMVHGNFSN S 
Subjt:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSA

Query:  EAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGE
         AGLWWF KGRF+HFQPN+SSL+LV+ AFR+LK Y +GFA+HGY +R GF AI+SVQNSLLSLYAEV++  AHK+F EMS RNDVVSWSVMIGGFVQIGE
Subjt:  EAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGE

Query:  DERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLN
        DE+GL MFRNMVTEAGI+ DGVTVVSVLKACTNLRDISLG MVHG  I RG EDDLFVGNSL+DMYSKCC+  SAFK FKE+PE+NI+SWN MLS Y+LN
Subjt:  DERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLN

Query:  EKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVH
        + HLEA+ALLGTMVEEG EKDEVT VNVLQI +HFLD L+C+SVHG+IIR+G+ESN  L+NSVIDAYAKCNLVELAG +FDGM KKDVVAWSTM+AGF  
Subjt:  EKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVH

Query:  CGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGI
         G+PD+AISVFKQMN+EV PN VSIMNLMEACA+SAELR+SKWAHGIA+RRGLAG+VA+GT+I+DMYSKCGDI+ASIRAFNQIP+KN+VCWSAMISA+ I
Subjt:  CGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGI

Query:  NGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLL
        NGLAHE+L+  EK+KQNG KPNAVT LSLLSACSHGGL+EEG+SFFTSM ++HG+EPGLEHYSC +DML+RAGKFN+ALELIEKMP +MEAGASIWGTLL
Subjt:  NGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLL

Query:  SSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL
        SSCRSYGN++LGSGAAS +L+LEP SS GYMLASNLYANCG M +SA+MRRLAKE+GVKVVAGYSLVH NS+TWRFVAGD L+PRA EIYLMV QL
Subjt:  SSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL

A0A5D3CWI7 Pentatricopeptide repeat-containing protein0.0e+0079.31Show/hide
Query:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSA
        E L+LY++IRISGA L+D  V PSILK+CSN+SF LGTAMHGCLIKQG +SSTSIANSTI  YMK+G+L SAQRAFDS KNKDSVSWNVMVHGNFSN S 
Subjt:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSA

Query:  EAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGE
         AGLWWF KGRF+HFQPN+SSL+LV+ AFR+LK Y +GFA+HGY +R GF AI+SVQNSLLSLYAEV++  AHK+F EMS RNDVVSWSVMIGGFVQIGE
Subjt:  EAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGE

Query:  DERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLN
        DE+GL MFRNMVTEAGI+ DGVTVVSVLKACTNLRDISLG MVHG  I RG EDDLFVGNSL+DMYSKCC+  SAFK FKE+PE+NI+SWN MLS Y+LN
Subjt:  DERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLN

Query:  EKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVH
        + HLEA+ALLGTMVEEG EKDEVT VNVLQI +HFLD L+C+SVHG+IIR+G+ESN  L+NSVIDAYAKCNLVELAG +F GM KKDVVAWSTM+AGF  
Subjt:  EKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVH

Query:  CGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGI
         G+PD+AISVFKQMN+EV PN VSIMNLMEACA+SAELR+SKWAHGIA+RRGLAG+VA+GT+IIDMYSKCGDI+ASIRAFNQIP+KN+VCWSAMISA+ I
Subjt:  CGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGI

Query:  NGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLL
        NGLAHE+L+  EK+KQNG KPNAVT LSLLSACSHGGL+EEG+SFFTSM ++HG+EPGLEHYSC +DML+RAGKFN+ALELIEKMP +MEAGASIWGTLL
Subjt:  NGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLL

Query:  SSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL
        SSCRSYGN++LGSGAAS +L+LEP SS GYMLASNLYA CG M +SA+MRRLAKE+GVKVVAGYSLVH NS+TWRFVAGD L+PRA EIYLMV QL
Subjt:  SSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL

A0A6J1D7J1 pentatricopeptide repeat-containing protein At2g172100.0e+0099.14Show/hide
Query:  LEILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRS
        LEILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRS
Subjt:  LEILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRS

Query:  AEAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIG
        AEAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIG
Subjt:  AEAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIG

Query:  EDERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVL
        EDERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVL
Subjt:  EDERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVL

Query:  NEKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFV
        NEKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMK+KDVVAWSTM+AGFV
Subjt:  NEKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFV

Query:  HCGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYG
        HCGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDI+ASIRAFNQIPEKNVVCWSAMISAYG
Subjt:  HCGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYG

Query:  INGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTL
        INGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEG+SFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIE+MPGQMEAGASIWGTL
Subjt:  INGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTL

Query:  LSSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL
        LSSCRSYGNVVLGSGAASHILRLEPSSS GYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL
Subjt:  LSSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL

A0A6J1FSK0 pentatricopeptide repeat-containing protein At2g172100.0e+0080.6Show/hide
Query:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSA
        E LQLY +IRISG+ L D SV PSILKACSN+SFKLGTAMHGCLIKQG ESSTS+ANSTIDLYMKWG+L SA RAF SLKNKDSVSWNVMVHGNFSN   
Subjt:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSA

Query:  EAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGE
         AGLWWF   RF++FQPNVSSLVLV+ AFR+ KSY EGFA HGY IR GF AI+SVQNSLLSLY EV+M  AHK+FDEMS RND+VSWSVM GGFVQIGE
Subjt:  EAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGE

Query:  DERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLN
        DE GL MFR+MVTEAGI+PDGVT+VSVLKACTNLRDISLG MVHG  +CRG EDDLFVGNSLIDMYSKC    S+FK F  MPE+NIVSWNSMLS Y LN
Subjt:  DERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLN

Query:  EKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVH
        EK LEAVALL TMVEE VEKDEVTFVNVLQIV+HFLD LQC+SVH  IIRRG+ESN  ++NSVIDAYAKCNL+ELAG LFDGMKKKDVV WSTM+AGF +
Subjt:  EKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVH

Query:  CGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGI
         G PD+AI +FK+MN+EVKPN+VSIMNLMEACAVSAE RRSKWAHGIAVRRGLA +VAVGTAIIDMYSKCGDI ASIRAFNQIPEKNVVCWSAMISA+GI
Subjt:  CGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGI

Query:  NGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLL
        N LAHE+L+  EKMKQN MKPNAVT LSLLSACSHGGL+EEG+SFFTSM K+H + PGLEHYSC IDMLAR GKF DALE+IE MP +MEAGASIWGTLL
Subjt:  NGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLL

Query:  SSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL
        SSCRSYGN++LGSGAAS +L LEP +S GYMLASNLYANCGLM++SA+MRRLAKERGVKVVAGYSLVHINS++WRFVAGDE +PRA EIYL ++QL
Subjt:  SSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL

SwissProt top hitse value%identityAlignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic2.8e-11031.75Show/hide
Query:  ILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSAEAGLWWFMKGRFSHFQPNVSSLVL
        +L+ CS+L  K    +   + K G           + L+ ++G +  A R F+ + +K +V ++ M+ G       +  L +F++ R+   +P V +   
Subjt:  ILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSAEAGLWWFMKGRFSHFQPNVSSLVL

Query:  VLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCS-AHKMFDEMSYRNDVVSWSVMIGGFVQIGEDERGLEMFRNMVTEAGITPDGVT
        +L    D    R G  +HG  ++ GF   +     L ++YA+    + A K+FD M  R D+VSW+ ++ G+ Q G     LEM ++M  E  + P  +T
Subjt:  VLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCS-AHKMFDEMSYRNDVVSWSVMIGGFVQIGEDERGLEMFRNMVTEAGITPDGVT

Query:  VVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLNEKHLEAVALLGTMVEEGVEKDEV
        +VSVL A + LR IS+G+ +HG+A+  G++  + +  +L+DMY+KC    +A ++F  M ERN+VSWNSM+  YV NE   EA+ +   M++EGV+  +V
Subjt:  VVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLNEKHLEAVALLGTMVEEGVEKDEV

Query:  TFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVHCGQPDQAISVFKQM-NKEVKPNE
        + +  L       D  + + +H + +  G + N S+VNS+I  Y KC  V+ A ++F  ++ + +V+W+ M+ GF   G+P  A++ F QM ++ VKP+ 
Subjt:  TFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVHCGQPDQAISVFKQM-NKEVKPNE

Query:  VSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGINGLAHESLVSLEKMKQNGMKPN
         + ++++ A A  +    +KW HG+ +R  L  +V V TA++DMY+KCG I  +   F+ + E++V  W+AMI  YG +G    +L   E+M++  +KPN
Subjt:  VSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGINGLAHESLVSLEKMKQNGMKPN

Query:  AVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLLSSCRSYGNVVLGSGAASHILRL
         VT LS++SACSH GL+E G+  F  M++ + +E  ++HY   +D+L RAG+ N+A + I +MP  ++   +++G +L +C+ + NV     AA  +  L
Subjt:  AVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLLSSCRSYGNVVLGSGAASHILRL

Query:  EPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL
         P     ++L +N+Y    +  +  ++R     +G++   G S+V I +E   F +G   HP + +IY  +++L
Subjt:  EPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL

Q7Y211 Pentatricopeptide repeat-containing protein At3g57430, chloroplastic2.4e-11431.78Show/hide
Query:  EILQLYHDIRISGAHLTDPSVFPSILKACSNL-SFKLGTAMHGCLIKQGY-ESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNR
        E +  Y D+ + G    D   FP++LKA ++L   +LG  +H  + K GY   S ++AN+ ++LY K G+  +  + FD +  ++ VSWN ++    S  
Subjt:  EILQLYHDIRISGAHLTDPSVFPSILKACSNL-SFKLGTAMHGCLIKQGY-ESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNR

Query:  SAEAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDL---KSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGF
          E  L  F      + +P+  +LV V+ A  +L   +    G  +H Y +R G      + N+L+++Y ++   ++ K+        D+V+W+ ++   
Subjt:  SAEAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDL---KSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGF

Query:  VQIGEDERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRG-WEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSML
         Q  +    LE  R MV E G+ PD  T+ SVL AC++L  +  G+ +H +A+  G  +++ FVG++L+DMY  C   +S  ++F  M +R I  WN+M+
Subjt:  VQIGEDERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRG-WEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSML

Query:  STYVLNEKHLEAVAL-LGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWST
        + Y  NE   EA+ L +G     G+  +  T   V+          + +++HG +++RG + +  + N+++D Y++   +++A  +F  M+ +D+V W+T
Subjt:  STYVLNEKHLEAVAL-LGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWST

Query:  MLAGFVHCGQPDQAISVFKQMNK------------EVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFN
        M+ G+V     + A+ +  +M               +KPN +++M ++ +CA  + L + K  H  A++  LA DVAVG+A++DMY+KCG ++ S + F+
Subjt:  MLAGFVHCGQPDQAISVFKQMNK------------EVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFN

Query:  QIPEKNVVCWSAMISAYGINGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALEL
        QIP+KNV+ W+ +I AYG++G   E++  L  M   G+KPN VT +S+ +ACSH G+++EG+  F  M+ ++GVEP  +HY+C +D+L RAG+  +A +L
Subjt:  QIPEKNVVCWSAMISAYGINGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALEL

Query:  IEKMPGQMEAGASIWGTLLSSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDE
        +  MP      A  W +LL + R + N+ +G  AA ++++LEP+ +  Y+L +N+Y++ GL  ++  +RR  KE+GV+   G S +    E  +FVAGD 
Subjt:  IEKMPGQMEAGASIWGTLLSSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDE

Query:  LHPRAYEI
         HP++ ++
Subjt:  LHPRAYEI

Q9SII7 Pentatricopeptide repeat-containing protein At2g172101.7e-19250.36Show/hide
Query:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKL-GTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRS
        E++  Y +I+ +G    DP VFP + KAC+ LS+   G  +   L+K+G+ES  S+ NS  D YMK G+L S  R FD + ++DSVSWNV+V G      
Subjt:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKL-GTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRS

Query:  AEAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIG
         E GLWWF K R   F+PN S+LVLV+ A R L  + +G  +HGY IR GFC I SVQNS+L +YA+ +  SA K+FDEMS R DV+SWSV+I  +VQ  
Subjt:  AEAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIG

Query:  EDERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWE-DDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYV
        E   GL++F+ MV EA   PD VTV SVLKACT + DI +GR VHGF+I RG++  D+FV NSLIDMYSK  D  SAF++F E   RNIVSWNS+L+ +V
Subjt:  EDERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWE-DDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYV

Query:  LNEKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGF
         N+++ EA+ +   MV+E VE DEVT V++L++ + F  PL CKS+HG+IIRRG+ESN   ++S+IDAY  C+LV+ AGT+ D M  KDVV+ STM++G 
Subjt:  LNEKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGF

Query:  VHCGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLA-GDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISA
         H G+ D+AIS+F  M     PN +++++L+ AC+VSA+LR SKWAHGIA+RR LA  D++VGT+I+D Y+KCG I+ + R F+QI EKN++ W+ +ISA
Subjt:  VHCGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLA-GDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISA

Query:  YGINGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSM-EKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIW
        Y INGL  ++L   ++MKQ G  PNAVT L+ LSAC+HGGL+++G+  F SM E++H  +P L+HYSC +DML+RAG+ + A+ELI+ +P  ++AGAS W
Subjt:  YGINGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSM-EKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIW

Query:  GTLLSSCRS-YGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQ
        G +LS CR+ +  +++ S   + +L LEP  S GY+LAS+ +A      + A MRRL KER V+VVAGYS+V   +   RF+AGD+L     E+  +V  
Subjt:  GTLLSSCRS-YGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQ

Query:  L
        L
Subjt:  L

Q9STE1 Pentatricopeptide repeat-containing protein At4g213004.6e-11333.09Show/hide
Query:  DPSVFPSILKACSNL-SFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSAEAGLWWFMKGRFSHFQ
        D S FP ++KAC  L +FK    +   +   G + +  +A+S I  Y+++G++    + FD +  KD V WNVM++G     + ++ +  F   R     
Subjt:  DPSVFPSILKACSNL-SFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSAEAGLWWFMKGRFSHFQ

Query:  PNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEV-NMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGEDERGLEMFRNMVTEA
        PN  +   VL           G  LHG  +  G     S++NSLLS+Y++      A K+F  MS R D V+W+ MI G+VQ G  E  L  F  M++ +
Subjt:  PNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEV-NMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGEDERGLEMFRNMVTEA

Query:  GITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLNEKHLEAVALLGTMVE
        G+ PD +T  S+L + +   ++   + +H + +      D+F+ ++LID Y KC     A  IF +    ++V + +M+S Y+ N  +++++ +   +V+
Subjt:  GITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLNEKHLEAVALLGTMVE

Query:  EGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVHCGQPDQAISVFKQMN
          +  +E+T V++L ++   L     + +HG II++G+++  ++  +VID YAKC  + LA  +F+ + K+D+V+W++M+        P  AI +F+QM 
Subjt:  EGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVHCGQPDQAISVFKQMN

Query:  -KEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGINGLAHESLVSLEKM
           +  + VSI   + ACA        K  HG  ++  LA DV   + +IDMY+KCG++KA++  F  + EKN+V W+++I+A G +G   +SL    +M
Subjt:  -KEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGINGLAHESLVSLEKM

Query:  -KQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLLSSCRSYGNVVLGS
         +++G++P+ +T L ++S+C H G ++EG+ FF SM +++G++P  EHY+C +D+  RAG+  +A E ++ MP   +AG  +WGTLL +CR + NV L  
Subjt:  -KQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLLSSCRSYGNVVLGS

Query:  GAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL
         A+S ++ L+PS+S  Y+L SN +AN        ++R L KER V+ + GYS + IN  T  FV+GD  HP +  IY +++ L
Subjt:  GAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL

Q9ZUW3 Pentatricopeptide repeat-containing protein At2g276107.3e-11132.67Show/hide
Query:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKL-GTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRS
        E  +L+ +I   G  + D S+F S+LK  + L  +L G  +H   IK G+    S+  S +D YMK       ++ FD +K ++ V+W  ++ G   N  
Subjt:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKL-GTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRS

Query:  AEAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEV-NMCSAHKMFDEMSYRNDVVSWSVMIGGFVQI
         +  L  FM+ +    QPN  +    L    +      G  +H   ++ G    I V NSL++LY +  N+  A  +FD+   ++ VV+W+ MI G+   
Subjt:  AEAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEV-NMCSAHKMFDEMSYRNDVVSWSVMIGGFVQI

Query:  GEDERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMP-ERNIVSWNSMLSTY
        G D   L MF +M     +     +  SV+K C NL+++     +H   +  G+  D  +  +L+  YSKC   + A ++FKE+    N+VSW +M+S +
Subjt:  GEDERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMP-ERNIVSWNSMLSTY

Query:  VLNEKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAG
        + N+   EAV L   M  +GV  +E T+  +L      L  +    VH  +++  +E + ++  +++DAY K   VE A  +F G+  KD+VAWS MLAG
Subjt:  VLNEKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAG

Query:  FVHCGQPDQAISVFKQMNK-EVKPNEVSIMNLMEAC-AVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMI
        +   G+ + AI +F ++ K  +KPNE +  +++  C A +A + + K  HG A++  L   + V +A++ MY+K G+I+++   F +  EK++V W++MI
Subjt:  FVHCGQPDQAISVFKQMNK-EVKPNEVSIMNLMEAC-AVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMI

Query:  SAYGINGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASI
        S Y  +G A ++L   ++MK+  +K + VT + + +AC+H GL+EEG  +F  M ++  + P  EH SC +D+ +RAG+   A+++IE MP    AG++I
Subjt:  SAYGINGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASI

Query:  WGTLLSSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQ
        W T+L++CR +    LG  AA  I+ ++P  S  Y+L SN+YA  G   E A++R+L  ER VK   GYS + + ++T+ F+AGD  HP   +IY+ ++ 
Subjt:  WGTLLSSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQ

Query:  L
        L
Subjt:  L

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein2.0e-11131.75Show/hide
Query:  ILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSAEAGLWWFMKGRFSHFQPNVSSLVL
        +L+ CS+L  K    +   + K G           + L+ ++G +  A R F+ + +K +V ++ M+ G       +  L +F++ R+   +P V +   
Subjt:  ILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSAEAGLWWFMKGRFSHFQPNVSSLVL

Query:  VLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCS-AHKMFDEMSYRNDVVSWSVMIGGFVQIGEDERGLEMFRNMVTEAGITPDGVT
        +L    D    R G  +HG  ++ GF   +     L ++YA+    + A K+FD M  R D+VSW+ ++ G+ Q G     LEM ++M  E  + P  +T
Subjt:  VLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCS-AHKMFDEMSYRNDVVSWSVMIGGFVQIGEDERGLEMFRNMVTEAGITPDGVT

Query:  VVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLNEKHLEAVALLGTMVEEGVEKDEV
        +VSVL A + LR IS+G+ +HG+A+  G++  + +  +L+DMY+KC    +A ++F  M ERN+VSWNSM+  YV NE   EA+ +   M++EGV+  +V
Subjt:  VVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLNEKHLEAVALLGTMVEEGVEKDEV

Query:  TFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVHCGQPDQAISVFKQM-NKEVKPNE
        + +  L       D  + + +H + +  G + N S+VNS+I  Y KC  V+ A ++F  ++ + +V+W+ M+ GF   G+P  A++ F QM ++ VKP+ 
Subjt:  TFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVHCGQPDQAISVFKQM-NKEVKPNE

Query:  VSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGINGLAHESLVSLEKMKQNGMKPN
         + ++++ A A  +    +KW HG+ +R  L  +V V TA++DMY+KCG I  +   F+ + E++V  W+AMI  YG +G    +L   E+M++  +KPN
Subjt:  VSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGINGLAHESLVSLEKMKQNGMKPN

Query:  AVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLLSSCRSYGNVVLGSGAASHILRL
         VT LS++SACSH GL+E G+  F  M++ + +E  ++HY   +D+L RAG+ N+A + I +MP  ++   +++G +L +C+ + NV     AA  +  L
Subjt:  AVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLLSSCRSYGNVVLGSGAASHILRL

Query:  EPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL
         P     ++L +N+Y    +  +  ++R     +G++   G S+V I +E   F +G   HP + +IY  +++L
Subjt:  EPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL

AT2G17210.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.8e-18749.71Show/hide
Query:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSA
        E++  Y +I+ +G    DP VFP + KAC+ LS+          + QG        NS  D YMK G+L S  R FD + ++DSVSWNV+V G       
Subjt:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSA

Query:  EAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGE
        E GLWWF K R   F+PN S+LVLV+ A R L  + +G  +HGY IR GFC I SVQNS+L +YA+ +  SA K+FDEMS R DV+SWSV+I  +VQ  E
Subjt:  EAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGE

Query:  DERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWE-DDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVL
           GL++F+ MV EA   PD VTV SVLKACT + DI +GR VHGF+I RG++  D+FV NSLIDMYSK  D  SAF++F E   RNIVSWNS+L+ +V 
Subjt:  DERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWE-DDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVL

Query:  NEKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFV
        N+++ EA+ +   MV+E VE DEVT V++L++ + F  PL CKS+HG+IIRRG+ESN   ++S+IDAY  C+LV+ AGT+ D M  KDVV+ STM++G  
Subjt:  NEKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFV

Query:  HCGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLA-GDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAY
        H G+ D+AIS+F  M     PN +++++L+ AC+VSA+LR SKWAHGIA+RR LA  D++VGT+I+D Y+KCG I+ + R F+QI EKN++ W+ +ISAY
Subjt:  HCGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLA-GDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAY

Query:  GINGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSM-EKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWG
         INGL  ++L   ++MKQ G  PNAVT L+ LSAC+HGGL+++G+  F SM E++H  +P L+HYSC +DML+RAG+ + A+ELI+ +P  ++AGAS WG
Subjt:  GINGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSM-EKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWG

Query:  TLLSSCRS-YGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL
         +LS CR+ +  +++ S   + +L LEP  S GY+LAS+ +A      + A MRRL KER V+VVAGYS+V   +   RF+AGD+L     E+  +V  L
Subjt:  TLLSSCRS-YGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL

AT2G27610.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.2e-11232.67Show/hide
Query:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKL-GTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRS
        E  +L+ +I   G  + D S+F S+LK  + L  +L G  +H   IK G+    S+  S +D YMK       ++ FD +K ++ V+W  ++ G   N  
Subjt:  EILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKL-GTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRS

Query:  AEAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEV-NMCSAHKMFDEMSYRNDVVSWSVMIGGFVQI
         +  L  FM+ +    QPN  +    L    +      G  +H   ++ G    I V NSL++LY +  N+  A  +FD+   ++ VV+W+ MI G+   
Subjt:  AEAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEV-NMCSAHKMFDEMSYRNDVVSWSVMIGGFVQI

Query:  GEDERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMP-ERNIVSWNSMLSTY
        G D   L MF +M     +     +  SV+K C NL+++     +H   +  G+  D  +  +L+  YSKC   + A ++FKE+    N+VSW +M+S +
Subjt:  GEDERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMP-ERNIVSWNSMLSTY

Query:  VLNEKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAG
        + N+   EAV L   M  +GV  +E T+  +L      L  +    VH  +++  +E + ++  +++DAY K   VE A  +F G+  KD+VAWS MLAG
Subjt:  VLNEKHLEAVALLGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAG

Query:  FVHCGQPDQAISVFKQMNK-EVKPNEVSIMNLMEAC-AVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMI
        +   G+ + AI +F ++ K  +KPNE +  +++  C A +A + + K  HG A++  L   + V +A++ MY+K G+I+++   F +  EK++V W++MI
Subjt:  FVHCGQPDQAISVFKQMNK-EVKPNEVSIMNLMEAC-AVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMI

Query:  SAYGINGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASI
        S Y  +G A ++L   ++MK+  +K + VT + + +AC+H GL+EEG  +F  M ++  + P  EH SC +D+ +RAG+   A+++IE MP    AG++I
Subjt:  SAYGINGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASI

Query:  WGTLLSSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQ
        W T+L++CR +    LG  AA  I+ ++P  S  Y+L SN+YA  G   E A++R+L  ER VK   GYS + + ++T+ F+AGD  HP   +IY+ ++ 
Subjt:  WGTLLSSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQ

Query:  L
        L
Subjt:  L

AT3G57430.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.7e-11531.78Show/hide
Query:  EILQLYHDIRISGAHLTDPSVFPSILKACSNL-SFKLGTAMHGCLIKQGY-ESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNR
        E +  Y D+ + G    D   FP++LKA ++L   +LG  +H  + K GY   S ++AN+ ++LY K G+  +  + FD +  ++ VSWN ++    S  
Subjt:  EILQLYHDIRISGAHLTDPSVFPSILKACSNL-SFKLGTAMHGCLIKQGY-ESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNR

Query:  SAEAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDL---KSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGF
          E  L  F      + +P+  +LV V+ A  +L   +    G  +H Y +R G      + N+L+++Y ++   ++ K+        D+V+W+ ++   
Subjt:  SAEAGLWWFMKGRFSHFQPNVSSLVLVLLAFRDL---KSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGF

Query:  VQIGEDERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRG-WEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSML
         Q  +    LE  R MV E G+ PD  T+ SVL AC++L  +  G+ +H +A+  G  +++ FVG++L+DMY  C   +S  ++F  M +R I  WN+M+
Subjt:  VQIGEDERGLEMFRNMVTEAGITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRG-WEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSML

Query:  STYVLNEKHLEAVAL-LGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWST
        + Y  NE   EA+ L +G     G+  +  T   V+          + +++HG +++RG + +  + N+++D Y++   +++A  +F  M+ +D+V W+T
Subjt:  STYVLNEKHLEAVAL-LGTMVEEGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWST

Query:  MLAGFVHCGQPDQAISVFKQMNK------------EVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFN
        M+ G+V     + A+ +  +M               +KPN +++M ++ +CA  + L + K  H  A++  LA DVAVG+A++DMY+KCG ++ S + F+
Subjt:  MLAGFVHCGQPDQAISVFKQMNK------------EVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFN

Query:  QIPEKNVVCWSAMISAYGINGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALEL
        QIP+KNV+ W+ +I AYG++G   E++  L  M   G+KPN VT +S+ +ACSH G+++EG+  F  M+ ++GVEP  +HY+C +D+L RAG+  +A +L
Subjt:  QIPEKNVVCWSAMISAYGINGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALEL

Query:  IEKMPGQMEAGASIWGTLLSSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDE
        +  MP      A  W +LL + R + N+ +G  AA ++++LEP+ +  Y+L +N+Y++ GL  ++  +RR  KE+GV+   G S +    E  +FVAGD 
Subjt:  IEKMPGQMEAGASIWGTLLSSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDE

Query:  LHPRAYEI
         HP++ ++
Subjt:  LHPRAYEI

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.3e-11433.09Show/hide
Query:  DPSVFPSILKACSNL-SFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSAEAGLWWFMKGRFSHFQ
        D S FP ++KAC  L +FK    +   +   G + +  +A+S I  Y+++G++    + FD +  KD V WNVM++G     + ++ +  F   R     
Subjt:  DPSVFPSILKACSNL-SFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSAEAGLWWFMKGRFSHFQ

Query:  PNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEV-NMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGEDERGLEMFRNMVTEA
        PN  +   VL           G  LHG  +  G     S++NSLLS+Y++      A K+F  MS R D V+W+ MI G+VQ G  E  L  F  M++ +
Subjt:  PNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEV-NMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGEDERGLEMFRNMVTEA

Query:  GITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLNEKHLEAVALLGTMVE
        G+ PD +T  S+L + +   ++   + +H + +      D+F+ ++LID Y KC     A  IF +    ++V + +M+S Y+ N  +++++ +   +V+
Subjt:  GITPDGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLNEKHLEAVALLGTMVE

Query:  EGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVHCGQPDQAISVFKQMN
          +  +E+T V++L ++   L     + +HG II++G+++  ++  +VID YAKC  + LA  +F+ + K+D+V+W++M+        P  AI +F+QM 
Subjt:  EGVEKDEVTFVNVLQIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVHCGQPDQAISVFKQMN

Query:  -KEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGINGLAHESLVSLEKM
           +  + VSI   + ACA        K  HG  ++  LA DV   + +IDMY+KCG++KA++  F  + EKN+V W+++I+A G +G   +SL    +M
Subjt:  -KEVKPNEVSIMNLMEACAVSAELRRSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGINGLAHESLVSLEKM

Query:  -KQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLLSSCRSYGNVVLGS
         +++G++P+ +T L ++S+C H G ++EG+ FF SM +++G++P  EHY+C +D+  RAG+  +A E ++ MP   +AG  +WGTLL +CR + NV L  
Subjt:  -KQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSMEKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLLSSCRSYGNVVLGS

Query:  GAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL
         A+S ++ L+PS+S  Y+L SN +AN        ++R L KER V+ + GYS + IN  T  FV+GD  HP +  IY +++ L
Subjt:  GAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVKVVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CTAGAAATTCTCCAACTTTACCACGACATTAGAATCTCTGGAGCTCATTTGACAGACCCTTCAGTGTTCCCTTCGATTCTCAAGGCCTGTTCAAATCTCTCTTTCAAACT
TGGAACCGCTATGCACGGATGCCTGATCAAACAGGGGTACGAATCTTCCACTTCCATCGCTAATTCCACCATTGACTTGTATATGAAGTGGGGTGAATTAGTTTCTGCTC
AGCGCGCATTTGATTCTCTGAAGAACAAAGATTCAGTATCTTGGAACGTGATGGTTCATGGGAATTTCTCGAACAGGAGCGCAGAGGCAGGTCTGTGGTGGTTTATGAAG
GGTAGATTTTCCCATTTTCAGCCCAATGTTTCTTCGTTGGTACTTGTACTTCTCGCATTTCGTGATCTTAAATCATACAGGGAAGGTTTTGCGCTTCATGGTTATGCAAT
TCGCTGTGGATTTTGTGCCATTATTTCGGTTCAAAACTCTCTGTTGAGCTTGTATGCTGAAGTTAATATGTGTTCTGCCCACAAGATGTTTGATGAAATGTCCTATAGAA
ACGATGTCGTTTCCTGGAGTGTGATGATAGGAGGTTTCGTACAAATTGGGGAAGATGAACGTGGGTTGGAGATGTTTCGGAATATGGTAACAGAGGCTGGCATTACACCA
GATGGGGTAACTGTTGTAAGTGTTCTTAAAGCTTGCACCAACTTGAGAGATATTTCACTTGGAAGAATGGTTCATGGGTTTGCGATTTGTAGAGGCTGGGAGGATGATTT
GTTTGTTGGGAACTCCTTGATAGACATGTATTCCAAGTGTTGTGATGCTGTTTCTGCCTTTAAAATTTTCAAGGAGATGCCAGAAAGGAACATCGTCTCATGGAATTCAA
TGTTGTCGACGTATGTCCTCAATGAGAAGCATTTGGAAGCTGTGGCATTGCTTGGGACAATGGTGGAAGAAGGGGTTGAGAAAGATGAGGTGACTTTTGTAAATGTTCTT
CAGATAGTTGAGCATTTTCTTGACCCATTGCAATGCAAGTCTGTCCACGGCATGATTATCCGACGGGGGTGGGAATCAAACGGATCACTGGTGAATTCAGTAATCGATGC
TTATGCAAAATGCAATCTGGTTGAGCTTGCAGGTACACTTTTTGATGGGATGAAGAAGAAGGATGTTGTTGCTTGGAGCACTATGCTTGCAGGGTTTGTCCACTGTGGCC
AGCCTGACCAAGCGATATCGGTCTTCAAGCAAATGAACAAAGAGGTGAAACCGAATGAGGTTTCAATTATGAATCTTATGGAGGCTTGCGCTGTCTCTGCTGAATTGAGA
CGATCGAAATGGGCTCACGGTATAGCGGTTAGGAGAGGTTTGGCTGGTGATGTAGCTGTTGGAACTGCCATTATTGACATGTACTCAAAATGTGGAGATATAAAAGCCTC
CATCCGAGCCTTCAACCAAATCCCAGAAAAAAACGTCGTGTGTTGGAGTGCTATGATATCTGCATACGGCATCAACGGTCTCGCACATGAATCATTAGTCTCGCTTGAGA
AAATGAAACAAAATGGCATGAAGCCAAATGCTGTAACAACGCTGTCATTGTTATCTGCTTGTAGCCATGGAGGATTACTAGAAGAAGGCATCTCTTTCTTCACATCCATG
GAGAAGGAACATGGAGTAGAGCCTGGTTTGGAACACTACTCGTGCGCCATCGACATGCTAGCCCGAGCGGGGAAATTCAACGATGCTTTAGAATTGATTGAGAAGATGCC
TGGACAAATGGAAGCAGGTGCAAGCATTTGGGGGACACTCTTGAGCTCTTGTAGGAGCTATGGAAACGTTGTGCTTGGCTCAGGAGCAGCCTCTCACATTCTTCGACTGG
AACCTTCGAGCTCGGTTGGCTACATGCTGGCATCGAACTTGTATGCGAACTGCGGTCTAATGACCGAATCTGCAAGAATGAGAAGGCTGGCGAAAGAGAGAGGAGTGAAG
GTTGTTGCTGGATATAGTTTGGTGCATATAAATTCAGAGACTTGGAGATTTGTTGCTGGAGATGAGCTCCACCCAAGAGCTTATGAGATCTATTTAATGGTGGATCAATT
G
mRNA sequenceShow/hide mRNA sequence
CTAGAAATTCTCCAACTTTACCACGACATTAGAATCTCTGGAGCTCATTTGACAGACCCTTCAGTGTTCCCTTCGATTCTCAAGGCCTGTTCAAATCTCTCTTTCAAACT
TGGAACCGCTATGCACGGATGCCTGATCAAACAGGGGTACGAATCTTCCACTTCCATCGCTAATTCCACCATTGACTTGTATATGAAGTGGGGTGAATTAGTTTCTGCTC
AGCGCGCATTTGATTCTCTGAAGAACAAAGATTCAGTATCTTGGAACGTGATGGTTCATGGGAATTTCTCGAACAGGAGCGCAGAGGCAGGTCTGTGGTGGTTTATGAAG
GGTAGATTTTCCCATTTTCAGCCCAATGTTTCTTCGTTGGTACTTGTACTTCTCGCATTTCGTGATCTTAAATCATACAGGGAAGGTTTTGCGCTTCATGGTTATGCAAT
TCGCTGTGGATTTTGTGCCATTATTTCGGTTCAAAACTCTCTGTTGAGCTTGTATGCTGAAGTTAATATGTGTTCTGCCCACAAGATGTTTGATGAAATGTCCTATAGAA
ACGATGTCGTTTCCTGGAGTGTGATGATAGGAGGTTTCGTACAAATTGGGGAAGATGAACGTGGGTTGGAGATGTTTCGGAATATGGTAACAGAGGCTGGCATTACACCA
GATGGGGTAACTGTTGTAAGTGTTCTTAAAGCTTGCACCAACTTGAGAGATATTTCACTTGGAAGAATGGTTCATGGGTTTGCGATTTGTAGAGGCTGGGAGGATGATTT
GTTTGTTGGGAACTCCTTGATAGACATGTATTCCAAGTGTTGTGATGCTGTTTCTGCCTTTAAAATTTTCAAGGAGATGCCAGAAAGGAACATCGTCTCATGGAATTCAA
TGTTGTCGACGTATGTCCTCAATGAGAAGCATTTGGAAGCTGTGGCATTGCTTGGGACAATGGTGGAAGAAGGGGTTGAGAAAGATGAGGTGACTTTTGTAAATGTTCTT
CAGATAGTTGAGCATTTTCTTGACCCATTGCAATGCAAGTCTGTCCACGGCATGATTATCCGACGGGGGTGGGAATCAAACGGATCACTGGTGAATTCAGTAATCGATGC
TTATGCAAAATGCAATCTGGTTGAGCTTGCAGGTACACTTTTTGATGGGATGAAGAAGAAGGATGTTGTTGCTTGGAGCACTATGCTTGCAGGGTTTGTCCACTGTGGCC
AGCCTGACCAAGCGATATCGGTCTTCAAGCAAATGAACAAAGAGGTGAAACCGAATGAGGTTTCAATTATGAATCTTATGGAGGCTTGCGCTGTCTCTGCTGAATTGAGA
CGATCGAAATGGGCTCACGGTATAGCGGTTAGGAGAGGTTTGGCTGGTGATGTAGCTGTTGGAACTGCCATTATTGACATGTACTCAAAATGTGGAGATATAAAAGCCTC
CATCCGAGCCTTCAACCAAATCCCAGAAAAAAACGTCGTGTGTTGGAGTGCTATGATATCTGCATACGGCATCAACGGTCTCGCACATGAATCATTAGTCTCGCTTGAGA
AAATGAAACAAAATGGCATGAAGCCAAATGCTGTAACAACGCTGTCATTGTTATCTGCTTGTAGCCATGGAGGATTACTAGAAGAAGGCATCTCTTTCTTCACATCCATG
GAGAAGGAACATGGAGTAGAGCCTGGTTTGGAACACTACTCGTGCGCCATCGACATGCTAGCCCGAGCGGGGAAATTCAACGATGCTTTAGAATTGATTGAGAAGATGCC
TGGACAAATGGAAGCAGGTGCAAGCATTTGGGGGACACTCTTGAGCTCTTGTAGGAGCTATGGAAACGTTGTGCTTGGCTCAGGAGCAGCCTCTCACATTCTTCGACTGG
AACCTTCGAGCTCGGTTGGCTACATGCTGGCATCGAACTTGTATGCGAACTGCGGTCTAATGACCGAATCTGCAAGAATGAGAAGGCTGGCGAAAGAGAGAGGAGTGAAG
GTTGTTGCTGGATATAGTTTGGTGCATATAAATTCAGAGACTTGGAGATTTGTTGCTGGAGATGAGCTCCACCCAAGAGCTTATGAGATCTATTTAATGGTGGATCAATT
G
Protein sequenceShow/hide protein sequence
LEILQLYHDIRISGAHLTDPSVFPSILKACSNLSFKLGTAMHGCLIKQGYESSTSIANSTIDLYMKWGELVSAQRAFDSLKNKDSVSWNVMVHGNFSNRSAEAGLWWFMK
GRFSHFQPNVSSLVLVLLAFRDLKSYREGFALHGYAIRCGFCAIISVQNSLLSLYAEVNMCSAHKMFDEMSYRNDVVSWSVMIGGFVQIGEDERGLEMFRNMVTEAGITP
DGVTVVSVLKACTNLRDISLGRMVHGFAICRGWEDDLFVGNSLIDMYSKCCDAVSAFKIFKEMPERNIVSWNSMLSTYVLNEKHLEAVALLGTMVEEGVEKDEVTFVNVL
QIVEHFLDPLQCKSVHGMIIRRGWESNGSLVNSVIDAYAKCNLVELAGTLFDGMKKKDVVAWSTMLAGFVHCGQPDQAISVFKQMNKEVKPNEVSIMNLMEACAVSAELR
RSKWAHGIAVRRGLAGDVAVGTAIIDMYSKCGDIKASIRAFNQIPEKNVVCWSAMISAYGINGLAHESLVSLEKMKQNGMKPNAVTTLSLLSACSHGGLLEEGISFFTSM
EKEHGVEPGLEHYSCAIDMLARAGKFNDALELIEKMPGQMEAGASIWGTLLSSCRSYGNVVLGSGAASHILRLEPSSSVGYMLASNLYANCGLMTESARMRRLAKERGVK
VVAGYSLVHINSETWRFVAGDELHPRAYEIYLMVDQL