; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi11G015470 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi11G015470
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr11:23817757..23826115
RNA-Seq ExpressionLsi11G015470
SyntenyLsi11G015470
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031579.1 putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0077.2Show/hide
Query:  MITKLRNW-NKLIPNLL-QTPQ-QSNPLASLFCTKTLSPVFSSTSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNH
        M+ KLR+W N LIPNLL QT + QSN   SLFCTKTLS  FSST PP ST L ++I+ IRDPKISVIPVLEKWVGDG+AI K ELQ LVYL K+FRRFNH
Subjt:  MITKLRNW-NKLIPNLL-QTPQ-QSNPLASLFCTKTLSPVFSSTSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNH

Query:  ALEISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIG
        ALEISQWMTDRRY +LS+SDAA+RLDLI  VHGLEHAE+YFNSIS++LKT N YG+LL  YVREKS+EKAEAIMQEMRKMG+A TSF YNVLINLYAQIG
Subjt:  ALEISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIG

Query:  QHDKIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLL
        QH+KIDLLI+EMKMKGIPQDIY+IRNLCAAYVAKTDISGMEKIL+RIEEDSE KADWRIYSIAA+GYL+AGLET+ALSML KME+KI PN NK AFEFLL
Subjt:  QHDKIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLL

Query:  SLYERTGRKDELYRVWSTFKPLIRQTHVPYALMITSLAKLDDIEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWS
        SLYERTG K+E+YRVW+TFKPL RQT VPYALMITSLAKLDD+EGA+RIFQEWESKCT YDFRVLNRL+VAYCRKGL DKAE  VN+AVVGRTP+ASTWS
Subjt:  SLYERTGRKDELYRVWSTFKPLIRQTHVPYALMITSLAKLDDIEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWS

Query:  VLAMGYAEHGRMSKAVEMLKKAMLVGRQDWKPK-RDILEACLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSA
        +LA GYAE+G MSKAVEMLKKAMLVGRQ+WKPK RDILEACLDYLE+QGDAETMEE++RLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGF+A
Subjt:  VLAMGYAEHGRMSKAVEMLKKAMLVGRQDWKPK-RDILEACLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSA

Query:  DEEVDKILGTKTNLGNVYELLWLKCKANSSCNDERNAMKLISLILAYRYNMDCIQSQTAVLSYSPAGTSRRLLSAESDHDEAPLLSAMALLLQFQGDPRI
        DEE +K     T   +V   L  +   +SS  +        S    +  N   +Q+    L YS    ++ L S+ S  D       +   +   GDPRI
Subjt:  DEEVDKILGTKTNLGNVYELLWLKCKANSSCNDERNAMKLISLILAYRYNMDCIQSQTAVLSYSPAGTSRRLLSAESDHDEAPLLSAMALLLQFQGDPRI

Query:  SIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYVEN
        SIVRVLDQW+EEGR+V QSD+Q L+KQLRKF RFNHALQLCEWI NERN++P PGDIA+QLHLISK  GLEQAEKYFSSIRESSRDHKVYGALLNCYVEN
Subjt:  SIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYVEN

Query:  KNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVA
        KNLEKAEAIMQKMREVGFMKTPLSYNVMLSLY+ LGK EK DEL++EMEEMGI HDRFTYNIRMNAYAATS+I NMEKLL KMEAD LV MDWH+YF V 
Subjt:  KNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVA

Query:  NGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDFK
        NGY KAG SEN IL MLK++EQLIGDKQKW AYE LITLY AIGNKDEVYRVWNLY+NL++R+NSGYLC+I+SLMKLDDIDG E+ILKEWESGDTCFDF+
Subjt:  NGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDFK

Query:  IPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIVELLCK
        IPNMMINSYC KGF+DKAEAYISRL+ENGKEP+A  WDRL SGYH+NGLTNKA ETLKKAISVS P WKPN H +AAC+EYLKTNGNVE+AEEI+ LLCK
Subjt:  IPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIVELLCK

Query:  RDIVPLNISNRLEDYIRSENQTSIKCLDQLGLEGQNEKLDNVSD
         DI P NI NRLEDYI SENQTSIKCLD L L+GQ+E LD+  D
Subjt:  RDIVPLNISNRLEDYIRSENQTSIKCLDQLGLEGQNEKLDNVSD

KAF8413891.1 hypothetical protein HHK36_001885 [Tetracentron sinense]6.8e-29951.08Show/hide
Query:  LASLFCTKTLSPVFSSTSP--PTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALEISQWMTDRRYFNLSSSDAAIRL
        L S F  +     FS+ +   P+  +L  +I T RDP++S++PVL+KW+ +G+ + ++ELQSLV  +K+FR+FNHALEISQWM+DRRYF+LS  DAAIRL
Subjt:  LASLFCTKTLSPVFSSTSP--PTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALEISQWMTDRRYFNLSSSDAAIRL

Query:  DLIRRVHGLEHAEHYFNSISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHDKIDLLIQEMKMKGIPQDIYTIR
        +LI +VHGLE AE YF +ISS+LK +  YGALL SYV+EKS++KAEA++Q M++MG  TTSF YNVL+NLY+Q G++ KID+L++EM+ KG+P D+YT++
Subjt:  DLIRRVHGLEHAEHYFNSISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHDKIDLLIQEMKMKGIPQDIYTIR

Query:  NLCAAYVAKTDISGMEKILRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLYERTGRKDELYRVWSTFKPLIRQ
        N  +AYV   DI+GMEKIL R+EED     DW++YSI A+GYL  GL  KAL+MLKK+E        K AFE LL+LY R GRKDELYR+W+ +K L + 
Subjt:  NLCAAYVAKTDISGMEKILRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLYERTGRKDELYRVWSTFKPLIRQ

Query:  THVPYALMITSLAKLDDIEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWSVLAMGYAEHGRMSKAVEMLKKAMLV
            Y  MITSL KLDDIEGA++I++EWES CT +DFRVLNRL+VAYC+KG  DKAE  VN+AV GRTPYASTW++LA+GY E+ +MSKAVEMLKKAMLV
Subjt:  THVPYALMITSLAKLDDIEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWSVLAMGYAEHGRMSKAVEMLKKAMLV

Query:  GRQDWKPKRDILEACLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADEEVDKILGTKTNLG-----------
        GR+ W+P    L+AC++YLE Q D E +EE+ RL ++SG + +E+Y RLLRT +A GKPV+ IL+QMK+DGFSADEE  KIL  + +L            
Subjt:  GRQDWKPKRDILEACLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADEEVDKILGTKTNLG-----------

Query:  ---------------NVYELL------WLK-----------CKANSSCN-------DERNAMKLISLILAYRYNMDCIQSQTAVLSYSPAG---------
                       +V E +      WL+           C+AN++ N       D  N  ++ S + +     +      + + +  AG         
Subjt:  ---------------NVYELL------WLK-----------CKANSSCN-------DERNAMKLISLILAYRYNMDCIQSQTAVLSYSPAG---------

Query:  ---TSRRLLSAESDHDEAPLLSAMALLLQFQ--------------GDPRISIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQ
           +    LS ES    A  L      + F               GDPR+SIV VLD+W+EEGR V + DLQ ++KQ+R + RF HAL++  W+S+ R  
Subjt:  ---TSRRLLSAESDHDEAPLLSAMALLLQFQ--------------GDPRISIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQ

Query:  DPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEE
        +  PGDIA++L LISKVHGLEQAEKYFS+I E  +  +VYGALLNCY   K++EKAEAIMQKMRE+GF+K  L YNV+L+LYS +GK EKLD L++EMEE
Subjt:  DPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEE

Query:  MGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVY
         GI +D+FT+NIR++AYAA S++  +EK++ +ME DP V +DW  Y +VAN Y KAGL + + L MLK+SE+L+  K++  AY   +TLY   G KDE+Y
Subjt:  MGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVY

Query:  RVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLT
        RVWNL    ++ YN+ Y+C+ISSL+KLDDI+G EKIL++WES  T +DF++PN++I SYCKKG ++KAE  I R +  GK+P A+TW+ LA+GY      
Subjt:  RVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLT

Query:  NKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIVELLCKRDIVPLNISNRLEDYIR
         KAVE +KKAI  SQPGWKPN  TLAAC+E LK  G+VE AEE+V LL   D  P+++ +RL +YI+
Subjt:  NKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIVELLCKRDIVPLNISNRLEDYIR

RXH90462.1 hypothetical protein DVH24_035226 [Malus domestica]0.0e+0055.84Show/hide
Query:  KLRNWNKLIPNLLQTPQQSNPLASLFCTKTLSPVFSSTSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALEISQ
        KL   +KLI   LQ P+++    +LF +   S   SS+SP  S +L D+I  IRDPK SV+PVLE+WV +G+A+ KQ+LQSLV L+K FRRFNHALEISQ
Subjt:  KLRNWNKLIPNLLQTPQQSNPLASLFCTKTLSPVFSSTSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALEISQ

Query:  WMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHDKID
        WMTDRRYF+LS SDAA RL+LI RVHGLEHAE+YFN++S  LK+ NAYGALLC YV+E+S+EKAEA MQ+M+KMGMA TSFPYN+LINLY+Q GQ++KI+
Subjt:  WMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHDKID

Query:  LLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLYERT
        +L+QEM+  GIP D YT+RN   AY+A +D+ GME IL R+EED     DW+IYS+AA+GYL  GL  KA+SMLK +E  +P    KS  EFLL+LY  T
Subjt:  LLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLYERT

Query:  GRKDELYRVWSTFKPLIRQTHVPYALMITSLAKLDDIEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWSVLAMGY
        G K+ELYRVW T+KP      VPY  MI+SLAKLDDIEGA+ IF+EWES+C  YDFRVLNRL+VAYC++GLFDKAES VN+AV GR PYASTW+VLA+GY
Subjt:  GRKDELYRVWSTFKPLIRQTHVPYALMITSLAKLDDIEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWSVLAMGY

Query:  AEHGRMSKAVEMLKKAMLVGRQDWKPKRDILEACLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADEEVDKI
         E  +M KAVEMLKKA+ VGR+ W P    L ACLDYLE QGD E +EE+I L K+ G +++++Y+RLLR S+A GK V  IL+QMK+DGF+ADEE  K+
Subjt:  AEHGRMSKAVEMLKKAMLVGRQDWKPKRDILEACLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADEEVDKI

Query:  LGTKTNLGNVYELLWLKCKANSSCNDERNAMKLISLILAYRYNMDCIQSQTAVLSYSP------AGTSRRLL-SAESDHDEAPLLSAMALLLQFQGDPRI
        + T                             LISL L     +    S   +L  +P      +G+SR L  S+++     P    + + +   G+PR+
Subjt:  LGTKTNLGNVYELLWLKCKANSSCNDERNAMKLISLILAYRYNMDCIQSQTAVLSYSP------AGTSRRLL-SAESDHDEAPLLSAMALLLQFQGDPRI

Query:  SIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYVEN
        S+V +L+QWVEEGR+VK+ +LQ  +K  RK+RR++HALQ+ EW+S+ RNQ   PGDIA++L LISKV GL+QAE YF+SI +  R+ KVYGALL  YVEN
Subjt:  SIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYVEN

Query:  KNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVA
        K+ EKAE I +KM E+G++K  ++YN ML+LYS +GKHEKLD L++EMEE GI +D +T  I +N+YAA S I  MEKLL+K++ADPLV +DW+ Y I A
Subjt:  KNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVA

Query:  NGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDFK
        NG+ KAGL E +   ML+RSEQLI +K   FAYE L+TLY  IGNKDEVYR+WN+Y N+   YNSGYLC++SSL+KL DID  E I++EWES    FD +
Subjt:  NGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDFK

Query:  IPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIVELLCK
        IPN++I +YCKK  ++KA+ YI RL E+ KE  A+ W RLA+GYH NG  +KAVET+KKAI  S+ GWK N+ TLAAC+EYLK  G+VEVA+E+  L+ +
Subjt:  IPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIVELLCK

Query:  RDIVPLNISNRLEDYIRSE
         D    ++ ++L+ Y+  E
Subjt:  RDIVPLNISNRLEDYIRSE

XP_002273904.2 PREDICTED: pentatricopeptide repeat-containing protein At2g17140 [Vitis vinifera]4.3e-29353.32Show/hide
Query:  TTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALEISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLK
        ++L D+I  +RDPK S+ P+L +W+ +G+ + K +LQSLV +MK FRRF+HALEISQWMTDRRYF L+ SDAAIRLDLI  VHG   AE YFN+I + LK
Subjt:  TTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALEISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLK

Query:  TYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHDKIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEE
        T +AYGALL  YVREKS+EKAEA MQ+MR+M  AT+SFPYN+LINLY+Q G H KI+ LIQEM+ K IP D +T+RNL  AYVA +DIS MEK L R+EE
Subjt:  TYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHDKIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEE

Query:  DSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLYERTGRKDELYRVWSTFKPLIRQTHVPYALMITSLAKLDDIEGADRI
        D     DW IYS+AASGYL  GL  KAL MLKK+E   P     SAF+FLLSLY RTG K ELYRVW+ +KP        Y+ MIT L KLDDIEGA++I
Subjt:  DSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLYERTGRKDELYRVWSTFKPLIRQTHVPYALMITSLAKLDDIEGADRI

Query:  FQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWSVLAMGYAEHGRMSKAVEMLKKAMLVGRQDWKPKRDILEACLDYLEEQGD
        FQEWE +CT YDFRVLNRL+ AYC++ LFDKAES VN+ +  R PYASTW++LA GY E  +M KAVEMLKKA+ VGR+ W+P   IL+AC++YLE QG+
Subjt:  FQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWSVLAMGYAEHGRMSKAVEMLKKAMLVGRQDWKPKRDILEACLDYLEEQGD

Query:  AETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADEEVDKILGTKTNLGNVYELLWLKCKANSSCNDERNAMKLISLILAYRYN
         E +EE+ RLCK+ G    ++++RLLRTS AG K V +I                                               ++  +++ +  R+ 
Subjt:  AETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADEEVDKILGTKTNLGNVYELLWLKCKANSSCNDERNAMKLISLILAYRYN

Query:  MDCIQSQTAVLSYSPAGTSRRLLSAESDHDEAPLLSAMALLLQFQGDPRISIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQ
        +D   S + V    P  T     +A  +  ++ +  A+        D R+SIV  L+QW +EGR +KQ DL +L+++LR F+R+NHAL++ EWI ++   
Subjt:  MDCIQSQTAVLSYSPAGTSRRLLSAESDHDEAPLLSAMALLLQFQGDPRISIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQ

Query:  DPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEE
        D  PGD+AIQL LISKVHGLEQAEKYF+    S R  +VYGALLNCY + K+LEKAEAIMQ+MR++GF+KT LSYNVML LYS LGKHEKLD LM+EMEE
Subjt:  DPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEE

Query:  MGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVY
         GI  D FTY IR+NAY ATS++  MEKLL+K+E DP V  DW+AY + ANGY KA L E ++  MLK+SEQ I  + + F YE L+TLY  +GNK EVY
Subjt:  MGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVY

Query:  RVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLT
        R+WNLY  + + +N+GY+ ++SSL+KLDD+DG EK  +EW SG+  FDF++PN++I +YCKKG ++KAE  +SR +E G+EP A TWD LA+GYH N   
Subjt:  RVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLT

Query:  NKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIVELLCKRDIVPLNISNRLEDYIRSENQTSIKCLDQLGLEGQNEKLDNVSD
         KAV+TLKKA+  +  GWKPN  TL+AC+EYLK  G+VE AE ++ LL ++ +V    S+RL +YIRSE   S      L     +E LD  +D
Subjt:  NKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIVELLCKRDIVPLNISNRLEDYIRSENQTSIKCLDQLGLEGQNEKLDNVSD

XP_031744657.1 pentatricopeptide repeat-containing protein At5g12100, mitochondrial [Cucumis sativus]0.0e+0075.38Show/hide
Query:  MITKLRNW-NKLIPNLLQTPQQSNPLASLFCTKTLSPVFSSTSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHAL
        MI KLR+W N LI NLL           +  +KTLS  FSST PP    L  KI+ IR PKISV+PVLEKWVGDG+AIGK ELQ LV+LMK  RRFNHAL
Subjt:  MITKLRNW-NKLIPNLLQTPQQSNPLASLFCTKTLSPVFSSTSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHAL

Query:  EISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQH
        EISQWMTDRRY +LS SDAA+RLDLI  VHGLEHAE+YFNSIS +LKT N YGALL  YVREKS+EKAEAIMQEMRKMG+ATTSF YNVLINLYAQIGQH
Subjt:  EISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQH

Query:  DKIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSL
        DKIDLLI+EMK KGIPQDIY+IRNLCAAYVAK DISGMEKIL+RIEEDSE KADW IYSIAA+GYL+AGLET+ALSMLKK EEK+ PN NK AF+FLLSL
Subjt:  DKIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSL

Query:  YERTGRKDELYRVWSTFKPLIRQTHVPYALMITSLAKLDDIEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWSVL
        YERTG K+E+YRVW+TFKPL ++T VPYALMITSLAKLDDIEGA+RIFQEWESKCT YDFRVLNRL+VAYCRKGL DKAES VN+AVV RTP+ STWS+L
Subjt:  YERTGRKDELYRVWSTFKPLIRQTHVPYALMITSLAKLDDIEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWSVL

Query:  AMGYAEHGRMSKAVEMLKKAMLVGRQDWKPKR-DILEACLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADE
        A GYAE+G MSKAVEMLKKA+LVGRQ+WKPK+ DILEACLDYLE+QGDAETM+E++RLCKSSGTV KEMYYRLLRTSIAGGKPV+SILEQMKMDGF+ADE
Subjt:  AMGYAEHGRMSKAVEMLKKAMLVGRQDWKPKR-DILEACLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADE

Query:  EVDKILGTKTNLGNVYELLWLKCKANSSCNDERNAMKLI-------------SLILAYRYNMDCIQSQTAVLSYSPAGTSRRLLSAESDHDEAPLLSAMA
        EVDKILG+KTNL  +  L  +K    +     + A+  +             S++      + C QS    L  S     R L  +            + 
Subjt:  EVDKILGTKTNLGNVYELLWLKCKANSSCNDERNAMKLI-------------SLILAYRYNMDCIQSQTAVLSYSPAGTSRRLLSAESDHDEAPLLSAMA

Query:  LLLQFQGDPRISIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKV
          +   GDPR SIVRVLDQWVEEGR+V QSDLQKL+KQLR F RFNHALQLCEW  NERN+ P PG IAIQLHLISK  GLEQAE+YFSSI ESSRDHKV
Subjt:  LLLQFQGDPRISIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKV

Query:  YGALLNCYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLV
        YGALL+CYVENKNL+KAEAIMQKMREVGFMKTPLSYN ML+LY+ LGKHEKLDEL++EMEEMGI H+RFTYN+RMNAYAA S+ITNMEKLL KMEADPLV
Subjt:  YGALLNCYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLV

Query:  TMDWHAYFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKE
          DWH YF V NGYFKAGLSENSI  MLK++EQLIGDKQKW AYECL+TLY AIGNKDEVYRVWNLYTNL++R+NSGYLCIISSLMKLDDIDG E+ILKE
Subjt:  TMDWHAYFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKE

Query:  WESGDTCFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVE
        WESGDT FDFKIPNMMINSYC KGFVDKAEAYISRL+ENGKEP+A  WDRLASGYH+NGLTNKA ETLKKAISVS P WKPNY  LAAC+EYLKTNGNVE
Subjt:  WESGDTCFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVE

Query:  VAEEIVELLCKRDIVPLNISNRLEDYIRSENQTSIKCLDQLGLEGQNE
        +AEEI+ LLCKRDI PLNI  RLEDYI SENQ SIKCLD LGL+ QNE
Subjt:  VAEEIVELLCKRDIVPLNISNRLEDYIRSENQTSIKCLDQLGLEGQNE

TrEMBL top hitse value%identityAlignment
A0A438JK79 Pentatricopeptide repeat-containing protein, mitochondrial4.6e-28552.41Show/hide
Query:  TTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALEISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLK
        ++L D+I  +RDPK S+ P+L +W+ +G+ + K +LQSLV +MK FRRF+HALEISQWMTDRRYF L+ SDAAIRLDLI  VHG E AE YFN+I + LK
Subjt:  TTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALEISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLK

Query:  TYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHDKIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEE
        T +AYGALL  YVREKS+EKAEA MQ+MR+M  AT+SFPYN+LINLY+Q G H KI+ LIQEM+ K IP D +T+ NL  AYVA +DIS MEK+L R+EE
Subjt:  TYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHDKIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEE

Query:  DSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLYERTGRKDELYRVWSTFKPLIRQTHVPYALMITSLAKLDDIEGADRI
        D     DW IYS+AASGYL  GL  KAL MLKK+E   P     SAF++LLSLY RT  K ELYRVW+ +KP   +    Y+ MIT L KLDDIEGA++I
Subjt:  DSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLYERTGRKDELYRVWSTFKPLIRQTHVPYALMITSLAKLDDIEGADRI

Query:  FQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWSVLAMGYAEHGRMSKAVEMLKKAMLVGRQDWKPKRDILEACLDYLEEQGD
        FQEWE +CT YDFRVLNRL+ AYC++ LFDKAES VN+ +  R PYASTW++LA GY E  +M KAVEMLKKA+ VGR+ W+P   ILEAC++YLE QG+
Subjt:  FQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWSVLAMGYAEHGRMSKAVEMLKKAMLVGRQDWKPKRDILEACLDYLEEQGD

Query:  AETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADEEVDKILGTKTNLGNVYELLWLKCKANSSCNDERNAMKLISLILAYRYN
         E +EE+ RLCK+SG    ++++RLLRTS A                                                                     
Subjt:  AETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADEEVDKILGTKTNLGNVYELLWLKCKANSSCNDERNAMKLISLILAYRYN

Query:  MDCIQSQTAVLSYSPAGTSRRLLSAESDHDEAPLLSAMALLLQFQGDPRISIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQ
         + +QS+      SPA                              D R+SIV  L+QW +EGR +KQ DL +L+++LR F+R+NHAL++ EWI ++   
Subjt:  MDCIQSQTAVLSYSPAGTSRRLLSAESDHDEAPLLSAMALLLQFQGDPRISIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQ

Query:  DPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEE
        D  PGD+AIQL LISKVHGLEQAEKYF+    S R  +VYGALLNCY + K+LEKAEAIMQ+MR++GF+KT LSYNVML LYS LGKHEKLD LM+EMEE
Subjt:  DPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEE

Query:  MGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVY
         GI  D FTY IR+NAY ATS++  MEKLL+K+E DP V  DW+AY + ANGY KA L E ++  MLK+SEQ I  + + F YE L+TLY  +GNK E+Y
Subjt:  MGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVY

Query:  RVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLT
        R+WNLY  + + +N+GY+ ++SSL+KLDD+DG EK  +EW SG+  FDF++PN++I +YCKKG ++KAE  +SR +E G+EP A TWD LA+GYH N   
Subjt:  RVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLT

Query:  NKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIVELLCKRDIVPLNISNRLEDYIRSENQTSIKCLDQLGLEGQNEKLDNVSD
         KAV+TLKKA+  +  GWKPN  TL+AC+EYLK   +VE AE ++ LL ++ +V    S+RL +YIRSE   S      L     +E LD  +D
Subjt:  NKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIVELLCKRDIVPLNISNRLEDYIRSENQTSIKCLDQLGLEGQNEKLDNVSD

A0A498J9D6 Uncharacterized protein0.0e+0055.84Show/hide
Query:  KLRNWNKLIPNLLQTPQQSNPLASLFCTKTLSPVFSSTSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALEISQ
        KL   +KLI   LQ P+++    +LF +   S   SS+SP  S +L D+I  IRDPK SV+PVLE+WV +G+A+ KQ+LQSLV L+K FRRFNHALEISQ
Subjt:  KLRNWNKLIPNLLQTPQQSNPLASLFCTKTLSPVFSSTSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALEISQ

Query:  WMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHDKID
        WMTDRRYF+LS SDAA RL+LI RVHGLEHAE+YFN++S  LK+ NAYGALLC YV+E+S+EKAEA MQ+M+KMGMA TSFPYN+LINLY+Q GQ++KI+
Subjt:  WMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHDKID

Query:  LLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLYERT
        +L+QEM+  GIP D YT+RN   AY+A +D+ GME IL R+EED     DW+IYS+AA+GYL  GL  KA+SMLK +E  +P    KS  EFLL+LY  T
Subjt:  LLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLYERT

Query:  GRKDELYRVWSTFKPLIRQTHVPYALMITSLAKLDDIEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWSVLAMGY
        G K+ELYRVW T+KP      VPY  MI+SLAKLDDIEGA+ IF+EWES+C  YDFRVLNRL+VAYC++GLFDKAES VN+AV GR PYASTW+VLA+GY
Subjt:  GRKDELYRVWSTFKPLIRQTHVPYALMITSLAKLDDIEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWSVLAMGY

Query:  AEHGRMSKAVEMLKKAMLVGRQDWKPKRDILEACLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADEEVDKI
         E  +M KAVEMLKKA+ VGR+ W P    L ACLDYLE QGD E +EE+I L K+ G +++++Y+RLLR S+A GK V  IL+QMK+DGF+ADEE  K+
Subjt:  AEHGRMSKAVEMLKKAMLVGRQDWKPKRDILEACLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADEEVDKI

Query:  LGTKTNLGNVYELLWLKCKANSSCNDERNAMKLISLILAYRYNMDCIQSQTAVLSYSP------AGTSRRLL-SAESDHDEAPLLSAMALLLQFQGDPRI
        + T                             LISL L     +    S   +L  +P      +G+SR L  S+++     P    + + +   G+PR+
Subjt:  LGTKTNLGNVYELLWLKCKANSSCNDERNAMKLISLILAYRYNMDCIQSQTAVLSYSP------AGTSRRLL-SAESDHDEAPLLSAMALLLQFQGDPRI

Query:  SIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYVEN
        S+V +L+QWVEEGR+VK+ +LQ  +K  RK+RR++HALQ+ EW+S+ RNQ   PGDIA++L LISKV GL+QAE YF+SI +  R+ KVYGALL  YVEN
Subjt:  SIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYVEN

Query:  KNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVA
        K+ EKAE I +KM E+G++K  ++YN ML+LYS +GKHEKLD L++EMEE GI +D +T  I +N+YAA S I  MEKLL+K++ADPLV +DW+ Y I A
Subjt:  KNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVA

Query:  NGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDFK
        NG+ KAGL E +   ML+RSEQLI +K   FAYE L+TLY  IGNKDEVYR+WN+Y N+   YNSGYLC++SSL+KL DID  E I++EWES    FD +
Subjt:  NGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDFK

Query:  IPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIVELLCK
        IPN++I +YCKK  ++KA+ YI RL E+ KE  A+ W RLA+GYH NG  +KAVET+KKAI  S+ GWK N+ TLAAC+EYLK  G+VEVA+E+  L+ +
Subjt:  IPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIVELLCK

Query:  RDIVPLNISNRLEDYIRSE
         D    ++ ++L+ Y+  E
Subjt:  RDIVPLNISNRLEDYIRSE

A0A5A7SQP0 Putative pentatricopeptide repeat-containing protein0.0e+0077.2Show/hide
Query:  MITKLRNW-NKLIPNLL-QTPQ-QSNPLASLFCTKTLSPVFSSTSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNH
        M+ KLR+W N LIPNLL QT + QSN   SLFCTKTLS  FSST PP ST L ++I+ IRDPKISVIPVLEKWVGDG+AI K ELQ LVYL K+FRRFNH
Subjt:  MITKLRNW-NKLIPNLL-QTPQ-QSNPLASLFCTKTLSPVFSSTSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNH

Query:  ALEISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIG
        ALEISQWMTDRRY +LS+SDAA+RLDLI  VHGLEHAE+YFNSIS++LKT N YG+LL  YVREKS+EKAEAIMQEMRKMG+A TSF YNVLINLYAQIG
Subjt:  ALEISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIG

Query:  QHDKIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLL
        QH+KIDLLI+EMKMKGIPQDIY+IRNLCAAYVAKTDISGMEKIL+RIEEDSE KADWRIYSIAA+GYL+AGLET+ALSML KME+KI PN NK AFEFLL
Subjt:  QHDKIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLL

Query:  SLYERTGRKDELYRVWSTFKPLIRQTHVPYALMITSLAKLDDIEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWS
        SLYERTG K+E+YRVW+TFKPL RQT VPYALMITSLAKLDD+EGA+RIFQEWESKCT YDFRVLNRL+VAYCRKGL DKAE  VN+AVVGRTP+ASTWS
Subjt:  SLYERTGRKDELYRVWSTFKPLIRQTHVPYALMITSLAKLDDIEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWS

Query:  VLAMGYAEHGRMSKAVEMLKKAMLVGRQDWKPK-RDILEACLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSA
        +LA GYAE+G MSKAVEMLKKAMLVGRQ+WKPK RDILEACLDYLE+QGDAETMEE++RLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGF+A
Subjt:  VLAMGYAEHGRMSKAVEMLKKAMLVGRQDWKPK-RDILEACLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSA

Query:  DEEVDKILGTKTNLGNVYELLWLKCKANSSCNDERNAMKLISLILAYRYNMDCIQSQTAVLSYSPAGTSRRLLSAESDHDEAPLLSAMALLLQFQGDPRI
        DEE +K     T   +V   L  +   +SS  +        S    +  N   +Q+    L YS    ++ L S+ S  D       +   +   GDPRI
Subjt:  DEEVDKILGTKTNLGNVYELLWLKCKANSSCNDERNAMKLISLILAYRYNMDCIQSQTAVLSYSPAGTSRRLLSAESDHDEAPLLSAMALLLQFQGDPRI

Query:  SIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYVEN
        SIVRVLDQW+EEGR+V QSD+Q L+KQLRKF RFNHALQLCEWI NERN++P PGDIA+QLHLISK  GLEQAEKYFSSIRESSRDHKVYGALLNCYVEN
Subjt:  SIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYVEN

Query:  KNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVA
        KNLEKAEAIMQKMREVGFMKTPLSYNVMLSLY+ LGK EK DEL++EMEEMGI HDRFTYNIRMNAYAATS+I NMEKLL KMEAD LV MDWH+YF V 
Subjt:  KNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVA

Query:  NGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDFK
        NGY KAG SEN IL MLK++EQLIGDKQKW AYE LITLY AIGNKDEVYRVWNLY+NL++R+NSGYLC+I+SLMKLDDIDG E+ILKEWESGDTCFDF+
Subjt:  NGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDFK

Query:  IPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIVELLCK
        IPNMMINSYC KGF+DKAEAYISRL+ENGKEP+A  WDRL SGYH+NGLTNKA ETLKKAISVS P WKPN H +AAC+EYLKTNGNVE+AEEI+ LLCK
Subjt:  IPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIVELLCK

Query:  RDIVPLNISNRLEDYIRSENQTSIKCLDQLGLEGQNEKLDNVSD
         DI P NI NRLEDYI SENQTSIKCLD L L+GQ+E LD+  D
Subjt:  RDIVPLNISNRLEDYIRSENQTSIKCLDQLGLEGQNEKLDNVSD

A0A6J1GLA8 pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like4.4e-25186.44Show/hide
Query:  MITKLRNWNKLIPNLLQTPQQSNPLASLFCTKTLSPVFSSTSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALE
        M+ KLR+WNK IPNLLQ PQ+S PL  LFCTK+ SP+FSST PPTS  LLDKILTIRDPKISVIPVLEKWVGDG+AIGKQELQSLV LMKSFRRFNHALE
Subjt:  MITKLRNWNKLIPNLLQTPQQSNPLASLFCTKTLSPVFSSTSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALE

Query:  ISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHD
        IS+WMTDRRYFNLSSSDAA+RLDLIR VHGLEHAEHYFNSISSQL+T NAYGALLCSYVRE+S+EKAEAIMQEMR +G ATTSFPYNVLINLYAQ+GQH 
Subjt:  ISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHD

Query:  KIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLY
        KIDLLIQEM+MKGIPQDIYT+RNL AAYVA  DISGMEKIL+RIEE+SE +ADWRIYSIAASGYLSAGLET+ALSMLKKMEEKIPP  NKSAFEFLLSLY
Subjt:  KIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLY

Query:  ERTGRKDELYRVWSTFKPLIRQTHVPYALMITSLAKLDDIEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWSVLA
        ER GRKDELYRVWSTFK  I+Q  VPYALMITSL KLDDIEGA+RIFQEWESKCTFYDFRVLNRL+VAYCRKGLFDKAES VNRAV+GRTPYASTWSVLA
Subjt:  ERTGRKDELYRVWSTFKPLIRQTHVPYALMITSLAKLDDIEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWSVLA

Query:  MGYAEHGRMSKAVEMLKKAMLVGRQDWKPKRDILEACLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADEEV
        MGY EHG MSKAVEMLK+AMLVGRQDWKP +DILEACL+YLEEQGDAETMEE+IRLC+SSG++ KEMYYR LRTSIAGGKPVLSIL QM+MDGF ADEEV
Subjt:  MGYAEHGRMSKAVEMLKKAMLVGRQDWKPKRDILEACLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADEEV

Query:  DKILGTKTN
         KILGTKT+
Subjt:  DKILGTKTN

A0A6J1HYW6 pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like2.3e-25287.03Show/hide
Query:  MITKLRNWNKLIPNLLQTPQQSNPLASLFCTKTLSPVFSSTSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALE
        M+ KLR+WNK IPNLLQ+PQ+S PL  LFC K+ SP+FSST PPTS  LLDKILTIRDPKISVIPVLEKWVGDG+AIGKQELQSLV LMKSFRRFNHALE
Subjt:  MITKLRNWNKLIPNLLQTPQQSNPLASLFCTKTLSPVFSSTSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALE

Query:  ISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHD
        IS+WMTDRRYFNLSSSDAA+RLDLIR VHGLEHAEHYFNSISSQL+T NAYGALLCSYVRE+S+EKAEAIMQEMRK+G ATTSFPYNVLINLYAQ+GQH 
Subjt:  ISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHD

Query:  KIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLY
        KIDLLIQEM+MKGIPQDIYT+RNL AAYVA  DISGMEKIL+RIEE+SE +ADWRIYSIAASGYLSAGLET+ALSMLKKMEEKIPP  NKSAFEFLLSLY
Subjt:  KIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLY

Query:  ERTGRKDELYRVWSTFKPLIRQTHVPYALMITSLAKLDDIEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWSVLA
        ERTGRKDELYRVWSTFK  I+Q  VPYALMITSL KLDDIEGA+RIFQEWESKCTFYDFRVLNRL+VAYCRKGLFDKAES VNRAV+GRTPYASTWSVLA
Subjt:  ERTGRKDELYRVWSTFKPLIRQTHVPYALMITSLAKLDDIEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWSVLA

Query:  MGYAEHGRMSKAVEMLKKAMLVGRQDWKPKRDILEACLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADEEV
        MGY EHG MSKAVEMLK+AMLVGRQDWKP +DILEACL+YLEEQGD ETMEE+IRLC+SSGTV KEMYYR LRTSIAGGKPVLSIL QM+MDGFSADEEV
Subjt:  MGYAEHGRMSKAVEMLKKAMLVGRQDWKPKRDILEACLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADEEV

Query:  DKILGTKTN
         KILGTKT+
Subjt:  DKILGTKTN

SwissProt top hitse value%identityAlignment
Q84JR3 Pentatricopeptide repeat-containing protein At4g21705, mitochondrial3.7e-7433.4Show/hide
Query:  TSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALEISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNS
        T+    TTL  KI  + DPK SV P L+ WV  GK +   EL  +V+ ++  +RF HALE+S+WM +      S ++ A+ LDLI RV+G   AE YF +
Subjt:  TSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALEISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNS

Query:  ISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHDKIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKI
        +  Q K    YGALL  YVR++++EK+    ++M++MG  T+S  YN ++ LY  IGQH+K+  +++EMK + +  D Y+ R    A+ A  D+  +   
Subjt:  ISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHDKIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKI

Query:  LRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLYERTGRKDELYRVWSTFKPLI-RQTHVPYALMITSLAKLDD
        LR +E   +   DW  Y++AA  Y+  G   +A+ +LK  E ++   + +  +  L++LY R G+K E+ R+W   K +  R+ +  Y  ++ SL K+D 
Subjt:  LRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLYERTGRKDELYRVWSTFKPLI-RQTHVPYALMITSLAKLDD

Query:  IEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAV-NRAVVGRTPYASTWSVLAMGYAEHGRMSKAVEMLKKAM--LVGRQDWKPKRDILEA
        +  A+ +  EW+S    YDFRV N ++  Y  K + +KAE+ + + A  G+     +W ++A  YAE G +  A + +K A+   VG + W+P   ++ +
Subjt:  IEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAV-NRAVVGRTPYASTWSVLAMGYAEHGRMSKAVEMLKKAM--LVGRQDWKPKRDILEA

Query:  CLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSI-AGGKPVLSILEQMKMDGFSADEEVDKILGTKT
         L ++ ++G  + +E  +   ++   V K+MY+ L++  I  GG+ + ++L++MK D    DEE   IL T++
Subjt:  CLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSI-AGGKPVLSILEQMKMDGFSADEEVDKILGTKT

Q8LPS6 Pentatricopeptide repeat-containing protein At1g021501.2e-6932.63Show/hide
Query:  DHDEAPLL--SAMALLLQFQGDPRISIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPL-PGDIAIQLHLISKVHGLEQAE
        D++  P++  +A+   +     P +    VL+QW + GR++ + +L ++VK+LRK++R N AL++ +W++N   +  L   D AIQL LI KV G+  AE
Subjt:  DHDEAPLL--SAMALLLQFQGDPRISIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPL-PGDIAIQLHLISKVHGLEQAE

Query:  KYFSSIRESSRDHKVYGALLNCYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNIT
        ++F  + E+ +D +VYG+LLN YV  K+ EKAEA++  MR+ G+   PL +NVM++LY +L +++K+D ++ EM++  I  D ++YNI +++  +  ++ 
Subjt:  KYFSSIRESSRDHKVYGALLNCYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNIT

Query:  NMEKLLLKMEADPLVTMDWHAYFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRY-NSGYLCIISS
         ME +  +M++D  +  +W  +  +A  Y K G +E +   + K   ++ G  +    Y  L++LY ++GNK E+YRVW++Y ++     N GY  ++SS
Subjt:  NMEKLLLKMEADPLVTMDWHAYFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRY-NSGYLCIISS

Query:  LMKLDDIDGTEKILKEWESGDTCFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISV-SQPGWKPNY
        L+++ DI+G EK+ +EW    + +D +IPN+++N+Y K   ++ AE     ++E G +P ++TW+ LA G+      ++A+  L+ A S      W+P  
Subjt:  LMKLDDIDGTEKILKEWESGDTCFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISV-SQPGWKPNY

Query:  HTLAACIEYLKTNGNVEVAEEIVELL
          L+   +  +   +V   E ++ELL
Subjt:  HTLAACIEYLKTNGNVEVAEEIVELL

Q9C7F1 Putative pentatricopeptide repeat-containing protein At1g280201.0e-6341.88Show/hide
Query:  IVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYV-EN
        I+ VL+QW ++G +V  S ++ ++K+LR   +   ALQ+ EW+S E+  + +P D A +LHLI  V GLE+AEK+F SI +++R   VY +LLN Y   +
Subjt:  IVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYV-EN

Query:  KNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVA
        K L KAEA  QKMR++G +  P+ YN M+SLYS L   EK++EL+ EM++  +  D  T N  +  Y+A  ++T MEK L K E    + ++WH    +A
Subjt:  KNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVA

Query:  NGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLY-TNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDF
          Y +A  S    + ML+ +EQL+  K    AY+ L+ LY   GN++EV RVW LY + +  R N+GY  +I SL+K+DDI G E+I K WES    FD 
Subjt:  NGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLY-TNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDF

Query:  KIPNMMINSYCKKGFVDKAE
        +IP M+ + Y  +G  +KAE
Subjt:  KIPNMMINSYCKKGFVDKAE

Q9SKU6 Pentatricopeptide repeat-containing protein At2g20710, mitochondrial4.2e-11046.55Show/hide
Query:  GDPRISIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLN
        GDP  SI++VLD W+++G  VK S+L  ++K LRKF RF+HALQ+ +W+S  R  +   GD+AI+L LI+KV GL +AEK+F +I    R++ +YGALLN
Subjt:  GDPRISIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLN

Query:  CYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHA
        CY   K L KAE + Q+M+E+GF+K  L YNVML+LY   GK+  +++L+ EME+  +  D FT N R++AY+  S++  MEK L++ EAD  + +DW  
Subjt:  CYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHA

Query:  YFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDT
        Y   ANGY KAGL+E + L ML++SEQ++  +++  AYE L++ Y A G K+EVYR+W+LY  L   YN+GY+ +IS+L+K+DDI+  EKI++EWE+G +
Subjt:  YFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDT

Query:  CFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIV
         FD +IP+++I  YCKKG ++KAE  ++ L++  +    +TW+RLA GY   G   KAVE  K+AI VS+PGW+P+   L +C++YL+   ++E   +I+
Subjt:  CFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIV

Query:  ELLCKR
         LL +R
Subjt:  ELLCKR

Q9SY07 Pentatricopeptide repeat-containing protein At4g02820, mitochondrial9.0e-6030.72Show/hide
Query:  TLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALEISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLKT
        TL  ++L++   K S +  + KW  +G ++ K EL  +V  ++  +R+ HALEI +WM  +    L + D A+ LDLI ++ GL  AE +F  +  Q++ 
Subjt:  TLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALEISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLKT

Query:  YNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHDKIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEED
        + A  +LL SYV+ K  +KAEA+ ++M + G   +  PYN ++++Y   GQ +K+ +LI+E+K++  P DI T      A+ +  D+ G EK+  + +E+
Subjt:  YNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHDKIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKILRRIEED

Query:  SEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLYERTGRKDELYRVWSTFKPLIRQTH-VPYALMITSLAKLDDIEGADRI
         +   DW  YS+  + Y       KA   LK+M EK+    N+ A+  L+SL+   G KD +   W   K   ++ +   Y  MI+++ KL + E A  +
Subjt:  SEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLYERTGRKDELYRVWSTFKPLIRQTH-VPYALMITSLAKLDDIEGADRI

Query:  FQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAV-VGRTPYASTWSVLAMGYAEHGRMSKAVEMLKKAMLVGRQDWKPKRDILEACLDYLEEQG
        + EWES     D R+ N ++  Y  +      E    R V  G  P  STW +L   Y +   M K ++   KA +   + W     +++     LEEQG
Subjt:  FQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAV-VGRTPYASTWSVLAMGYAEHGRMSKAVEMLKKAMLVGRQDWKPKRDILEACLDYLEEQG

Query:  DAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADEEVDKIL
        + +  E+++ L + +G V  ++Y  LLRT    G+  L + E+M  D    DEE  +++
Subjt:  DAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADEEVDKIL

Arabidopsis top hitse value%identityAlignment
AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.9e-7132.63Show/hide
Query:  DHDEAPLL--SAMALLLQFQGDPRISIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPL-PGDIAIQLHLISKVHGLEQAE
        D++  P++  +A+   +     P +    VL+QW + GR++ + +L ++VK+LRK++R N AL++ +W++N   +  L   D AIQL LI KV G+  AE
Subjt:  DHDEAPLL--SAMALLLQFQGDPRISIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPL-PGDIAIQLHLISKVHGLEQAE

Query:  KYFSSIRESSRDHKVYGALLNCYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNIT
        ++F  + E+ +D +VYG+LLN YV  K+ EKAEA++  MR+ G+   PL +NVM++LY +L +++K+D ++ EM++  I  D ++YNI +++  +  ++ 
Subjt:  KYFSSIRESSRDHKVYGALLNCYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNIT

Query:  NMEKLLLKMEADPLVTMDWHAYFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRY-NSGYLCIISS
         ME +  +M++D  +  +W  +  +A  Y K G +E +   + K   ++ G  +    Y  L++LY ++GNK E+YRVW++Y ++     N GY  ++SS
Subjt:  NMEKLLLKMEADPLVTMDWHAYFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRY-NSGYLCIISS

Query:  LMKLDDIDGTEKILKEWESGDTCFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISV-SQPGWKPNY
        L+++ DI+G EK+ +EW    + +D +IPN+++N+Y K   ++ AE     ++E G +P ++TW+ LA G+      ++A+  L+ A S      W+P  
Subjt:  LMKLDDIDGTEKILKEWESGDTCFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISV-SQPGWKPNY

Query:  HTLAACIEYLKTNGNVEVAEEIVELL
          L+   +  +   +V   E ++ELL
Subjt:  HTLAACIEYLKTNGNVEVAEEIVELL

AT1G28020.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.3e-6541.88Show/hide
Query:  IVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYV-EN
        I+ VL+QW ++G +V  S ++ ++K+LR   +   ALQ+ EW+S E+  + +P D A +LHLI  V GLE+AEK+F SI +++R   VY +LLN Y   +
Subjt:  IVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYV-EN

Query:  KNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVA
        K L KAEA  QKMR++G +  P+ YN M+SLYS L   EK++EL+ EM++  +  D  T N  +  Y+A  ++T MEK L K E    + ++WH    +A
Subjt:  KNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVA

Query:  NGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLY-TNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDF
          Y +A  S    + ML+ +EQL+  K    AY+ L+ LY   GN++EV RVW LY + +  R N+GY  +I SL+K+DDI G E+I K WES    FD 
Subjt:  NGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLY-TNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDF

Query:  KIPNMMINSYCKKGFVDKAE
        +IP M+ + Y  +G  +KAE
Subjt:  KIPNMMINSYCKKGFVDKAE

AT2G20710.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.0e-11146.55Show/hide
Query:  GDPRISIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLN
        GDP  SI++VLD W+++G  VK S+L  ++K LRKF RF+HALQ+ +W+S  R  +   GD+AI+L LI+KV GL +AEK+F +I    R++ +YGALLN
Subjt:  GDPRISIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLN

Query:  CYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHA
        CY   K L KAE + Q+M+E+GF+K  L YNVML+LY   GK+  +++L+ EME+  +  D FT N R++AY+  S++  MEK L++ EAD  + +DW  
Subjt:  CYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHA

Query:  YFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDT
        Y   ANGY KAGL+E + L ML++SEQ++  +++  AYE L++ Y A G K+EVYR+W+LY  L   YN+GY+ +IS+L+K+DDI+  EKI++EWE+G +
Subjt:  YFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDT

Query:  CFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIV
         FD +IP+++I  YCKKG ++KAE  ++ L++  +    +TW+RLA GY   G   KAVE  K+AI VS+PGW+P+   L +C++YL+   ++E   +I+
Subjt:  CFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIV

Query:  ELLCKR
         LL +R
Subjt:  ELLCKR

AT2G20710.2 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-9345.53Show/hide
Query:  ISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDE
        +S  R  +   GD+AI+L LI+KV GL +AEK+F +I    R++ +YGALLNCY   K L KAE + Q+M+E+GF+K  L YNVML+LY   GK+  +++
Subjt:  ISNERNQDPLPGDIAIQLHLISKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDE

Query:  LMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAI
        L+ EME+  +  D FT N R++AY+  S++  MEK L++ EAD  + +DW  Y   ANGY KAGL+E + L ML++SEQ++  +++  AYE L++ Y A 
Subjt:  LMEEMEEMGIAHDRFTYNIRMNAYAATSNITNMEKLLLKMEADPLVTMDWHAYFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAI

Query:  GNKDEVYRVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASG
        G K+EVYR+W+LY  L   YN+GY+ +IS+L+K+DDI+  EKI++EWE+G + FD +IP+++I  YCKKG ++KAE  ++ L++  +    +TW+RLA G
Subjt:  GNKDEVYRVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTEKILKEWESGDTCFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASG

Query:  YHANGLTNKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIVELLCKR
        Y   G   KAVE  K+AI VS+PGW+P+   L +C++YL+   ++E   +I+ LL +R
Subjt:  YHANGLTNKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEIVELLCKR

AT4G21705.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.7e-7533.4Show/hide
Query:  TSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALEISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNS
        T+    TTL  KI  + DPK SV P L+ WV  GK +   EL  +V+ ++  +RF HALE+S+WM +      S ++ A+ LDLI RV+G   AE YF +
Subjt:  TSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALEISQWMTDRRYFNLSSSDAAIRLDLIRRVHGLEHAEHYFNS

Query:  ISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHDKIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKI
        +  Q K    YGALL  YVR++++EK+    ++M++MG  T+S  YN ++ LY  IGQH+K+  +++EMK + +  D Y+ R    A+ A  D+  +   
Subjt:  ISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHDKIDLLIQEMKMKGIPQDIYTIRNLCAAYVAKTDISGMEKI

Query:  LRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLYERTGRKDELYRVWSTFKPLI-RQTHVPYALMITSLAKLDD
        LR +E   +   DW  Y++AA  Y+  G   +A+ +LK  E ++   + +  +  L++LY R G+K E+ R+W   K +  R+ +  Y  ++ SL K+D 
Subjt:  LRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLYERTGRKDELYRVWSTFKPLI-RQTHVPYALMITSLAKLDD

Query:  IEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAV-NRAVVGRTPYASTWSVLAMGYAEHGRMSKAVEMLKKAM--LVGRQDWKPKRDILEA
        +  A+ +  EW+S    YDFRV N ++  Y  K + +KAE+ + + A  G+     +W ++A  YAE G +  A + +K A+   VG + W+P   ++ +
Subjt:  IEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAV-NRAVVGRTPYASTWSVLAMGYAEHGRMSKAVEMLKKAM--LVGRQDWKPKRDILEA

Query:  CLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSI-AGGKPVLSILEQMKMDGFSADEEVDKILGTKT
         L ++ ++G  + +E  +   ++   V K+MY+ L++  I  GG+ + ++L++MK D    DEE   IL T++
Subjt:  CLDYLEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSI-AGGKPVLSILEQMKMDGFSADEEVDKILGTKT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAACGAAGCTCCGAAACTGGAACAAGCTCATTCCCAATCTCCTTCAAACCCCTCAACAATCTAATCCACTAGCTTCCCTTTTCTGTACTAAAACCCTTTCCCCGGT
CTTCTCTTCCACTTCTCCGCCCACATCAACCACTCTCCTCGACAAAATCTTAACCATACGAGACCCTAAAATCTCTGTTATTCCGGTACTGGAGAAGTGGGTCGGCGATG
GCAAAGCGATTGGGAAACAAGAACTTCAATCTCTTGTTTACCTCATGAAGAGCTTTCGGCGCTTCAATCACGCATTAGAGATATCTCAGTGGATGACGGATCGAAGATAC
TTCAATTTATCGTCAAGCGATGCAGCAATCAGGCTGGATTTAATCCGTAGAGTTCATGGTCTGGAACACGCAGAACATTACTTCAATAGTATATCTTCTCAGTTGAAAAC
TTATAATGCTTATGGTGCTCTTCTCTGTAGTTATGTGCGAGAGAAATCAATTGAGAAAGCTGAAGCCATTATGCAAGAAATGAGAAAGATGGGCATGGCTACTACGTCCT
TTCCTTATAATGTGCTAATTAACCTCTACGCTCAGATTGGGCAGCATGATAAGATTGATCTCCTGATTCAAGAAATGAAAATGAAGGGAATACCTCAAGATATTTACACA
ATTAGAAACCTTTGTGCAGCTTATGTTGCTAAGACAGACATTTCTGGTATGGAAAAGATCCTCAGAAGGATTGAGGAAGATTCTGAACACAAGGCTGATTGGAGAATTTA
TTCAATTGCTGCTAGTGGCTATCTATCAGCTGGGTTGGAAACGAAGGCTCTTTCCATGCTAAAGAAAATGGAGGAGAAGATTCCACCTAATAATAATAAATCTGCATTTG
AGTTTCTTCTGTCCCTTTATGAACGAACAGGGCGTAAGGACGAACTTTACAGAGTTTGGAGTACCTTCAAGCCATTAATTAGACAAACACATGTGCCATATGCTTTAATG
ATCACATCTCTAGCCAAGCTTGATGATATTGAAGGGGCTGATAGGATCTTCCAGGAGTGGGAATCAAAGTGTACATTCTATGACTTTCGAGTGTTGAATCGACTTGTGGT
CGCTTATTGCAGAAAAGGTCTTTTTGATAAGGCAGAATCAGCCGTTAACCGAGCAGTGGTTGGAAGAACTCCATACGCCAGCACGTGGAGCGTGTTAGCCATGGGATACG
CAGAACATGGACGCATGAGCAAAGCCGTTGAGATGTTGAAGAAAGCTATGTTAGTTGGAAGGCAAGATTGGAAACCAAAGCGTGACATTTTGGAAGCTTGTCTGGATTAC
TTGGAAGAACAAGGAGATGCAGAAACAATGGAGGAAGTAATACGACTATGCAAAAGCTCGGGTACAGTAGCGAAGGAGATGTACTACAGATTGCTGAGAACTTCCATAGC
AGGGGGAAAACCAGTTCTCAGCATTCTAGAACAGATGAAAATGGATGGTTTTTCAGCAGATGAAGAGGTAGACAAAATCCTGGGAACTAAGACGAACTTAGGCAATGTCT
ATGAACTGCTATGGCTGAAATGCAAGGCCAACTCTTCTTGTAATGATGAAAGGAATGCCATGAAGCTAATTTCACTTATTTTGGCTTACCGGTATAACATGGACTGCATC
CAATCGCAAACCGCCGTACTCTCCTATTCTCCCGCCGGAACTTCCCGTCGACTCCTCAGTGCTGAGTCCGACCATGATGAAGCTCCATTGCTCTCAGCCATGGCGTTGTT
GCTCCAATTTCAAGGCGATCCTCGAATCTCAATTGTTCGCGTGTTGGACCAATGGGTCGAAGAAGGCCGAGAAGTCAAGCAATCTGACCTGCAAAAGCTCGTCAAGCAGC
TCAGGAAGTTCCGTCGCTTCAACCATGCTCTACAGTTGTGTGAATGGATAAGTAATGAAAGGAACCAAGACCCATTACCTGGGGACATTGCTATTCAGTTGCACTTAATT
TCAAAAGTTCATGGTTTAGAACAAGCTGAGAAGTATTTTAGCAGCATCAGGGAATCTTCAAGAGATCATAAGGTCTATGGAGCACTTCTAAACTGTTATGTAGAGAATAA
AAATTTGGAAAAGGCAGAGGCAATCATGCAGAAGATGAGGGAAGTAGGATTTATGAAAACACCACTTTCCTATAATGTTATGTTAAGCCTTTATTCTCATCTCGGTAAAC
ATGAGAAACTTGATGAATTAATGGAAGAAATGGAAGAGATGGGAATTGCTCATGATAGATTTACATATAATATTCGTATGAATGCTTATGCAGCTACTTCCAATATAACA
AACATGGAAAAGCTTTTGTTGAAAATGGAGGCAGACCCACTAGTTACTATGGATTGGCATGCTTATTTCATTGTAGCAAATGGATATTTCAAAGCTGGTCTTTCTGAAAA
TAGTATATTGATGATGCTGAAGAGATCAGAACAACTCATTGGTGACAAACAAAAGTGGTTTGCATATGAATGTCTCATTACACTATATACTGCTATAGGAAATAAGGATG
AGGTGTATCGGGTTTGGAACTTGTACACGAATCTGAAAAGAAGATACAATTCAGGATATCTTTGTATAATAAGTTCATTAATGAAACTGGACGATATCGACGGTACTGAA
AAAATCTTGAAGGAATGGGAATCAGGGGATACATGTTTTGACTTTAAAATTCCAAACATGATGATAAATAGTTATTGTAAGAAGGGATTTGTGGATAAGGCAGAAGCATA
TATTAGCAGGCTTATGGAGAATGGCAAGGAACCACAAGCAAACACTTGGGATCGACTAGCAAGTGGATATCATGCTAATGGTTTGACAAATAAAGCAGTAGAAACTCTGA
AGAAAGCAATCTCAGTTAGTCAGCCTGGATGGAAGCCTAATTATCATACCTTGGCCGCATGTATTGAATATTTGAAAACAAATGGAAATGTGGAGGTAGCAGAGGAAATC
GTAGAGCTCCTTTGCAAACGTGATATTGTTCCCTTAAACATTTCCAATAGATTAGAAGATTATATCCGCAGTGAAAACCAAACCTCAATCAAGTGCCTTGATCAACTCGG
CCTGGAAGGTCAGAATGAGAAACTCGATAACGTGTCTGATCAAAACAAGCTCGACTTTGCTGAGCTAAATTATGATGAGACTTCTGATAGTAAGTGA
mRNA sequenceShow/hide mRNA sequence
GTTCTATCCTCTACATTCTATGCTTCATTTAGAAGTTCACTCGATGTTTGGTGCTCAATGCTCAGTGCTAGGTGCTCGATGCATTACGCTCTATGCTCGATCAGAAGTTC
ATTTGATGCTCGATAAAAAGAAACATAGAAAAAAGAAGAAGAAGAAGAAATGTAGAAAAACAATAAGAAGAAGAAATGCAGAAAAATAAAAGAAAAAAGGAAGAAGATAT
GAAAAAGGGGAAAAAAGAAAAAGGAAAACGCTCCAAAACGAAGAAAACATAAAAAGAATGCATTTAAGATTAAAATGATAATTTTACCTAATAAACATAAGGCCACGGTT
CATTAATTAAAAAAAAGAAAATGATGATAAACAGAAAAAATATCAAACTATTTATAAATATAAAAAATTTTATCGCAAAATTTTCTATATTTATAAATAATTTGATTCAT
TTTTTTTATATTTGAAAATTACCCCAAAATAAAATATCAATTTTTCATTTGAGCCCTTATATTGGTCGTCTATGCCGATCTATTTCCGGCTGCACCAAGTTTTTCTCCGA
AGTTCAAAGCTCTTCCCATTCTCAATTTGTAAGCTTTATTTTCTGTAGATTTTAGTACACCATTTATCAATTCTCATCGTTTTCATCTTTGAATCAAGAAGATGATAACG
AAGCTCCGAAACTGGAACAAGCTCATTCCCAATCTCCTTCAAACCCCTCAACAATCTAATCCACTAGCTTCCCTTTTCTGTACTAAAACCCTTTCCCCGGTCTTCTCTTC
CACTTCTCCGCCCACATCAACCACTCTCCTCGACAAAATCTTAACCATACGAGACCCTAAAATCTCTGTTATTCCGGTACTGGAGAAGTGGGTCGGCGATGGCAAAGCGA
TTGGGAAACAAGAACTTCAATCTCTTGTTTACCTCATGAAGAGCTTTCGGCGCTTCAATCACGCATTAGAGATATCTCAGTGGATGACGGATCGAAGATACTTCAATTTA
TCGTCAAGCGATGCAGCAATCAGGCTGGATTTAATCCGTAGAGTTCATGGTCTGGAACACGCAGAACATTACTTCAATAGTATATCTTCTCAGTTGAAAACTTATAATGC
TTATGGTGCTCTTCTCTGTAGTTATGTGCGAGAGAAATCAATTGAGAAAGCTGAAGCCATTATGCAAGAAATGAGAAAGATGGGCATGGCTACTACGTCCTTTCCTTATA
ATGTGCTAATTAACCTCTACGCTCAGATTGGGCAGCATGATAAGATTGATCTCCTGATTCAAGAAATGAAAATGAAGGGAATACCTCAAGATATTTACACAATTAGAAAC
CTTTGTGCAGCTTATGTTGCTAAGACAGACATTTCTGGTATGGAAAAGATCCTCAGAAGGATTGAGGAAGATTCTGAACACAAGGCTGATTGGAGAATTTATTCAATTGC
TGCTAGTGGCTATCTATCAGCTGGGTTGGAAACGAAGGCTCTTTCCATGCTAAAGAAAATGGAGGAGAAGATTCCACCTAATAATAATAAATCTGCATTTGAGTTTCTTC
TGTCCCTTTATGAACGAACAGGGCGTAAGGACGAACTTTACAGAGTTTGGAGTACCTTCAAGCCATTAATTAGACAAACACATGTGCCATATGCTTTAATGATCACATCT
CTAGCCAAGCTTGATGATATTGAAGGGGCTGATAGGATCTTCCAGGAGTGGGAATCAAAGTGTACATTCTATGACTTTCGAGTGTTGAATCGACTTGTGGTCGCTTATTG
CAGAAAAGGTCTTTTTGATAAGGCAGAATCAGCCGTTAACCGAGCAGTGGTTGGAAGAACTCCATACGCCAGCACGTGGAGCGTGTTAGCCATGGGATACGCAGAACATG
GACGCATGAGCAAAGCCGTTGAGATGTTGAAGAAAGCTATGTTAGTTGGAAGGCAAGATTGGAAACCAAAGCGTGACATTTTGGAAGCTTGTCTGGATTACTTGGAAGAA
CAAGGAGATGCAGAAACAATGGAGGAAGTAATACGACTATGCAAAAGCTCGGGTACAGTAGCGAAGGAGATGTACTACAGATTGCTGAGAACTTCCATAGCAGGGGGAAA
ACCAGTTCTCAGCATTCTAGAACAGATGAAAATGGATGGTTTTTCAGCAGATGAAGAGGTAGACAAAATCCTGGGAACTAAGACGAACTTAGGCAATGTCTATGAACTGC
TATGGCTGAAATGCAAGGCCAACTCTTCTTGTAATGATGAAAGGAATGCCATGAAGCTAATTTCACTTATTTTGGCTTACCGGTATAACATGGACTGCATCCAATCGCAA
ACCGCCGTACTCTCCTATTCTCCCGCCGGAACTTCCCGTCGACTCCTCAGTGCTGAGTCCGACCATGATGAAGCTCCATTGCTCTCAGCCATGGCGTTGTTGCTCCAATT
TCAAGGCGATCCTCGAATCTCAATTGTTCGCGTGTTGGACCAATGGGTCGAAGAAGGCCGAGAAGTCAAGCAATCTGACCTGCAAAAGCTCGTCAAGCAGCTCAGGAAGT
TCCGTCGCTTCAACCATGCTCTACAGTTGTGTGAATGGATAAGTAATGAAAGGAACCAAGACCCATTACCTGGGGACATTGCTATTCAGTTGCACTTAATTTCAAAAGTT
CATGGTTTAGAACAAGCTGAGAAGTATTTTAGCAGCATCAGGGAATCTTCAAGAGATCATAAGGTCTATGGAGCACTTCTAAACTGTTATGTAGAGAATAAAAATTTGGA
AAAGGCAGAGGCAATCATGCAGAAGATGAGGGAAGTAGGATTTATGAAAACACCACTTTCCTATAATGTTATGTTAAGCCTTTATTCTCATCTCGGTAAACATGAGAAAC
TTGATGAATTAATGGAAGAAATGGAAGAGATGGGAATTGCTCATGATAGATTTACATATAATATTCGTATGAATGCTTATGCAGCTACTTCCAATATAACAAACATGGAA
AAGCTTTTGTTGAAAATGGAGGCAGACCCACTAGTTACTATGGATTGGCATGCTTATTTCATTGTAGCAAATGGATATTTCAAAGCTGGTCTTTCTGAAAATAGTATATT
GATGATGCTGAAGAGATCAGAACAACTCATTGGTGACAAACAAAAGTGGTTTGCATATGAATGTCTCATTACACTATATACTGCTATAGGAAATAAGGATGAGGTGTATC
GGGTTTGGAACTTGTACACGAATCTGAAAAGAAGATACAATTCAGGATATCTTTGTATAATAAGTTCATTAATGAAACTGGACGATATCGACGGTACTGAAAAAATCTTG
AAGGAATGGGAATCAGGGGATACATGTTTTGACTTTAAAATTCCAAACATGATGATAAATAGTTATTGTAAGAAGGGATTTGTGGATAAGGCAGAAGCATATATTAGCAG
GCTTATGGAGAATGGCAAGGAACCACAAGCAAACACTTGGGATCGACTAGCAAGTGGATATCATGCTAATGGTTTGACAAATAAAGCAGTAGAAACTCTGAAGAAAGCAA
TCTCAGTTAGTCAGCCTGGATGGAAGCCTAATTATCATACCTTGGCCGCATGTATTGAATATTTGAAAACAAATGGAAATGTGGAGGTAGCAGAGGAAATCGTAGAGCTC
CTTTGCAAACGTGATATTGTTCCCTTAAACATTTCCAATAGATTAGAAGATTATATCCGCAGTGAAAACCAAACCTCAATCAAGTGCCTTGATCAACTCGGCCTGGAAGG
TCAGAATGAGAAACTCGATAACGTGTCTGATCAAAACAAGCTCGACTTTGCTGAGCTAAATTATGATGAGACTTCTGATAGTAAGTGACGTTTTTTCCTTTCCACCTTTC
TTCCTTGTTATAGTATTAGCATTAATTTTATTTCAATGTACTTGAAATACCAAGATATTTGGATTCTATAATTTAGGAAATCTCATTTTCTTCTAGTAATCAGTGTACTG
GGAATTTTCATCTACAAATTTTTTGTCCTTGGAACCCTTTTTTTCCACCTTGGATCACTATGGCATGTGAAGTGTTGGAAATCAAAATTTGAGAAGACTATATTTTTC
Protein sequenceShow/hide protein sequence
MITKLRNWNKLIPNLLQTPQQSNPLASLFCTKTLSPVFSSTSPPTSTTLLDKILTIRDPKISVIPVLEKWVGDGKAIGKQELQSLVYLMKSFRRFNHALEISQWMTDRRY
FNLSSSDAAIRLDLIRRVHGLEHAEHYFNSISSQLKTYNAYGALLCSYVREKSIEKAEAIMQEMRKMGMATTSFPYNVLINLYAQIGQHDKIDLLIQEMKMKGIPQDIYT
IRNLCAAYVAKTDISGMEKILRRIEEDSEHKADWRIYSIAASGYLSAGLETKALSMLKKMEEKIPPNNNKSAFEFLLSLYERTGRKDELYRVWSTFKPLIRQTHVPYALM
ITSLAKLDDIEGADRIFQEWESKCTFYDFRVLNRLVVAYCRKGLFDKAESAVNRAVVGRTPYASTWSVLAMGYAEHGRMSKAVEMLKKAMLVGRQDWKPKRDILEACLDY
LEEQGDAETMEEVIRLCKSSGTVAKEMYYRLLRTSIAGGKPVLSILEQMKMDGFSADEEVDKILGTKTNLGNVYELLWLKCKANSSCNDERNAMKLISLILAYRYNMDCI
QSQTAVLSYSPAGTSRRLLSAESDHDEAPLLSAMALLLQFQGDPRISIVRVLDQWVEEGREVKQSDLQKLVKQLRKFRRFNHALQLCEWISNERNQDPLPGDIAIQLHLI
SKVHGLEQAEKYFSSIRESSRDHKVYGALLNCYVENKNLEKAEAIMQKMREVGFMKTPLSYNVMLSLYSHLGKHEKLDELMEEMEEMGIAHDRFTYNIRMNAYAATSNIT
NMEKLLLKMEADPLVTMDWHAYFIVANGYFKAGLSENSILMMLKRSEQLIGDKQKWFAYECLITLYTAIGNKDEVYRVWNLYTNLKRRYNSGYLCIISSLMKLDDIDGTE
KILKEWESGDTCFDFKIPNMMINSYCKKGFVDKAEAYISRLMENGKEPQANTWDRLASGYHANGLTNKAVETLKKAISVSQPGWKPNYHTLAACIEYLKTNGNVEVAEEI
VELLCKRDIVPLNISNRLEDYIRSENQTSIKCLDQLGLEGQNEKLDNVSDQNKLDFAELNYDETSDSK