CuGenDBv2

Lsi06G005830 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi06G005830
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: pentatricopeptide repeat-containing protein At1g62350
Genome location: chr06:7868068..7872951
RNA-Seq Expression: Lsi06G005830
Synteny: Lsi06G005830
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains:
    IPR002885 - Pentatricopeptide repeat
    IPR011990 - Tetratricopeptide-like helical domain superfamily
    IPR044795 - Pentatricopeptide repeat-containing protein THA8L-like
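
The genome location field packs chromosome, start, and end into one string. A minimal Python sketch for splitting it is shown here; the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, span_length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
    return chrom, start, end, end - start + 1


print(parse_location("chr06:7868068..7872951"))
```

Under that assumption the gene spans 4884 bp on chr06, which is longer than the mRNA listed further down the record.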


Homology
GenBank top hits    e value    % identity
XP_004139064.1 pentatricopeptide repeat-containing protein At1g62350 isoform X1 [Cucumis sativus]    1.6e-122    86.35
Query:  MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDL
        MLRLAPNL R+IS+   S+T FHRFSL TFLN D+LQQQ LLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFIS+HVSRLLKSDL
Subjt:  MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDL

Query:  VAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS
        VAVLVELQRQN +FLCMK               LYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVW+DLKKEGVLFDQHTFGDIIRAY DN M S
Subjt:  VAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS

Query:  EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKSEDD
        EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFP MIVYDPPEDLFEEDE+R KSEDD
Subjt:  EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKSEDD

XP_008450338.1 PREDICTED: pentatricopeptide repeat-containing protein At1g62350 [Cucumis melo]    2.0e-122    86.35
Query:  MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDL
        MLRLAPNL R+IS+   S+T FHRFSL TFLN D+LQQQ LLRFI GSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFIS+HVSRLLKSDL
Subjt:  MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDL

Query:  VAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS
        VAVLVELQRQN +FLCMK               LYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVW+DLKKEGVLFDQHTFGDIIRAY DNAM S
Subjt:  VAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS

Query:  EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKSEDD
        EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFP MIVYDPPEDLFEEDE+R KSEDD
Subjt:  EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKSEDD

XP_022966084.1 pentatricopeptide repeat-containing protein At1g62350 [Cucurbita maxima]    3.4e-117    83.21
Query:  MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDL
        MLR APNL RRISN A ST H +RF+  TF  H   QQQ LLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSN IRLDRFIS+HVSRLLKSDL
Subjt:  MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDL

Query:  VAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS
        VAVLVELQRQNQ+FLCMK               LYNVVRKEVWYRPDMFFYRDML MLAKNKRVEETKQVW+DLK EGVLFDQHTFGDI+RAY DNAMPS
Subjt:  VAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS

Query:  EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKS
        EAMDIYREMR+SPDRPLSLPFRVILKGL+PYPELREQV+DDFLELFP MIVYDPPEDLFEEDE+R KS
Subjt:  EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKS

XP_023531705.1 pentatricopeptide repeat-containing protein At1g62350 [Cucurbita pepo subsp. pepo]    2.0e-117    82.84
Query:  MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDL
        MLR APNL RR SNRA ST H +RF+  TF  H   QQQ LLRFITGSASSPSLS+WRRKKEMGKEGLIVVKELKR+QSN IRLDRFIS+HVSRLLKSDL
Subjt:  MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDL

Query:  VAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS
        VAVLVELQRQNQ+FLCMK               LYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVW+DLK EGVLFDQHTFGDI+RAY DNAMPS
Subjt:  VAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS

Query:  EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKS
        EAMDIYREMR+SPDRPLSLPFRVILKGL+PYPELREQVKDDFLELFP MIVYDPPEDLFEEDE+  KS
Subjt:  EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKS

XP_038878451.1 pentatricopeptide repeat-containing protein At1g62350 [Benincasa hispida]    2.0e-122    88.17
Query:  MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDL
        MLRL  NL RRISN A STT FHRFSLPTFLNHD+ QQQ LLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFIS+HVSRLLKSDL
Subjt:  MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDL

Query:  VAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS
        VAVL ELQRQNQ+FLCMK               LYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAY DNAMPS
Subjt:  VAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS

Query:  EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEED
        EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFP MIVYDPPE+LFEE+
Subjt:  EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEED
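
The % identity column can be related to the alignment midlines. The sketch below assumes the usual BLAST convention, which this page does not confirm: a letter on the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Identity is then the number of identical positions divided by the alignment length. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query, midline):
    """Percent identity for one aligned block, given its Query line and midline.

    Assumes BLAST-style midlines: letters are identities, '+' and spaces are not.
    """
    # Trailing spaces are often lost in scraped text, so pad to the query length.
    midline = midline.ljust(len(query))[:len(query)]
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# First 13 columns of the Cucumis sativus alignment, used only as a small demo.
print(round(percent_identity("MLRLAPNLFRRIS", "MLRLAPNL R+IS"), 2))
```

For a full hit, the blocks would need to be concatenated (with the leading `Query:` labels and indentation stripped) before calling the function; a short fragment like the demo will not reproduce the reported whole-alignment value.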

TrEMBL top hits    e value    % identity
A0A1S3BQ11 pentatricopeptide repeat-containing protein At1g62350    9.8e-123    86.35
Query:  MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDL
        MLRLAPNL R+IS+   S+T FHRFSL TFLN D+LQQQ LLRFI GSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFIS+HVSRLLKSDL
Subjt:  MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDL

Query:  VAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS
        VAVLVELQRQN +FLCMK               LYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVW+DLKKEGVLFDQHTFGDIIRAY DNAM S
Subjt:  VAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS

Query:  EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKSEDD
        EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFP MIVYDPPEDLFEEDE+R KSEDD
Subjt:  EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKSEDD

A0A6J1DGX4 pentatricopeptide repeat-containing protein At1g62350    4.1e-113    80.15
Query:  MLRLAPNLFRRISNRA--NSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKS
        MLRL PNL RR  NR    +T  FH  S  TF + D LQQQ L RFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSN IRLDRFIS+HVSRLLKS
Subjt:  MLRLAPNLFRRISNRA--NSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKS

Query:  DLVAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAM
        DLVAVLVELQRQ Q+FLCMK               LYNVVRKEVWYRPDMFFYRDMLMML+KNK+VEETKQVWQDLK+E VLFDQHTFGDIIRAY DN M
Subjt:  DLVAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAM

Query:  PSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKSED
        PSEAMDIYREMR+SPDRPLSLPFRVILKGLIPYPELREQ+KDDFLELFP MIVYDPPEDLFEEDE+R++  D
Subjt:  PSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKSED

A0A6J1EQI1 pentatricopeptide repeat-containing protein At1g62350    3.1e-116    82.46
Query:  MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDL
        MLR APNL RR SN A ST H +R +  TF  H   QQQ LLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKR+QSN IRLDRFIS+HVSRLLKSDL
Subjt:  MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDL

Query:  VAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS
        VAVLVELQRQNQ+FLCMK               LYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVW+DLK EGVLFDQHTFGDI+RAY DNAMPS
Subjt:  VAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS

Query:  EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKS
        EAMDIYREMR+SPDRPLSLPFRVILKGL+PYPELREQVKDDFLELFP MIVYDPPEDLFEEDE   KS
Subjt:  EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKS

A0A6J1HQL7 pentatricopeptide repeat-containing protein At1g62350    1.6e-117    83.21
Query:  MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDL
        MLR APNL RRISN A ST H +RF+  TF  H   QQQ LLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSN IRLDRFIS+HVSRLLKSDL
Subjt:  MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDL

Query:  VAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS
        VAVLVELQRQNQ+FLCMK               LYNVVRKEVWYRPDMFFYRDML MLAKNKRVEETKQVW+DLK EGVLFDQHTFGDI+RAY DNAMPS
Subjt:  VAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS

Query:  EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKS
        EAMDIYREMR+SPDRPLSLPFRVILKGL+PYPELREQV+DDFLELFP MIVYDPPEDLFEEDE+R KS
Subjt:  EAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKS

A0A7N2KMM5 Uncharacterized protein    1.7e-95    68.12
Query:  MLRLAPNLFRRISNRANSTTHFHR-----FSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRL
        MLR A  L R+ S+ ++S+ +F +      S  TF N   +QQQ+L R ++G ASSPSLSIWRRKKEMGKEGLIV KELKRL+SN +RLDRFI +HVSRL
Subjt:  MLRLAPNLFRRISNRANSTTHFHR-----FSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRL

Query:  LKSDLVAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSD
        LKSDL AVL E QRQ+Q+FLCMK               LY+VVRKE+WYRPDMFFYRDMLMMLA+NK V+E K+VW+DLK E VLFDQHTFGD+IRA+SD
Subjt:  LKSDLVAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSD

Query:  NAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKSEDD
        + +PSEAM+IY EMR+SPD P+SLPFRVILKGLIPYPELRE+VKDDFLELFPGM+VYDPPEDLFE+ + +R+SEDD
Subjt:  NAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKSEDD

SwissProt top hits    e value    % identity
Q1PFH7 Pentatricopeptide repeat-containing protein At1g62350    5.3e-81    70.33
Query:  MGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNK
        M KEGLI  KELKRLQ+  +RLDRFI +HVSRLLKSDLV+VL E QRQNQ+FLCMK               LY VVR+E+WYRPDMFFYRDMLMMLA+NK
Subjt:  MGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNK

Query:  RVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEED
        +V+ETK+VW+DLKKE VLFDQHTFGD++R + DN +P EAM +Y EMRESPDRPLSLPFRVILKGL+PYPELRE+VKDDFLELFPGMIVYDPPED+ E+ 
Subjt:  RVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEED

Query:  ENRRKSEDD
        +   +++ D
Subjt:  ENRRKSEDD

Q5G1S8 Pentatricopeptide repeat-containing protein At3g18110, chloroplastic    8.9e-04    27.38
Query:  PDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPDR-PLSLPFRVILKGL
        PD   Y  +L   A+ +  E+ K+V+Q ++K G   D+ T+  II  Y        A+ +Y++M+    R P ++ + V++  L
Subjt:  PDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPDR-PLSLPFRVILKGL

Q9FKC3 Pentatricopeptide repeat-containing protein At5g48730, chloroplastic    1.9e-06    20.43
Query:  STTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRK---------KEMGKEGLIVVKELKRLQSNFIRLDRFIS----THVSRLLKSDLVAVL
        ST+  H   LPT  N    ++ F +R I+ S   P+ +I   K         +    + LI  +++    S  I+ D+         +S +L+ +    +
Subjt:  STTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRK---------KEMGKEGLIVVKELKRLQSNFIRLDRFIS----THVSRLLKSDLVAVL

Query:  VELQRQNQIFLCMKG-KSSDAKIIKPQWN---ELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS
        +E ++ ++  L     +S   +I   +W    +++ ++R+++WY+P++  Y  +++ML K K+ E+  +++Q++  EG + +   +  ++ AYS +    
Subjt:  VELQRQNQIFLCMKG-KSSDAKIIKPQWN---ELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS

Query:  EAMDIYREMRESPD-RPLSLPFRVILKGLI
         A  +   M+ S + +P    + +++K  +
Subjt:  EAMDIYREMRESPD-RPLSLPFRVILKGLI

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic    3.1e-49    44.09
Query:  FLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVR
        F+ RF  G    P   +WR KK +GKE L V+  LKRL+ +  +LD+FI THV RLLK D++AV+ EL+RQ +  L +K               ++ V++
Subjt:  FLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVR

Query:  KEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVK
        K+ WY+PD+F Y+D+++ LAK+KR++E   +W+ +KKE +  D  T+ ++IR +  +  P++AM++Y +M +SPD P  LPFRV+LKGL+P+P LR +VK
Subjt:  KEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVK

Query:  DDFLELFPGMIVYDPPEDLF
         DF ELFP    YDPPE++F
Subjt:  DDFLELFPGMIVYDPPEDLF

Q9ZVX5 Pentatricopeptide repeat-containing protein At2g16880    4.7e-05    23.33
Query:  IRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFL----CMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKE
        +R     S   +R +  D+V + V L  Q    L    C++GK  DA  +  +    + V        PD   Y  +L  ++K  R+ + K++  D+KK 
Subjt:  IRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFL----CMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKE

Query:  GVLFDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELRE--QVKDDF--LELFPGMIVYD
        G++ ++ T+ +++  Y       EA  I   M+++   P    + +++ GL     +RE  ++ D    L+L P ++ Y+
Subjt:  GVLFDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELRE--QVKDDF--LELFPGMIVYD

Arabidopsis top hits    e value    % identity
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein    3.7e-82    70.33
Query:  MGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNK
        M KEGLI  KELKRLQ+  +RLDRFI +HVSRLLKSDLV+VL E QRQNQ+FLCMK               LY VVR+E+WYRPDMFFYRDMLMMLA+NK
Subjt:  MGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNK

Query:  RVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEED
        +V+ETK+VW+DLKKE VLFDQHTFGD++R + DN +P EAM +Y EMRESPDRPLSLPFRVILKGL+PYPELRE+VKDDFLELFPGMIVYDPPED+ E+ 
Subjt:  RVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEED

Query:  ENRRKSEDD
        +   +++ D
Subjt:  ENRRKSEDD

AT3G42570.1 peroxidase family protein    2.0e-06    70.59
Query:  KKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVS
        KKE  KEGLI  KELKRLQ+N +RLDRFI +H S
Subjt:  KKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVS

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein    2.2e-50    44.09
Query:  FLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVR
        F+ RF  G    P   +WR KK +GKE L V+  LKRL+ +  +LD+FI THV RLLK D++AV+ EL+RQ +  L +K               ++ V++
Subjt:  FLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVR

Query:  KEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVK
        K+ WY+PD+F Y+D+++ LAK+KR++E   +W+ +KKE +  D  T+ ++IR +  +  P++AM++Y +M +SPD P  LPFRV+LKGL+P+P LR +VK
Subjt:  KEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPELREQVK

Query:  DDFLELFPGMIVYDPPEDLF
         DF ELFP    YDPPE++F
Subjt:  DDFLELFPGMIVYDPPEDLF

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain    6.3e-13    31.15
Query:  LDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKE-GVLFD
        LDR I +   RLLK D+VAVL EL RQN+  L +K               ++  +RKE WY+P +  Y DM+ ++A N  +EE   ++  +K E G++ +
Subjt:  LDRFISTHVSRLLKSDLVAVLVELQRQNQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKE-GVLFD

Query:  QHTFGDIIRAYSDNAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPE--LREQVKDDFLELFPGMIVYDPPEDLFEEDE
           F  ++    ++ +    MD Y  M+     P    FRV++ GL    E  L   V+ D  E       Y    +  EEDE
Subjt:  QHTFGDIIRAYSDNAMPSEAMDIYREMRESPDRPLSLPFRVILKGLIPYPE--LREQVKDDFLELFPGMIVYDPPEDLFEEDE

AT5G48730.1 Pentatricopeptide repeat (PPR) superfamily protein    1.4e-07    20.43
Query:  STTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRK---------KEMGKEGLIVVKELKRLQSNFIRLDRFIS----THVSRLLKSDLVAVL
        ST+  H   LPT  N    ++ F +R I+ S   P+ +I   K         +    + LI  +++    S  I+ D+         +S +L+ +    +
Subjt:  STTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRK---------KEMGKEGLIVVKELKRLQSNFIRLDRFIS----THVSRLLKSDLVAVL

Query:  VELQRQNQIFLCMKG-KSSDAKIIKPQWN---ELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS
        +E ++ ++  L     +S   +I   +W    +++ ++R+++WY+P++  Y  +++ML K K+ E+  +++Q++  EG + +   +  ++ AYS +    
Subjt:  VELQRQNQIFLCMKG-KSSDAKIIKPQWN---ELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPS

Query:  EAMDIYREMRESPD-RPLSLPFRVILKGLI
         A  +   M+ S + +P    + +++K  +
Subjt:  EAMDIYREMRESPD-RPLSLPFRVILKGLI


Sequences
CDS sequence
ATGCTTCGTCTTGCTCCGAATCTTTTTCGCAGAATCTCAAACAGGGCCAATTCCACAACCCATTTTCATCGCTTTTCCCTTCCCACATTCCTCAACCACGATGTGTTACA
GCAACAATTTCTGTTACGCTTCATCACTGGCTCTGCTTCCAGCCCTAGCCTCTCAATATGGAGGAGGAAGAAGGAGATGGGCAAGGAGGGTCTGATTGTGGTCAAAGAGC
TCAAGAGGCTTCAGTCCAATTTCATTCGCCTCGACCGCTTCATTTCCACCCATGTCTCTCGCTTGCTCAAGTCCGACCTTGTCGCTGTTCTCGTCGAGCTTCAGAGGCAG
AATCAGATCTTTCTGTGCATGAAGGGGAAGAGTAGTGATGCAAAGATCATCAAGCCTCAATGGAATGAGTTGTATAACGTGGTTCGTAAAGAAGTATGGTACCGGCCCGA
CATGTTCTTTTATAGAGATATGCTTATGATGCTTGCAAAGAACAAAAGGGTGGAAGAAACAAAACAAGTTTGGCAGGATCTGAAGAAAGAGGGAGTATTATTTGATCAGC
ATACTTTTGGAGACATTATTCGGGCATACTCAGATAATGCAATGCCCTCTGAGGCCATGGATATATACCGTGAAATGAGAGAATCTCCTGATAGGCCATTATCTTTGCCT
TTTCGTGTAATTTTGAAGGGACTTATTCCATACCCAGAATTGAGAGAACAAGTTAAAGATGACTTCTTGGAACTCTTCCCTGGTATGATCGTCTATGACCCACCAGAAGA
CTTGTTTGAAGAAGATGAAAATAGGAGGAAGAGTGAAGATGATTAA
mRNA sequence
AAAAAAAAAAAAAAAATCCTCCTGGGTTTCATTTCAACAAAGAGCAAAAACTCTGGTGCTGTATTCGCTTCAGGTTCCTCAAGAATTTTGATGCTTCGTCTTGCTCCGAA
TCTTTTTCGCAGAATCTCAAACAGGGCCAATTCCACAACCCATTTTCATCGCTTTTCCCTTCCCACATTCCTCAACCACGATGTGTTACAGCAACAATTTCTGTTACGCT
TCATCACTGGCTCTGCTTCCAGCCCTAGCCTCTCAATATGGAGGAGGAAGAAGGAGATGGGCAAGGAGGGTCTGATTGTGGTCAAAGAGCTCAAGAGGCTTCAGTCCAAT
TTCATTCGCCTCGACCGCTTCATTTCCACCCATGTCTCTCGCTTGCTCAAGTCCGACCTTGTCGCTGTTCTCGTCGAGCTTCAGAGGCAGAATCAGATCTTTCTGTGCAT
GAAGGGGAAGAGTAGTGATGCAAAGATCATCAAGCCTCAATGGAATGAGTTGTATAACGTGGTTCGTAAAGAAGTATGGTACCGGCCCGACATGTTCTTTTATAGAGATA
TGCTTATGATGCTTGCAAAGAACAAAAGGGTGGAAGAAACAAAACAAGTTTGGCAGGATCTGAAGAAAGAGGGAGTATTATTTGATCAGCATACTTTTGGAGACATTATT
CGGGCATACTCAGATAATGCAATGCCCTCTGAGGCCATGGATATATACCGTGAAATGAGAGAATCTCCTGATAGGCCATTATCTTTGCCTTTTCGTGTAATTTTGAAGGG
ACTTATTCCATACCCAGAATTGAGAGAACAAGTTAAAGATGACTTCTTGGAACTCTTCCCTGGTATGATCGTCTATGACCCACCAGAAGACTTGTTTGAAGAAGATGAAA
ATAGGAGGAAGAGTGAAGATGATTAACTAATTAAGGATTCTGCTCTTGCAGTCCATTTTTGATAGTTTTTTTTATTGGCAAAATTAAAAACTGTCCAGTTTGATGCGTCC
AACAAGGCTCTAAACCTTAAAAAAAAAAGTCTATGAGGTCTTAAATTTTCCATTTTATGTCTAATTGGTTGATGAACTTTAAAATTGTCTAATAAATTTTTAAACTTTCA
ATTTTGTACCGAACAATTAAACATATTTTAAAATTCACGAAAGTATTAAGCACAAAGTTGAAAGGTAAATTCAGTTTTGTGTTTAATAAACATGTGAAAGACATTTTTAA
ACTTCAAAAATTTATTAGGCACAATTTAGAAAATTTAGGGACTAGACTTATAAGTTAACATTGGCCCGGTTGGGACCATTTAATTGCTTGTGGGCTTTGAAGATCCTTTT
GCAGACTGACGCAATCTCAAAGCATTCCTGGGATCCGTTTGCCATTCAGAAAATTGTTCAAATAAGATTTATTGAATTTATCAACTTTGGATTCAATTGTATATTTATTT
ATTCTTGAC
Protein sequence
MLRLAPNLFRRISNRANSTTHFHRFSLPTFLNHDVLQQQFLLRFITGSASSPSLSIWRRKKEMGKEGLIVVKELKRLQSNFIRLDRFISTHVSRLLKSDLVAVLVELQRQ
NQIFLCMKGKSSDAKIIKPQWNELYNVVRKEVWYRPDMFFYRDMLMMLAKNKRVEETKQVWQDLKKEGVLFDQHTFGDIIRAYSDNAMPSEAMDIYREMRESPDRPLSLP
FRVILKGLIPYPELREQVKDDFLELFPGMIVYDPPEDLFEEDENRRKSEDD
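
The CDS and protein records should correspond codon for codon. A minimal Python sketch for checking that is given here, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are hypothetical, and the code does not handle ambiguous bases such as N.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and spaces so wrapped sequence text can be pasted directly.
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# The first six codons and the last four codons of the CDS listed above.
print(translate("ATGCTTCGTCTTGCTCCG"))
print(translate("GAAGATGATTAA"))
```

Pasting the full CDS block into `translate` and comparing the result with the protein block (line breaks removed) would confirm whether the two records agree along their whole length.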