CuGenDBv2

Carg01023 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg01023
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Pentatricopeptide repeat-containing protein
Genome location: Carg_Chr06:1602329..1604968
RNA-Seq Expression: Carg01023
Synteny: Carg01023
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
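
The genome location above is written as chromosome:start..end. The following is a minimal sketch of splitting such a string into its parts and computing the locus span. The helper name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state explicitly.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("Carg_Chr06:1602329..1604968")
span = end - start + 1  # assumption: 1-based, inclusive coordinates
print(chrom, start, end, span)
```

Under that assumption the locus spans 2640 bp.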


Homology
GenBank top hits | e value | % identity
KAG6580575.1 ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. sororia] | 5.0e-85 | 40.77
Query:  LQAIRRNDGMNYGAYGRFIQHVPASS--------------SSASATASRSSCSMFCRSREFPRIEGHRLLIKIWQ----PWRCLQCVRYTFHNMQSDMLK
        LQ IRR+DGMNYGAYGR IQH                   SS +      S  +   S+     + + +   I       W  L  + YT HNM +DMLK
Subjt:  LQAIRRNDGMNYGAYGRFIQHVPASS--------------SSASATASRSSCSMFCRSREFPRIEGHRLLIKIWQ----PWRCLQCVRYTFHNMQSDMLK

Query:  LFLSLFNLISMDAKPDKF------KFIVSFFDEGLSLSFLCLCSDTRGVIS------------------WLAKIVFDRMPARDIVF--------------
        LF SL NL S D KPDKF      K + S F   +    +      RG+ S                   LA+I+FDR P RDIV               
Subjt:  LFLSLFNLISMDAKPDKF------KFIVSFFDEGLSLSFLCLCSDTRGVIS------------------WLAKIVFDRMPARDIVF--------------

Query:  -ECDKCLRRMR----------------------------------------------------------------------------SHCAMISDYMT--
         +C +  + M                                                                             ++ +MIS YM   
Subjt:  -ECDKCLRRMR----------------------------------------------------------------------------SHCAMISDYMT--

Query:  ------EYYDTCERS-----------------------------------------------SHLLTFCNTKD------GNGYDGNVCVVTAIIDSYAKS
              + +   ER                                                SH  T    K+       N YDGN+ V TAIIDSYAKS
Subjt:  ------EYYDTCERS-----------------------------------------------SHLLTFCNTKD------GNGYDGNVCVVTAIIDSYAKS

Query:  GYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLS
        GYLH AR V DQ K RSLIIWTAIISAYAA+G AN  LS FYE+LTNGIRPD V FT V  +CAHSGELDEAWKIFNVLLPE+GIQPL+EHYACMVGVLS
Subjt:  GYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLS

Query:  LAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
         A K SDAV+FISKMPIEPT KVW ALLNGASVAGDVELGK VFD LLD EPENTGN IIM NLYS FGRWK++
Subjt:  LAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS

KAG6596437.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | 2.4e-103 | 73.38
Query:  LSLSFLCLCSDTRGVISWLAKIVFDRMPARDIVFECDKC---LRRMRSHCAMISD-----YMTEYYDTCERSSHLLTFCNTKDGNGYDGNVCVVTAIIDS
        L +  +C C    G + ++ ++ F+ MP +D V   D      + + S C  I         TEYYDTCERSSHLLTFCNTKDG                
Subjt:  LSLSFLCLCSDTRGVISWLAKIVFDRMPARDIVFECDKC---LRRMRSHCAMISD-----YMTEYYDTCERSSHLLTFCNTKDGNGYDGNVCVVTAIIDS

Query:  YAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVG
                 ARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVG
Subjt:  YAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVG

Query:  VLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKSG
        VLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKSG
Subjt:  VLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKSG

KAG7027978.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | 4.8e-237 | 100
Query:  MTNAKPTSLQISVPASALYPWVLQAIRRNDGMNYGAYGRFIQHVPASSSSASATASRSSCSMFCRSREFPRIEGHRLLIKIWQPWRCLQCVRYTFHNMQS
        MTNAKPTSLQISVPASALYPWVLQAIRRNDGMNYGAYGRFIQHVPASSSSASATASRSSCSMFCRSREFPRIEGHRLLIKIWQPWRCLQCVRYTFHNMQS
Subjt:  MTNAKPTSLQISVPASALYPWVLQAIRRNDGMNYGAYGRFIQHVPASSSSASATASRSSCSMFCRSREFPRIEGHRLLIKIWQPWRCLQCVRYTFHNMQS

Query:  DMLKLFLSLFNLISMDAKPDKFKFIVSFFDEGLSLSFLCLCSDTRGVISWLAKIVFDRMPARDIVFECDKCLRRMRSHCAMISDYMTEYYDTCERSSHLL
        DMLKLFLSLFNLISMDAKPDKFKFIVSFFDEGLSLSFLCLCSDTRGVISWLAKIVFDRMPARDIVFECDKCLRRMRSHCAMISDYMTEYYDTCERSSHLL
Subjt:  DMLKLFLSLFNLISMDAKPDKFKFIVSFFDEGLSLSFLCLCSDTRGVISWLAKIVFDRMPARDIVFECDKCLRRMRSHCAMISDYMTEYYDTCERSSHLL

Query:  TFCNTKDGNGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSCAHSGELDEAW
        TFCNTKDGNGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSCAHSGELDEAW
Subjt:  TFCNTKDGNGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSCAHSGELDEAW

Query:  KIFNVLLPEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK
        KIFNVLLPEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK
Subjt:  KIFNVLLPEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK

Query:  SG
        SG
Subjt:  SG

XP_022145703.1 pentatricopeptide repeat-containing protein At2g37310 [Momordica charantia] | 2.2e-88 | 40.65
Query:  QISVPASALYPWVLQAIRRNDGMNYGAYGRFIQH----------------VPASSSSASATASRSSCSMFCRS---REFPRIEGHRLLIKIWQPWRCLQC
        QIS+PA A+ PW LQAIRR DGMNY AYGR IQH                +   S +          + + +S   R+   + G+     I+  W  L  
Subjt:  QISVPASALYPWVLQAIRRNDGMNYGAYGRFIQH----------------VPASSSSASATASRSSCSMFCRS---REFPRIEGHRLLIKIWQPWRCLQC

Query:  VRYTFHNMQSDMLKLFLSLFNLISMDAKPDKFKFIV-------SFFDEGLSLSFLC------LCSD---TRGVISW--------LAKIVFDRMPARDIVF
        + YT HNM SDMLKLF SL N  +MD KPDKF           SF D  L+    C      L SD      ++++        LA+IVF RMP RDIV 
Subjt:  VRYTFHNMQSDMLKLFLSLFNLISMDAKPDKFKFIV-------SFFDEGLSLSFLC------LCSD---TRGVISW--------LAKIVFDRMPARDIVF

Query:  ---------------ECDKCLRRMRSHCAMISDYMT--EYYDTCERSSHLL------------------TFCNT--------------------------
                       EC +  + M S   +  + +T       C +S+ L+                  + CN                           
Subjt:  ---------------ECDKCLRRMRSHCAMISDYMT--EYYDTCERSSHLL------------------TFCNT--------------------------

Query:  ----------------------------------------------KDG---------------------------------------------NGYDGN
                                                      +DG                                             NGY+GN
Subjt:  ----------------------------------------------KDG---------------------------------------------NGYDGN

Query:  VCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLPEYGIQ
        + V TAIIDSYAKSGYLH A  V D  KGRSLIIWTAIISAYAA+G ANVALS FYE+L NGI+PD V FT V  +CAHSGELDEAWKIFN++LPEYGIQ
Subjt:  VCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLPEYGIQ

Query:  PLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
        PL+EHYACMVGVLS A K SDAVDFISKMPIEP+ KVW ALLNGASVAGDVELGK VFD LL+ EPENTG  IIM NLYS  GRWK++
Subjt:  PLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS

XP_023539538.1 pentatricopeptide repeat-containing protein At2g37310-like [Cucurbita pepo subsp. pepo] | 1.8e-87 | 51.84
Query:  MTNAKPTSLQISVPASALYPWVLQAIRRNDGMNYGAYGRFIQHVPASSSSASATASRSSCSMFCRSREFPRIEGHRLLIKIWQPWRCLQCVRYTFHNMQS
        MTNAKPTSLQISVPASALYPW LQAIRR+DGMNYGAYGRFIQ  P SSSSASATASRSSCSMFCRSREFPRIE     I  W      +C +  F+    
Subjt:  MTNAKPTSLQISVPASALYPWVLQAIRRNDGMNYGAYGRFIQHVPASSSSASATASRSSCSMFCRSREFPRIEGHRLLIKIWQPWRCLQCVRYTFHNMQS

Query:  DMLKLFLSLFNLISMDAKPDKFKFIVSFFDEGLSLSFLCLCSDTRGVISWLAKIVFDRMPARDIVFECDKCLRRMRSHCAMISDYMT---------EYYD
        ++ K  LSL  L     +P+    ++        L++L L  +  G + ++ ++ F+ MP +D V           ++ AMISDYM           Y  
Subjt:  DMLKLFLSLFNLISMDAKPDKFKFIVSFFDEGLSLSFLCLCSDTRGVISWLAKIVFDRMPARDIVFECDKCLRRMRSHCAMISDYMT---------EYYD

Query:  T-----CERSSHLLT--------FCNTKDG---------NGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFF
        +     C  ++  L         F   KDG         N YDGNVCVVTAIIDSYAKSGY HRARLVCDQFK                           
Subjt:  T-----CERSSHLLT--------FCNTKDG---------NGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFF

Query:  YEILTNGIRPDSVAFTPVCSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKC
                                GELDEAWKIFNVLLPEYGIQPL+EHYACMVGVLS AEK SDAVDFISKMPIEPTTKVW ALLNGASVAGDVELGKC
Subjt:  YEILTNGIRPDSVAFTPVCSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKC

Query:  VFDSLLD
        VFDSLL+
Subjt:  VFDSLLD

TrEMBL top hits | e value | % identity
A0A0A0LFN1 Uncharacterized protein | 5.8e-79 | 78.12
Query:  NGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLL
        N YD N+ V TAIIDSYAK GYLH A+LV DQ KGRSLI WT+IISAYA +G ANVALS FYE+LTNGI+PD V FT V  +CAHSGELDEAWKIFNVLL
Subjt:  NGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLL

Query:  PEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWK
        PEYGIQPL+EHYACMVGVLS A K SDAV+FISKMP+EPT KVW ALLNGASVAGDVELGK VFD L + EPENTGN +IM NLYS  GRWK
Subjt:  PEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWK

A0A5A7TRM4 Pentatricopeptide repeat-containing protein | 1.1e-77 | 39.22
Query:  MNYGAYGRFIQH----------------VPASSSSASATASRSSCSMFCRS---REFPRIEGHRLLIKIWQPWRCLQCVRYTFHNMQSDMLKLFLSLFNL
        MNYGAYGR IQH                +  SS +          S + +S   R+   + G ++  K    W  L  + YT HNM +D+LKLFLSL N 
Subjt:  MNYGAYGRFIQH----------------VPASSSSASATASRSSCSMFCRS---REFPRIEGHRLLIKIWQPWRCLQCVRYTFHNMQSDMLKLFLSLFNL

Query:  ISMDAKPDKF------KFIVSFFDEG-LSLSFLC------LCSD---TRGVISW--------LAKIVFDRMPARDIVF---------------ECDKCLR
         S D KPD+F      K + S F    L+    C      L SD      +I++        LA+I+FDRMP RDIV                +C +  R
Subjt:  ISMDAKPDKF------KFIVSFFDEG-LSLSFLC------LCSD---TRGVISW--------LAKIVFDRMPARDIVF---------------ECDKCLR

Query:  RMRS----------------------------------------------------------------------------HCAMISDYMT--------EY
         M S                                                                            +C+MIS YM         + 
Subjt:  RMRS----------------------------------------------------------------------------HCAMISDYMT--------EY

Query:  YDTCERS-----------------------------------------------SHLLTFCNTKDGNG------YDGNVCVVTAIIDSYAKSGYLHRARL
        +   ER                                                SH  T    K+ +G      YDGN+ V TAIIDSYAK GYL  AR 
Subjt:  YDTCERS-----------------------------------------------SHLLTFCNTKDGNG------YDGNVCVVTAIIDSYAKSGYLHRARL

Query:  VCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEKFSDA
        V DQ KGRSLI WT+IISAYA +G ANVALS FYE+LT GI+PD V FT V  +CAHSGELDEAWKIFN+LLP+YGIQPL+EHYACMVGVLS A K SDA
Subjt:  VCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAEKFSDA

Query:  VDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
        V+FISKMP+EP  KVW ALLNGASVAGDVELGK VFD L + EP NTGN +IM NLYS  GRWK++
Subjt:  VDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS

A0A6J1CWN9 pentatricopeptide repeat-containing protein At2g37310 | 1.1e-88 | 40.65
Query:  QISVPASALYPWVLQAIRRNDGMNYGAYGRFIQH----------------VPASSSSASATASRSSCSMFCRS---REFPRIEGHRLLIKIWQPWRCLQC
        QIS+PA A+ PW LQAIRR DGMNY AYGR IQH                +   S +          + + +S   R+   + G+     I+  W  L  
Subjt:  QISVPASALYPWVLQAIRRNDGMNYGAYGRFIQH----------------VPASSSSASATASRSSCSMFCRS---REFPRIEGHRLLIKIWQPWRCLQC

Query:  VRYTFHNMQSDMLKLFLSLFNLISMDAKPDKFKFIV-------SFFDEGLSLSFLC------LCSD---TRGVISW--------LAKIVFDRMPARDIVF
        + YT HNM SDMLKLF SL N  +MD KPDKF           SF D  L+    C      L SD      ++++        LA+IVF RMP RDIV 
Subjt:  VRYTFHNMQSDMLKLFLSLFNLISMDAKPDKFKFIV-------SFFDEGLSLSFLC------LCSD---TRGVISW--------LAKIVFDRMPARDIVF

Query:  ---------------ECDKCLRRMRSHCAMISDYMT--EYYDTCERSSHLL------------------TFCNT--------------------------
                       EC +  + M S   +  + +T       C +S+ L+                  + CN                           
Subjt:  ---------------ECDKCLRRMRSHCAMISDYMT--EYYDTCERSSHLL------------------TFCNT--------------------------

Query:  ----------------------------------------------KDG---------------------------------------------NGYDGN
                                                      +DG                                             NGY+GN
Subjt:  ----------------------------------------------KDG---------------------------------------------NGYDGN

Query:  VCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLPEYGIQ
        + V TAIIDSYAKSGYLH A  V D  KGRSLIIWTAIISAYAA+G ANVALS FYE+L NGI+PD V FT V  +CAHSGELDEAWKIFN++LPEYGIQ
Subjt:  VCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLPEYGIQ

Query:  PLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
        PL+EHYACMVGVLS A K SDAVDFISKMPIEP+ KVW ALLNGASVAGDVELGK VFD LL+ EPENTG  IIM NLYS  GRWK++
Subjt:  PLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS

A0A6J1F110 pentatricopeptide repeat-containing protein At2g37310 | 2.8e-81 | 80.41
Query:  NGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLL
        N YDGN+ V TAIIDSYAKSGYL  AR V DQ K RSLIIWTAIISAYAA+G AN  LS FYE+LTNGIRPD V FT V  +CAHSGELDEAWKIFNVLL
Subjt:  NGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLL

Query:  PEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
        PE+GIQPL+EHYACMVGVLS A K SDAV+FISKMPIEPT KVW ALLNGASVAGDVELGK VFD LLD EPENTGN IIM NLYS FGRWK++
Subjt:  PEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS

A0A6J1J0S5 pentatricopeptide repeat-containing protein At2g37310 | 1.5e-79 | 79.38
Query:  NGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLL
        N YDGN+ V TAIIDSYAKSGYL  AR V DQ K RSLIIWTAIISAYAA+G AN  LS FYE+LTNGIRPD V FT V  +CAHSGEL+EAWKIFNVLL
Subjt:  NGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLL

Query:  PEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
        PE+GIQPL+EHYACMVGVLS A K SDAV+FISKMPIEPT KVW ALLNGASVAGDVELGK VFD LLD EPENTGN IIM NLYS FG WK++
Subjt:  PEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS

SwissProt top hits | e value | % identity
P0C899 Putative pentatricopeptide repeat-containing protein At3g49142 | 1.7e-35 | 38.83
Query:  NVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAF-TPVCSCAHSGELDEAWKIFNVLLPEYGI
        N+ +  A+ID YAK G L +AR V +  K R ++ WTA+ISAY  +G    A++ F ++  +G+ PDS+AF T + +C+H+G L+E    F ++   Y I
Subjt:  NVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAF-TPVCSCAHSGELDEAWKIFNVLLPEYGI

Query:  QPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK
         P +EH ACMV +L  A K  +A  FI  M +EP  +VW ALL    V  D ++G    D L    PE +G  +++ N+Y+  GRW++
Subjt:  QPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK

Q9FMA1 Pentatricopeptide repeat-containing protein At5g56310 | 2.2e-35 | 37.5
Query:  CNTKDGNGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCS-CAHSGELDEAWK
        C+  D  G +  V +  A+ID YAKSG + +A  V +    R+++ WT II+  A +G    AL+ F  ++  G+RP+ V F  + S C+H G +D   +
Subjt:  CNTKDGNGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCS-CAHSGELDEAWK

Query:  IFNVLLPEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
        +FN +  +YGI P IEHY CM+ +L  A K  +A + I  MP +    +W +LL  ++V  D+ELG+     L+  EP N+GN +++ NLYS  GRW +S
Subjt:  IFNVLLPEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS

Q9LNU6 Pentatricopeptide repeat-containing protein At1g20230 | 3.4e-36 | 36.56
Query:  NVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCS-CAHSGELDEAWKIFNVLLPEYGI
        NV V +A+ID YAK G ++ +++V +    ++L+ W ++++ ++ +G A   +S F  ++   ++PD ++FT + S C   G  DE WK F ++  EYGI
Subjt:  NVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCS-CAHSGELDEAWKIFNVLLPEYGI

Query:  QPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRW
        +P +EHY+CMV +L  A K  +A D I +MP EP + VW ALLN   +  +V+L +   + L   EPEN G  +++ N+Y+  G W
Subjt:  QPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRW

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g23330 | 2.6e-36 | 38.02
Query:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP
        G+  N+ + +A++D Y+K G +  AR + D+      + WTAII  +A +G  + A+S F E+   G++P+ VAF  V  +C+H G +DEAW  FN +  
Subjt:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP

Query:  EYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK
         YG+   +EHYA +  +L  A K  +A +FISKM +EPT  VW  LL+  SV  ++EL + V + +   + EN G  ++M N+Y+  GRWK+
Subjt:  EYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK

Q9ZUT5 Pentatricopeptide repeat-containing protein At2g37310 | 4.0e-53 | 55.15
Query:  NGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSC-AHSGELDEAWKIFNVLL
        NG D N+ V T+IID+YAK G+L  A+ V D  K RSLI WTAII+AYA +G ++ A S F ++   G +PD V  T V S  AHSG+ D A  IF+ +L
Subjt:  NGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSC-AHSGELDEAWKIFNVLL

Query:  PEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
         +Y I+P +EHYACMV VLS A K SDA++FISKMPI+P  KVW ALLNGASV GD+E+ +   D L + EPENTGN  IM NLY+  GRW+++
Subjt:  PEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS

Arabidopsis top hits | e value | % identity
AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein | 2.4e-37 | 36.56
Query:  NVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCS-CAHSGELDEAWKIFNVLLPEYGI
        NV V +A+ID YAK G ++ +++V +    ++L+ W ++++ ++ +G A   +S F  ++   ++PD ++FT + S C   G  DE WK F ++  EYGI
Subjt:  NVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCS-CAHSGELDEAWKIFNVLLPEYGI

Query:  QPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRW
        +P +EHY+CMV +L  A K  +A D I +MP EP + VW ALLN   +  +V+L +   + L   EPEN G  +++ N+Y+  G W
Subjt:  QPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRW

AT2G37310.1 Pentatricopeptide repeat (PPR) superfamily protein | 2.9e-54 | 55.15
Query:  NGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSC-AHSGELDEAWKIFNVLL
        NG D N+ V T+IID+YAK G+L  A+ V D  K RSLI WTAII+AYA +G ++ A S F ++   G +PD V  T V S  AHSG+ D A  IF+ +L
Subjt:  NGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSC-AHSGELDEAWKIFNVLL

Query:  PEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
         +Y I+P +EHYACMV VLS A K SDA++FISKMPI+P  KVW ALLNGASV GD+E+ +   D L + EPENTGN  IM NLY+  GRW+++
Subjt:  PEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 1.9e-37 | 38.02
Query:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP
        G+  N+ + +A++D Y+K G +  AR + D+      + WTAII  +A +G  + A+S F E+   G++P+ VAF  V  +C+H G +DEAW  FN +  
Subjt:  GYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLP

Query:  EYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK
         YG+   +EHYA +  +L  A K  +A +FISKM +EPT  VW  LL+  SV  ++EL + V + +   + EN G  ++M N+Y+  GRWK+
Subjt:  EYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK

AT3G49142.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 1.2e-36 | 38.83
Query:  NVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAF-TPVCSCAHSGELDEAWKIFNVLLPEYGI
        N+ +  A+ID YAK G L +AR V +  K R ++ WTA+ISAY  +G    A++ F ++  +G+ PDS+AF T + +C+H+G L+E    F ++   Y I
Subjt:  NVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAF-TPVCSCAHSGELDEAWKIFNVLLPEYGI

Query:  QPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK
         P +EH ACMV +L  A K  +A  FI  M +EP  +VW ALL    V  D ++G    D L    PE +G  +++ N+Y+  GRW++
Subjt:  QPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKK

AT5G56310.1 Pentatricopeptide repeat (PPR) superfamily protein | 1.6e-36 | 37.5
Query:  CNTKDGNGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCS-CAHSGELDEAWK
        C+  D  G +  V +  A+ID YAKSG + +A  V +    R+++ WT II+  A +G    AL+ F  ++  G+RP+ V F  + S C+H G +D   +
Subjt:  CNTKDGNGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCS-CAHSGELDEAWK

Query:  IFNVLLPEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS
        +FN +  +YGI P IEHY CM+ +L  A K  +A + I  MP +    +W +LL  ++V  D+ELG+     L+  EP N+GN +++ NLYS  GRW +S
Subjt:  IFNVLLPEYGIQPLIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKS


Sequences
CDS sequence
ATGACGAATGCGAAGCCCACTAGCCTTCAAATCTCAGTTCCCGCCAGCGCCCTTTATCCATGGGTTCTGCAAGCAATCCGCCGCAACGACGGGATGAACTACGGCGCCTA
TGGCCGCTTCATCCAGCACGTACCGGCTTCCTCTTCGTCCGCCTCGGCAACTGCTTCGCGCTCGTCTTGTTCTATGTTCTGTCGCTCCCGAGAATTTCCTCGGATCGAAG
GTCATCGCCTTCTAATCAAAATCTGGCAGCCTTGGAGATGCCTACAATGTGTTCGTTACACTTTTCACAATATGCAATCTGATATGCTGAAGCTGTTTTTATCTTTGTTT
AATTTGATATCGATGGATGCGAAGCCTGATAAGTTTAAGTTCATTGTTTCATTCTTCGACGAGGGCTTGAGTCTCAGCTTTTTGTGTCTATGCTCTGATACTCGAGGTGT
AATTAGCTGGTTAGCGAAAATTGTGTTTGATAGAATGCCTGCGAGAGATATTGTCTTCGAATGCGATAAATGCCTGAGAAGGATGAGGTCACATTGCGCGATGATATCGG
ACTACATGACCGAATACTATGACACTTGCGAGCGTTCTTCCCACCTTCTCACATTTTGCAACACCAAGGATGGAAACGGTTACGATGGGAATGTTTGTGTTGTCACTGCT
ATCATTGATTCTTATGCTAAGTCTGGTTACCTCCACAGGGCACGACTGGTTTGTGATCAATTTAAAGGTAGGAGTCTAATCATTTGGACAGCAATAATTTCAGCCTATGC
TGCCAATGGAGGTGCTAACGTGGCTCTGAGTTTTTTTTATGAGATTCTGACAAATGGGATTCGGCCTGATTCAGTAGCCTTTACTCCAGTGTGTTCCTGTGCCCATTCAG
GGGAGTTAGATGAAGCCTGGAAGATATTTAACGTCTTGTTACCAGAGTATGGAATTCAACCACTAATCGAGCACTATGCTTGCATGGTAGGAGTTCTTAGTCTAGCAGAA
AAGTTCTCCGATGCTGTTGACTTTATTTCTAAAATGCCAATTGAACCCACTACAAAAGTTTGGGATGCTTTGCTCAATGGGGCTTCCGTTGCTGGTGATGTTGAGCTTGG
AAAGTGCGTTTTTGATAGTCTGCTTGACACCGAGCCTGAAAATACAGGTAACAATATCATCATGGATAACTTATATTCACTGTTTGGAAGGTGGAAAAAATCTGGCTAG
mRNA sequence
ATGACGAATGCGAAGCCCACTAGCCTTCAAATCTCAGTTCCCGCCAGCGCCCTTTATCCATGGGTTCTGCAAGCAATCCGCCGCAACGACGGGATGAACTACGGCGCCTA
TGGCCGCTTCATCCAGCACGTACCGGCTTCCTCTTCGTCCGCCTCGGCAACTGCTTCGCGCTCGTCTTGTTCTATGTTCTGTCGCTCCCGAGAATTTCCTCGGATCGAAG
GTCATCGCCTTCTAATCAAAATCTGGCAGCCTTGGAGATGCCTACAATGTGTTCGTTACACTTTTCACAATATGCAATCTGATATGCTGAAGCTGTTTTTATCTTTGTTT
AATTTGATATCGATGGATGCGAAGCCTGATAAGTTTAAGTTCATTGTTTCATTCTTCGACGAGGGCTTGAGTCTCAGCTTTTTGTGTCTATGCTCTGATACTCGAGGTGT
AATTAGCTGGTTAGCGAAAATTGTGTTTGATAGAATGCCTGCGAGAGATATTGTCTTCGAATGCGATAAATGCCTGAGAAGGATGAGGTCACATTGCGCGATGATATCGG
ACTACATGACCGAATACTATGACACTTGCGAGCGTTCTTCCCACCTTCTCACATTTTGCAACACCAAGGATGGAAACGGTTACGATGGGAATGTTTGTGTTGTCACTGCT
ATCATTGATTCTTATGCTAAGTCTGGTTACCTCCACAGGGCACGACTGGTTTGTGATCAATTTAAAGGTAGGAGTCTAATCATTTGGACAGCAATAATTTCAGCCTATGC
TGCCAATGGAGGTGCTAACGTGGCTCTGAGTTTTTTTTATGAGATTCTGACAAATGGGATTCGGCCTGATTCAGTAGCCTTTACTCCAGTGTGTTCCTGTGCCCATTCAG
GGGAGTTAGATGAAGCCTGGAAGATATTTAACGTCTTGTTACCAGAGTATGGAATTCAACCACTAATCGAGCACTATGCTTGCATGGTAGGAGTTCTTAGTCTAGCAGAA
AAGTTCTCCGATGCTGTTGACTTTATTTCTAAAATGCCAATTGAACCCACTACAAAAGTTTGGGATGCTTTGCTCAATGGGGCTTCCGTTGCTGGTGATGTTGAGCTTGG
AAAGTGCGTTTTTGATAGTCTGCTTGACACCGAGCCTGAAAATACAGGTAACAATATCATCATGGATAACTTATATTCACTGTTTGGAAGGTGGAAAAAATCTGGCTAG
Protein sequence
MTNAKPTSLQISVPASALYPWVLQAIRRNDGMNYGAYGRFIQHVPASSSSASATASRSSCSMFCRSREFPRIEGHRLLIKIWQPWRCLQCVRYTFHNMQSDMLKLFLSLF
NLISMDAKPDKFKFIVSFFDEGLSLSFLCLCSDTRGVISWLAKIVFDRMPARDIVFECDKCLRRMRSHCAMISDYMTEYYDTCERSSHLLTFCNTKDGNGYDGNVCVVTA
IIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIISAYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLSLAE
KFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKSG
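
The CDS and protein sequences listed above can be checked against each other by translating the nucleotides with the standard genetic code. The following is a minimal sketch of that check, not the database's own pipeline. The helper name `translate` is hypothetical, unrecognized codons are reported as `X` by assumption, and only the first 24 nucleotides of the CDS are used for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 24 nt of the CDS sequence listed above.
print(translate("ATGACGAATGCGAAGCCCACTAGC"))  # MTNAKPTS
```

The printed peptide matches the first eight residues of the protein sequence. Passing the full CDS (with line breaks included) should give the complete protein if the two records are consistent.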