; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0009697 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0009697
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr9:41523838..41524251
RNA-Seq ExpressionLag0009697
SyntenyLag0009697
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604761.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]8.1e-3570.91Show/hide
Query:  VDIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMD
        +DIISWNIL+E+CAKT +Y KV+A+FK ML+LQIYPDNYTF+SLL VC KLCNLALGSSVHG+ +K+GS C D FVCNLLI MYGKCGSI CALK+   D
Subjt:  VDIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMD

Query:  SSNLRPWITW
            R  ITW
Subjt:  SSNLRPWITW

XP_008456417.1 PREDICTED: pentatricopeptide repeat-containing protein At3g58590 [Cucumis melo]1.1e-3470.64Show/hide
Query:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS
        D+ISWNILIEACAK ++Y KV+ +FK ML+ QIYPDNYTF SLL VC KLCNLALGSS+HG+ +K+GSG CD FVCNLLIDMYGKCGSI CALK+   D 
Subjt:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS

Query:  SNLRPWITW
           R  ITW
Subjt:  SNLRPWITW

XP_022947134.1 pentatricopeptide repeat-containing protein At3g58590 [Cucurbita moschata]8.1e-3570.91Show/hide
Query:  VDIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMD
        +DIISWNIL+E+CAKT +Y KV+A+FK ML+LQIYPDNYTF+SLL VC KLCNLALGSSVHG+ +K+GS C D FVCNLLI MYGKCGSI CALK+   D
Subjt:  VDIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMD

Query:  SSNLRPWITW
            R  ITW
Subjt:  SSNLRPWITW

XP_023532810.1 pentatricopeptide repeat-containing protein At3g58590 [Cucurbita pepo subsp. pepo]8.1e-3570.91Show/hide
Query:  VDIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMD
        +DIISWNIL+E+CAKT +Y KV+A+FK ML+LQIYPDNYTF+SLL VC KLCNLALGSSVHG+ +K+GS C D FVCNLLI MYGKCGSI CALK+   D
Subjt:  VDIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMD

Query:  SSNLRPWITW
            R  ITW
Subjt:  SSNLRPWITW

XP_038902940.1 pentatricopeptide repeat-containing protein At3g58590 [Benincasa hispida]5.6e-3672.48Show/hide
Query:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS
        D+ISWNILIEACAK D+Y KV+ +FK ML+LQIYPDNYTF+SLL VC KL NLALGSSVHG+ +K+G GCCD FVCNLLIDMYGKCGSI CALK+   D 
Subjt:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS

Query:  SNLRPWITW
           R  ITW
Subjt:  SNLRPWITW

TrEMBL top hitse value%identityAlignment
A0A1S3C2S6 pentatricopeptide repeat-containing protein At3g585905.1e-3570.64Show/hide
Query:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS
        D+ISWNILIEACAK ++Y KV+ +FK ML+ QIYPDNYTF SLL VC KLCNLALGSS+HG+ +K+GSG CD FVCNLLIDMYGKCGSI CALK+   D 
Subjt:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS

Query:  SNLRPWITW
           R  ITW
Subjt:  SNLRPWITW

A0A5A7UHJ9 Pentatricopeptide repeat-containing protein5.1e-3570.64Show/hide
Query:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS
        D+ISWNILIEACAK ++Y KV+ +FK ML+ QIYPDNYTF SLL VC KLCNLALGSS+HG+ +K+GSG CD FVCNLLIDMYGKCGSI CALK+   D 
Subjt:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS

Query:  SNLRPWITW
           R  ITW
Subjt:  SNLRPWITW

A0A6J1C0G3 pentatricopeptide repeat-containing protein At3g58590-like8.7e-3569.72Show/hide
Query:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS
        DI+SWNILIEACAKT +Y+K + +FK MLMLQIYPDNYTF+SLL VC KLCNLALGSSVHG+ +K+   C D F+CNLLIDMYGKCGSI CALK+   D 
Subjt:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS

Query:  SNLRPWITW
           R  ITW
Subjt:  SNLRPWITW

A0A6J1G5W7 pentatricopeptide repeat-containing protein At3g585903.9e-3570.91Show/hide
Query:  VDIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMD
        +DIISWNIL+E+CAKT +Y KV+A+FK ML+LQIYPDNYTF+SLL VC KLCNLALGSSVHG+ +K+GS C D FVCNLLI MYGKCGSI CALK+   D
Subjt:  VDIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMD

Query:  SSNLRPWITW
            R  ITW
Subjt:  SSNLRPWITW

A0A6J1I2K3 pentatricopeptide repeat-containing protein At3g585902.5e-3471.3Show/hide
Query:  IISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDSS
        IISWNIL+E+CAKT +Y KV+A+FK ML+LQIYPDNYTF+SLL VC KLCNLALGSSVHG+ +K+GS C D FVCNLLI MYGKCGSI CALK+   D  
Subjt:  IISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDSS

Query:  NLRPWITW
          R  ITW
Subjt:  NLRPWITW

SwissProt top hitse value%identityAlignment
Q0WN01 Pentatricopeptide repeat-containing protein At3g585902.4e-2652.29Show/hide
Query:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS
        D +SWNI I AC+++D + +V+ +FK ML   I PD YTFVS+L +C+KLC+L LGSS+HGL  K+   C D FVCN+LIDMYGKCGSI   +K+   + 
Subjt:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS

Query:  SNLRPWITW
        +  +  ITW
Subjt:  SNLRPWITW

Q9LFI1 Pentatricopeptide repeat-containing protein At3g53360, mitochondrial1.1e-1334.44Show/hide
Query:  SYLVSW--LTSLILTNSLLS----CS---------------VDIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSS
        SY++ W  L  L + NSLL+    CS                D +SWN ++ AC + +  ++++ +FK ML+ +  PD+ T  +LLR C ++ +L LGS 
Subjt:  SYLVSW--LTSLILTNSLLS----CS---------------VDIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSS

Query:  VHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKL-NHMDSSNLRPWIT
        VH  ++K+G    + F+ N LIDMY KCGS+  A ++ + MD+ ++  W T
Subjt:  VHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKL-NHMDSSNLRPWIT

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g233303.2e-1032.11Show/hide
Query:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS
        D ISWN L+    +   Y + + +F+ M+  ++ P    F S++  C  L  L LG  +HG  ++ G G  + F+ + L+DMY KCG+I  A K+   D 
Subjt:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS

Query:  SNLRPWITW
         N+   ++W
Subjt:  SNLRPWITW

Q9SII7 Pentatricopeptide repeat-containing protein At2g172109.3e-1032.73Show/hide
Query:  DIISWNILIEACAKTDDYLKVVAVFKFML-MLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMD
        D+ISW+++I +  ++ + +  + +FK M+   +  PD  T  S+L+ CT + ++ +G SVHG +++ G    D FVCN LIDMY K   +  A ++   D
Subjt:  DIISWNILIEACAKTDDYLKVVAVFKFML-MLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMD

Query:  SSNLRPWITW
         +  R  ++W
Subjt:  SSNLRPWITW

Q9SS60 Pentatricopeptide repeat-containing protein At3g035803.4e-1234.86Show/hide
Query:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS
        D +SWN +I    ++ D ++ + +FK M++++   D+ T++ L+ V T+L +L  G  +H   +KSG  C D  V N LIDMY KCG +  +LK+    S
Subjt:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS

Query:  SNLRPWITW
              +TW
Subjt:  SNLRPWITW

Arabidopsis top hitse value%identityAlignment
AT2G17210.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.6e-1132.73Show/hide
Query:  DIISWNILIEACAKTDDYLKVVAVFKFML-MLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMD
        D+ISW+++I +  ++ + +  + +FK M+   +  PD  T  S+L+ CT + ++ +G SVHG +++ G    D FVCN LIDMY K   +  A ++   D
Subjt:  DIISWNILIEACAKTDDYLKVVAVFKFML-MLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMD

Query:  SSNLRPWITW
         +  R  ++W
Subjt:  SSNLRPWITW

AT3G03580.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-1334.86Show/hide
Query:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS
        D +SWN +I    ++ D ++ + +FK M++++   D+ T++ L+ V T+L +L  G  +H   +KSG  C D  V N LIDMY KCG +  +LK+    S
Subjt:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS

Query:  SNLRPWITW
              +TW
Subjt:  SNLRPWITW

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.3e-1132.11Show/hide
Query:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS
        D ISWN L+    +   Y + + +F+ M+  ++ P    F S++  C  L  L LG  +HG  ++ G G  + F+ + L+DMY KCG+I  A K+   D 
Subjt:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS

Query:  SNLRPWITW
         N+   ++W
Subjt:  SNLRPWITW

AT3G53360.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.6e-1534.44Show/hide
Query:  SYLVSW--LTSLILTNSLLS----CS---------------VDIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSS
        SY++ W  L  L + NSLL+    CS                D +SWN ++ AC + +  ++++ +FK ML+ +  PD+ T  +LLR C ++ +L LGS 
Subjt:  SYLVSW--LTSLILTNSLLS----CS---------------VDIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSS

Query:  VHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKL-NHMDSSNLRPWIT
        VH  ++K+G    + F+ N LIDMY KCGS+  A ++ + MD+ ++  W T
Subjt:  VHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKL-NHMDSSNLRPWIT

AT3G58590.1 Pentatricopeptide repeat (PPR) superfamily protein1.7e-2752.29Show/hide
Query:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS
        D +SWNI I AC+++D + +V+ +FK ML   I PD YTFVS+L +C+KLC+L LGSS+HGL  K+   C D FVCN+LIDMYGKCGSI   +K+   + 
Subjt:  DIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLLIDMYGKCGSIVCALKLNHMDS

Query:  SNLRPWITW
        +  +  ITW
Subjt:  SNLRPWITW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAACACGGTCTCATATCTGGTGTCTTGGCTTACGTCTCTGATTCTAACAAACAGCCTTCTGTCGTGCTCTGTAGACATTATATCTTGGAATATTTTGATT
GAAGCTTGTGCTAAAACGGATGATTATCTCAAAGTTGTAGCAGTTTTCAAATTCATGCTTATGCTCCAAATCTACCCAGATAACTATACATTCGTTTCCCTTCTA
AGAGTTTGCACTAAACTGTGCAATCTTGCTCTGGGCAGTTCAGTTCATGGCCTTGCGATGAAGTCTGGTTCAGGTTGTTGTGATGCATTTGTATGCAATTTGCTA
ATTGACATGTATGGAAAATGTGGAAGCATTGTATGTGCTTTGAAACTTAATCACATGGACAGTTCTAATCTCCGTCCTTGGATTACATGGCCATGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCAAACACGGTCTCATATCTGGTGTCTTGGCTTACGTCTCTGATTCTAACAAACAGCCTTCTGTCGTGCTCTGTAGACATTATATCTTGGAATATTTTGATT
GAAGCTTGTGCTAAAACGGATGATTATCTCAAAGTTGTAGCAGTTTTCAAATTCATGCTTATGCTCCAAATCTACCCAGATAACTATACATTCGTTTCCCTTCTA
AGAGTTTGCACTAAACTGTGCAATCTTGCTCTGGGCAGTTCAGTTCATGGCCTTGCGATGAAGTCTGGTTCAGGTTGTTGTGATGCATTTGTATGCAATTTGCTA
ATTGACATGTATGGAAAATGTGGAAGCATTGTATGTGCTTTGAAACTTAATCACATGGACAGTTCTAATCTCCGTCCTTGGATTACATGGCCATGCTAG
Protein sequenceShow/hide protein sequence
MPNTVSYLVSWLTSLILTNSLLSCSVDIISWNILIEACAKTDDYLKVVAVFKFMLMLQIYPDNYTFVSLLRVCTKLCNLALGSSVHGLAMKSGSGCCDAFVCNLL
IDMYGKCGSIVCALKLNHMDSSNLRPWITWPC