; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0033154 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0033154
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr11:41267850..41269479
RNA-Seq ExpressionLag0033154
SyntenyLag0033154
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579064.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]5.8e-2071.79Show/hide
Query:  GLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLRFSSLFGLF
        G L+ GSSLH SIIKSF LS+H+NGV I NSLISMYERCGKL DAVKVFDEMPTRDTVSWNA  IG  +R    +  F
Subjt:  GLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLRFSSLFGLF

KAG7030242.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.0e-2461.47Show/hide
Query:  IQPEPVQRNLFDIRPQPCGPQGFFYRFVEEKGLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLL
        I  +P+ RNL +IRPQP     F        G L+ GSSLH SIIKSF LS+H+NGV I NSLISMYERCGKL DAVKVFDEMPTRDTVSWNA  IG  +
Subjt:  IQPEPVQRNLFDIRPQPCGPQGFFYRFVEEKGLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLL

Query:  RFSSLFGLF
        R    +  F
Subjt:  RFSSLFGLF

XP_022999024.1 pentatricopeptide repeat-containing protein At3g05340 [Cucurbita maxima]5.8e-2071.79Show/hide
Query:  GLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLRFSSLFGLF
        G L+ GSSLH SIIKSF LS+H+NGV I NSLISMYERCGKL DAVKVFDEMPTRDTVSWNA  IG  +R    +  F
Subjt:  GLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLRFSSLFGLF

XP_023545246.1 pentatricopeptide repeat-containing protein At3g05340-like isoform X1 [Cucurbita pepo subsp. pepo]5.8e-2071.79Show/hide
Query:  GLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLRFSSLFGLF
        G L+ GSSLH SIIKSF LS+H+NGV I NSLISMYERCGKL DAVKVFDEMPTRDTVSWNA  IG  +R    +  F
Subjt:  GLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLRFSSLFGLF

XP_023546071.1 pentatricopeptide repeat-containing protein At3g05340 isoform X1 [Cucurbita pepo subsp. pepo]5.8e-2071.79Show/hide
Query:  GLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLRFSSLFGLF
        G L+ GSSLH SIIKSF LS+H+NGV I NSLISMYERCGKL DAVKVFDEMPTRDTVSWNA  IG  +R    +  F
Subjt:  GLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLRFSSLFGLF

TrEMBL top hitse value%identityAlignment
A0A6J1D9D8 pentatricopeptide repeat-containing protein At3g053408.1e-2062.07Show/hide
Query:  FYRFVEEKGLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLRFSSLFGLFF
        F      + LLH GSSLH SIIKSF L++H+NGV IANSLISMY+RCGKL DAVKVFDEM  RDTVSWNA   G +    S  G  +
Subjt:  FYRFVEEKGLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLRFSSLFGLFF

A0A6J1G350 pentatricopeptide repeat-containing protein At3g05340 isoform X33.6e-2078.57Show/hide
Query:  GLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLR
        G L+ GSSLH SIIKSF LS+H+NGV I NSLISMYERCGKL DAVKVFDEMPTRDTVSWNA  IG  +R
Subjt:  GLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLR

A0A6J1G3A2 pentatricopeptide repeat-containing protein At3g05340 isoform X13.6e-2078.57Show/hide
Query:  GLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLR
        G L+ GSSLH SIIKSF LS+H+NGV I NSLISMYERCGKL DAVKVFDEMPTRDTVSWNA  IG  +R
Subjt:  GLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLR

A0A6J1G3C0 pentatricopeptide repeat-containing protein At3g05340 isoform X23.6e-2078.57Show/hide
Query:  GLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLR
        G L+ GSSLH SIIKSF LS+H+NGV I NSLISMYERCGKL DAVKVFDEMPTRDTVSWNA  IG  +R
Subjt:  GLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLR

A0A6J1KFW9 pentatricopeptide repeat-containing protein At3g053402.8e-2071.79Show/hide
Query:  GLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLRFSSLFGLF
        G L+ GSSLH SIIKSF LS+H+NGV I NSLISMYERCGKL DAVKVFDEMPTRDTVSWNA  IG  +R    +  F
Subjt:  GLLHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLRFSSLFGLF

SwissProt top hitse value%identityAlignment
O49619 Pentatricopeptide repeat-containing protein At4g35130, chloroplastic3.4e-0745.33Show/hide
Query:  LHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLL---RFSSL
        L  G  +H  +IK   +S     V++ NSLIS+Y + G  +DA KVF+EMP RD VSWN+   G L     FSSL
Subjt:  LHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLL---RFSSL

Q9FGL1 Putative pentatricopeptide repeat-containing protein At5g474605.8e-0737.97Show/hide
Query:  GSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLRFSSLFGLFFLHSNP
        G+ +H  ++K   L      V + N LI MY +CG + DAV VF  M  +DTVSWNA             GL+F H  P
Subjt:  GSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLRFSSLFGLFFLHSNP

Q9FM64 Pentatricopeptide repeat-containing protein At5g55740, chloroplastic3.1e-0846.77Show/hide
Query:  FGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIG
        FG  +H  ++K    S  ++ VF+A+SL  MY +CG L DA KVFDE+P R+ V+WNA  +G
Subjt:  FGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIG

Q9MA85 Pentatricopeptide repeat-containing protein At3g053404.3e-1043.21Show/hide
Query:  CGPQGFFYRFVEEKGLLHFGSSLHVSIIKS------FALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWN
        CG +G+F          H G  LH SIIK+           H N + + NSL+S+Y +CGKL DA+K+FDEMP RD +S N
Subjt:  CGPQGFFYRFVEEKGLLHFGSSLHVSIIKS------FALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWN

Q9S7F4 Putative pentatricopeptide repeat-containing protein At2g015101.5e-0748.33Show/hide
Query:  LHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNA
        L  G  LH  II+S  L +    VF  + L+ MY +CG + DAV+VF+EMP R+ VSWNA
Subjt:  LHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNA

Arabidopsis top hitse value%identityAlignment
AT3G02010.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-0848.33Show/hide
Query:  LHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNA
        L  G  LH  II+S  L +    VF  + L+ MY +CG + DAV+VF+EMP R+ VSWNA
Subjt:  LHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNA

AT3G05340.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.1e-1143.21Show/hide
Query:  CGPQGFFYRFVEEKGLLHFGSSLHVSIIKS------FALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWN
        CG +G+F          H G  LH SIIK+           H N + + NSL+S+Y +CGKL DA+K+FDEMP RD +S N
Subjt:  CGPQGFFYRFVEEKGLLHFGSSLHVSIIKS------FALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWN

AT4G35130.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-0845.33Show/hide
Query:  LHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLL---RFSSL
        L  G  +H  +IK   +S     V++ NSLIS+Y + G  +DA KVF+EMP RD VSWN+   G L     FSSL
Subjt:  LHFGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLL---RFSSL

AT5G47460.1 Pentatricopeptide repeat (PPR) superfamily protein4.1e-0837.97Show/hide
Query:  GSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLRFSSLFGLFFLHSNP
        G+ +H  ++K   L      V + N LI MY +CG + DAV VF  M  +DTVSWNA             GL+F H  P
Subjt:  GSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLRFSSLFGLFFLHSNP

AT5G55740.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.2e-0946.77Show/hide
Query:  FGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIG
        FG  +H  ++K    S  ++ VF+A+SL  MY +CG L DA KVFDE+P R+ V+WNA  +G
Subjt:  FGSSLHVSIIKSFALSSHDNGVFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTTTCTCCCTCATTCACCGTTCGTCTTCCGTTCGCTTCTTCTTCTTCTCCGTTCGTCATCCTCTTCGTCTTCAAATCCTTGCATTTCAAGCGACGGCTCTGGCGAC
TTCTTTAGGCAGCTCGACCATAGCCACTGGTCTCCTTCTCCTCTCCGACTTCATCCAACCGGAGCCAGTTCAACGAAACCTCTTCGACATTCGTCCTCAACCATGTGGAC
CCCAAGGTTTCTTCTATCGATTTGTGGAAGAGAAGGGGCTCCTCCATTTTGGCTCTTCCCTCCATGTCTCCATCATCAAGAGCTTCGCGCTCTCAAGCCATGATAATGGG
GTCTTCATAGCGAACTCCCTCATCTCCATGTACGAGAGGTGCGGTAAATTGTTTGATGCAGTCAAGGTGTTCGATGAAATGCCCACAAGAGATACTGTTTCGTGGAACGC
TTCCCCCATTGGCGATTTGCTCCGATTCTCATCTCTATTTGGACTTTTCTTTCTTCATTCAAACCCCTACGACTTGTCCACTTCTCATCCAGTGAAAGAAATACCAAATT
TCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCGTTTCTCCCTCATTCACCGTTCGTCTTCCGTTCGCTTCTTCTTCTTCTCCGTTCGTCATCCTCTTCGTCTTCAAATCCTTGCATTTCAAGCGACGGCTCTGGCGAC
TTCTTTAGGCAGCTCGACCATAGCCACTGGTCTCCTTCTCCTCTCCGACTTCATCCAACCGGAGCCAGTTCAACGAAACCTCTTCGACATTCGTCCTCAACCATGTGGAC
CCCAAGGTTTCTTCTATCGATTTGTGGAAGAGAAGGGGCTCCTCCATTTTGGCTCTTCCCTCCATGTCTCCATCATCAAGAGCTTCGCGCTCTCAAGCCATGATAATGGG
GTCTTCATAGCGAACTCCCTCATCTCCATGTACGAGAGGTGCGGTAAATTGTTTGATGCAGTCAAGGTGTTCGATGAAATGCCCACAAGAGATACTGTTTCGTGGAACGC
TTCCCCCATTGGCGATTTGCTCCGATTCTCATCTCTATTTGGACTTTTCTTTCTTCATTCAAACCCCTACGACTTGTCCACTTCTCATCCAGTGAAAGAAATACCAAATT
TCTAA
Protein sequenceShow/hide protein sequence
MRFSLIHRSSSVRFFFFSVRHPLRLQILAFQATALATSLGSSTIATGLLLLSDFIQPEPVQRNLFDIRPQPCGPQGFFYRFVEEKGLLHFGSSLHVSIIKSFALSSHDNG
VFIANSLISMYERCGKLFDAVKVFDEMPTRDTVSWNASPIGDLLRFSSLFGLFFLHSNPYDLSTSHPVKEIPNF