; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022153 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022153
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr7:19769119..19770136
RNA-Seq ExpressionLag0022153
SyntenyLag0022153
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152191.1 pentatricopeptide repeat-containing protein At3g62890 [Momordica charantia]4.0e-3865.47Show/hide
Query:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------
        SE+EGKQVH+H+LKLGFDSDVYVQNTLINLFSVCSNMTDARR+FDESSVLDSVSWNSILAG                                       
Subjt:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------

Query:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG
            LFDEMPEKDMVTWSALIACYEQNEM EEAMRTFVG
Subjt:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG

XP_022934101.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucurbita moschata]1.4e-3864.75Show/hide
Query:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------
        SEWEGKQVH+HV+KLGFDSDVYVQNTLIN FSVCSNM+DARRVFDES+VLDSVSWNSILAG                                       
Subjt:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------

Query:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG
            LFDEMPE+DMVTWSALIACYEQNEMFEEAMRTFVG
Subjt:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG

XP_023526339.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 [Cucurbita pepo subsp. pepo]8.9e-3864.03Show/hide
Query:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------
        SEWEGKQVH+HV+KLGFDSDVYVQNTLIN FSVCS M+DARRVFDESSVLDSVSWNSILAG                                       
Subjt:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------

Query:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG
            LFDEMPE+DMVTWSALIACYEQNEMFEEAMRTF G
Subjt:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG

XP_023526340.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X2 [Cucurbita pepo subsp. pepo]8.9e-3864.03Show/hide
Query:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------
        SEWEGKQVH+HV+KLGFDSDVYVQNTLIN FSVCS M+DARRVFDESSVLDSVSWNSILAG                                       
Subjt:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------

Query:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG
            LFDEMPE+DMVTWSALIACYEQNEMFEEAMRTF G
Subjt:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG

XP_023526341.1 pentatricopeptide repeat-containing protein At3g62890-like isoform X3 [Cucurbita pepo subsp. pepo]8.9e-3864.03Show/hide
Query:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------
        SEWEGKQVH+HV+KLGFDSDVYVQNTLIN FSVCS M+DARRVFDESSVLDSVSWNSILAG                                       
Subjt:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------

Query:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG
            LFDEMPE+DMVTWSALIACYEQNEMFEEAMRTF G
Subjt:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG

TrEMBL top hitse value%identityAlignment
A0A0A0L8Y7 DYW_deaminase domain-containing protein9.9e-3561.15Show/hide
Query:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------
        SEWE KQVH+HVLKLGFDSDVYV+NTLIN FSVCSNMTDA RVF+ESSVLDSVSWNSILAG                                       
Subjt:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------

Query:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG
            LFDEM EKDMVTWSALIAC++QNEM+EEA+RTFVG
Subjt:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG

A0A1S4E1G8 pentatricopeptide repeat-containing protein At3g62890-like2.0e-3561.15Show/hide
Query:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------
        SEWE KQVH+HVLKLGFDSDVYV+NTLIN FSVCSNMTDARRVFDE+S+LDSVSWNSILAG                                       
Subjt:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------

Query:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG
            LFD M EKDMVTWSALIAC++QNEMFEEA+RTFVG
Subjt:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG

A0A6J1DFB5 pentatricopeptide repeat-containing protein At3g628901.9e-3865.47Show/hide
Query:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------
        SE+EGKQVH+H+LKLGFDSDVYVQNTLINLFSVCSNMTDARR+FDESSVLDSVSWNSILAG                                       
Subjt:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------

Query:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG
            LFDEMPEKDMVTWSALIACYEQNEM EEAMRTFVG
Subjt:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG

A0A6J1F0X4 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like6.6e-3964.75Show/hide
Query:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------
        SEWEGKQVH+HV+KLGFDSDVYVQNTLIN FSVCSNM+DARRVFDES+VLDSVSWNSILAG                                       
Subjt:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------

Query:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG
            LFDEMPE+DMVTWSALIACYEQNEMFEEAMRTFVG
Subjt:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG

A0A6J1J842 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like2.1e-3764.03Show/hide
Query:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------
        SE EGKQVH+HV+KLGFDSDVYVQNTLIN FSVCSNM+DARRVFDESSVLDSVSWNSIL+G                                       
Subjt:  SEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG---------------------------------------

Query:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG
            LFDEMPE+DMVTWSALIACYEQNEMFEEAMRTFVG
Subjt:  ----LFDEMPEKDMVTWSALIACYEQNEMFEEAMRTFVG

SwissProt top hitse value%identityAlignment
O23169 Pentatricopeptide repeat-containing protein At4g371702.1e-1337Show/hide
Query:  EGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG------------LFDEMPEKDMVTWSALIACYEQNEMFEEAM
        EGK+VH+H+   GF   + + N L+ +++ C ++ DAR+VFDE    D  SWN ++ G            LFDEM EKD  +W+A++  Y + +  EEA+
Subjt:  EGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG------------LFDEMPEKDMVTWSALIACYEQNEMFEEAM

Q683I9 Pentatricopeptide repeat-containing protein At3g628909.3e-1433.88Show/hide
Query:  FAKLPWVFEGVFHPSSEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSIL-----AG-------LFDEMPEKDMVT
        F   P++     +P     G++ H  +L  G D D +V+ +L+N++S C ++  A+RVFD+S   D  +WNS++     AG       LFDEMPE+++++
Subjt:  FAKLPWVFEGVFHPSSEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSIL-----AG-------LFDEMPEKDMVT

Query:  WSALIACYEQNEMFEEAMRTF
        WS LI  Y     ++EA+  F
Subjt:  WSALIACYEQNEMFEEAMRTF

Q9FG16 Pentatricopeptide repeat-containing protein At5g065402.9e-1535.59Show/hide
Query:  FEGVFHPSSEWE----GKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG------------LFDEMPEKDMVTWSA
        F  +   SSE E    G+Q H  +++ GF +DVYV+N+L+++++ C  +  A R+F +    D VSW S++AG            +FDEMP +++ TWS 
Subjt:  FEGVFHPSSEWE----GKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG------------LFDEMPEKDMVTWSA

Query:  LIACYEQNEMFEEAMRTF
        +I  Y +N  FE+A+  F
Subjt:  LIACYEQNEMFEEAMRTF

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.7e-1534.92Show/hide
Query:  PWVFEGVFHPSSEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG------------LFDEMPEKDMVTWSAL
        P+V +      +  EG+Q+H HVLKLG D D+YV  +LI+++     + DA +VFD+S   D VS+ +++ G            LFDE+P KD+V+W+A+
Subjt:  PWVFEGVFHPSSEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG------------LFDEMPEKDMVTWSAL

Query:  IACYEQNEMFEEAMRTFVGCIMLELR
        I+ Y +   ++EA+  F   +   +R
Subjt:  IACYEQNEMFEEAMRTFVGCIMLELR

Q9LS72 Pentatricopeptide repeat-containing protein At3g292305.4e-1438.61Show/hide
Query:  KQVHDHVLKLGFDSDVYVQNTLINLFSVCSNM--TDARRVFDESSVLDSVSWNSILAG------------LFDEMPEKDMVTWSALI----ACYEQNEMF
        K +H+H+ KLG  SD+YV N LI+ +S C  +   DA ++F++ S  D+VSWNS+L G            LFDEMP++D+++W+ ++     C E ++ F
Subjt:  KQVHDHVLKLGFDSDVYVQNTLINLFSVCSNM--TDARRVFDESSVLDSVSWNSILAG------------LFDEMPEKDMVTWSALI----ACYEQNEMF

Query:  E
        E
Subjt:  E

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-1634.92Show/hide
Query:  PWVFEGVFHPSSEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG------------LFDEMPEKDMVTWSAL
        P+V +      +  EG+Q+H HVLKLG D D+YV  +LI+++     + DA +VFD+S   D VS+ +++ G            LFDE+P KD+V+W+A+
Subjt:  PWVFEGVFHPSSEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG------------LFDEMPEKDMVTWSAL

Query:  IACYEQNEMFEEAMRTFVGCIMLELR
        I+ Y +   ++EA+  F   +   +R
Subjt:  IACYEQNEMFEEAMRTFVGCIMLELR

AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.9e-1538.61Show/hide
Query:  KQVHDHVLKLGFDSDVYVQNTLINLFSVCSNM--TDARRVFDESSVLDSVSWNSILAG------------LFDEMPEKDMVTWSALI----ACYEQNEMF
        K +H+H+ KLG  SD+YV N LI+ +S C  +   DA ++F++ S  D+VSWNS+L G            LFDEMP++D+++W+ ++     C E ++ F
Subjt:  KQVHDHVLKLGFDSDVYVQNTLINLFSVCSNM--TDARRVFDESSVLDSVSWNSILAG------------LFDEMPEKDMVTWSALI----ACYEQNEMF

Query:  E
        E
Subjt:  E

AT3G62890.1 Pentatricopeptide repeat (PPR) superfamily protein6.6e-1533.88Show/hide
Query:  FAKLPWVFEGVFHPSSEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSIL-----AG-------LFDEMPEKDMVT
        F   P++     +P     G++ H  +L  G D D +V+ +L+N++S C ++  A+RVFD+S   D  +WNS++     AG       LFDEMPE+++++
Subjt:  FAKLPWVFEGVFHPSSEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSIL-----AG-------LFDEMPEKDMVT

Query:  WSALIACYEQNEMFEEAMRTF
        WS LI  Y     ++EA+  F
Subjt:  WSALIACYEQNEMFEEAMRTF

AT4G37170.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-1437Show/hide
Query:  EGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG------------LFDEMPEKDMVTWSALIACYEQNEMFEEAM
        EGK+VH+H+   GF   + + N L+ +++ C ++ DAR+VFDE    D  SWN ++ G            LFDEM EKD  +W+A++  Y + +  EEA+
Subjt:  EGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG------------LFDEMPEKDMVTWSALIACYEQNEMFEEAM

AT5G06540.1 Pentatricopeptide repeat (PPR) superfamily protein2.0e-1635.59Show/hide
Query:  FEGVFHPSSEWE----GKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG------------LFDEMPEKDMVTWSA
        F  +   SSE E    G+Q H  +++ GF +DVYV+N+L+++++ C  +  A R+F +    D VSW S++AG            +FDEMP +++ TWS 
Subjt:  FEGVFHPSSEWE----GKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAG------------LFDEMPEKDMVTWSA

Query:  LIACYEQNEMFEEAMRTF
        +I  Y +N  FE+A+  F
Subjt:  LIACYEQNEMFEEAMRTF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATCGAACCTATTAAGAAAGCTCCCCGTAGTGGCCACTTTCTATGTTTCAGACACCAATTGTTTTGCGAAACTGCCGTGGGTGTTTGAAGGCGTGTTCCATCCGTC
GTCCGAATGGGAAGGAAAACAGGTACATGATCATGTTTTGAAGTTGGGTTTTGATTCAGATGTTTATGTTCAGAATACTTTGATTAATTTATTTTCTGTTTGTTCGAATA
TGACTGATGCCCGCCGGGTGTTTGATGAAAGTTCTGTTTTGGATTCAGTGTCATGGAATTCAATTTTGGCTGGATTGTTTGATGAAATGCCAGAGAAAGATATGGTCACA
TGGAGTGCTCTAATTGCTTGCTATGAGCAGAATGAGATGTTCGAGGAGGCTATGAGAACATTTGTGGGATGCATAATGTTGGAGTTACGGTGGATGAGGTTGTGGCAGTT
AGTGCTCTTTCTGCTTGTGCAAGCTTACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCATCGAACCTATTAAGAAAGCTCCCCGTAGTGGCCACTTTCTATGTTTCAGACACCAATTGTTTTGCGAAACTGCCGTGGGTGTTTGAAGGCGTGTTCCATCCGTC
GTCCGAATGGGAAGGAAAACAGGTACATGATCATGTTTTGAAGTTGGGTTTTGATTCAGATGTTTATGTTCAGAATACTTTGATTAATTTATTTTCTGTTTGTTCGAATA
TGACTGATGCCCGCCGGGTGTTTGATGAAAGTTCTGTTTTGGATTCAGTGTCATGGAATTCAATTTTGGCTGGATTGTTTGATGAAATGCCAGAGAAAGATATGGTCACA
TGGAGTGCTCTAATTGCTTGCTATGAGCAGAATGAGATGTTCGAGGAGGCTATGAGAACATTTGTGGGATGCATAATGTTGGAGTTACGGTGGATGAGGTTGTGGCAGTT
AGTGCTCTTTCTGCTTGTGCAAGCTTACTGA
Protein sequenceShow/hide protein sequence
MSSNLLRKLPVVATFYVSDTNCFAKLPWVFEGVFHPSSEWEGKQVHDHVLKLGFDSDVYVQNTLINLFSVCSNMTDARRVFDESSVLDSVSWNSILAGLFDEMPEKDMVT
WSALIACYEQNEMFEEAMRTFVGCIMLELRWMRLWQLVLFLLVQAY