; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000207 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000207
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr4:1467242..1471456
RNA-Seq ExpressionLag0000207
SyntenyLag0000207
Gene Ontology termsGO:0043248 - proteasome assembly (biological process)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019538 - 26S proteasome non-ATPase regulatory subunit 5


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039243.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]1.2e-3589.77Show/hide
Query:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDL
        MQD NLKPDLVTYINLVGCYGKAGMIEG+K+IYSQLKYGEIE NKSLFYAIIN FRS NRYDLVQ+VTQEMKFSLDSEVYSESE D+L
Subjt:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDL

XP_004141647.3 pentatricopeptide repeat-containing protein At1g73710 [Cucumis sativus]9.6e-3384.09Show/hide
Query:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDL
        M+D NLKPDLVTYINLVGCYGKAGMIEG+K+IYSQLKYGEIE NKSLF+AIIN FRS +RYDLVQ+V QEMKFSLDSEV+SESE D+L
Subjt:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDL

XP_008459651.1 PREDICTED: pentatricopeptide repeat-containing protein At1g73710 [Cucumis melo]1.2e-3589.77Show/hide
Query:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDL
        MQD NLKPDLVTYINLVGCYGKAGMIEG+K+IYSQLKYGEIE NKSLFYAIIN FRS NRYDLVQ+VTQEMKFSLDSEVYSESE D+L
Subjt:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDL

XP_022149701.1 pentatricopeptide repeat-containing protein At1g73710 [Momordica charantia]2.7e-3588.51Show/hide
Query:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDD
        MQD NLKPDLVTYINLVGCYGKAG+IEG+KR+YSQLKYGEIEPNKSLFYAI NAF S NRYDLVQ+VTQEMKF+LDSEVYSESE DD
Subjt:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDD

XP_038890049.1 pentatricopeptide repeat-containing protein At1g73710 [Benincasa hispida]2.7e-3588.64Show/hide
Query:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDL
        MQD NLKPDLVTYINLVGCYGKAGMIEG+K+IY+QLKYGEIE NKSLFYAIINAFRS +RYDLVQ+VTQEMKFSLDSEVYSESE D+L
Subjt:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDL

TrEMBL top hitse value%identityAlignment
A0A0A0KUW2 PPR_long domain-containing protein4.6e-3384.09Show/hide
Query:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDL
        M+D NLKPDLVTYINLVGCYGKAGMIEG+K+IYSQLKYGEIE NKSLF+AIIN FRS +RYDLVQ+V QEMKFSLDSEV+SESE D+L
Subjt:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDL

A0A1S3CAP2 pentatricopeptide repeat-containing protein At1g737105.8e-3689.77Show/hide
Query:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDL
        MQD NLKPDLVTYINLVGCYGKAGMIEG+K+IYSQLKYGEIE NKSLFYAIIN FRS NRYDLVQ+VTQEMKFSLDSEVYSESE D+L
Subjt:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDL

A0A5D3BQP5 Pentatricopeptide repeat-containing protein5.8e-3689.77Show/hide
Query:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDL
        MQD NLKPDLVTYINLVGCYGKAGMIEG+K+IYSQLKYGEIE NKSLFYAIIN FRS NRYDLVQ+VTQEMKFSLDSEVYSESE D+L
Subjt:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDL

A0A6J1D965 pentatricopeptide repeat-containing protein At1g737101.3e-3588.51Show/hide
Query:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDD
        MQD NLKPDLVTYINLVGCYGKAG+IEG+KR+YSQLKYGEIEPNKSLFYAI NAF S NRYDLVQ+VTQEMKF+LDSEVYSESE DD
Subjt:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDD

A0A6J1HNW8 pentatricopeptide repeat-containing protein At1g737101.6e-3077.27Show/hide
Query:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDL
        MQD NL PDLVTY++LV CYGKAGMIEGM R+YSQLKYGEIEP+KSLFYAIINA R+ NRYDLVQ+V QEM+FSL SE++S++E DDL
Subjt:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDL

SwissProt top hitse value%identityAlignment
Q9C9U0 Pentatricopeptide repeat-containing protein At1g737102.0e-1747.83Show/hide
Query:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSE-----VYSESEHDD
        MQ+  L+PD+VT   LVG YGKAGM+EG+KR++S+L +GE+EP++SLF A+ +A+ S NR DL  +V +EM  + ++E        E E DD
Subjt:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSE-----VYSESEHDD

Arabidopsis top hitse value%identityAlignment
AT1G73710.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-1847.83Show/hide
Query:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSE-----VYSESEHDD
        MQ+  L+PD+VT   LVG YGKAGM+EG+KR++S+L +GE+EP++SLF A+ +A+ S NR DL  +V +EM  + ++E        E E DD
Subjt:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSE-----VYSESEHDD

AT2G31400.1 genomes uncoupled 11.2e-0427.78Show/hide
Query:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMK
        M    +K D+VTY  L+G YGK G  + +K++++++K   + PN   +  +I+ +     Y     + +E K
Subjt:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMK

AT3G15180.1 ARM repeat superfamily protein4.3e-1548.91Show/hide
Query:  GATLLLSSFPTCVKHVINAAFDRHEHGKQLAAMHALGNIFGETRSENDIILNDNAEENLLVV---------------LFLVVLQQDSEIRFA
        GA L+LS+ P   +HV+ +AFDR+ HGKQLAA+HAL NI GETR +++ I++  AEE+L  +               LFL VLQQ SEIR A
Subjt:  GATLLLSSFPTCVKHVINAAFDRHEHGKQLAAMHALGNIFGETRSENDIILNDNAEENLLVV---------------LFLVVLQQDSEIRFA

AT3G15180.2 ARM repeat superfamily protein4.3e-1548.91Show/hide
Query:  GATLLLSSFPTCVKHVINAAFDRHEHGKQLAAMHALGNIFGETRSENDIILNDNAEENLLVV---------------LFLVVLQQDSEIRFA
        GA L+LS+ P   +HV+ +AFDR+ HGKQLAA+HAL NI GETR +++ I++  AEE+L  +               LFL VLQQ SEIR A
Subjt:  GATLLLSSFPTCVKHVINAAFDRHEHGKQLAAMHALGNIFGETRSENDIILNDNAEENLLVV---------------LFLVVLQQDSEIRFA

AT5G28370.1 Pentatricopeptide repeat (PPR) superfamily protein5.3e-0521.43Show/hide
Query:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDLFMKI--------
        M+    KPD +TY  L+  +GK    E ++R+  Q++   ++P  + + A+I+A+ SV   D      + +K   D  ++S+   + +   I        
Subjt:  MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDLFMKI--------

Query:  ---PRVFDIPKQCLTRSFRLSYDFSNA-HSCCSGKWGATLLLSSFPTCVKHVIN
            +   + ++   +  R + +  NA   C + K     LL      V+H++N
Subjt:  ---PRVFDIPKQCLTRSFRLSYDFSNA-HSCCSGKWGATLLLSSFPTCVKHVIN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGGATTGGAATCTCAAGCCAGATCTGGTAACTTATATAAATTTGGTAGGTTGTTATGGTAAAGCTGGTATGATTGAAGGTATGAAGCGAATATACAGCCAGCTAAA
ATATGGAGAGATAGAGCCCAACAAATCATTGTTCTATGCAATCATAAATGCATTTAGAAGTGTCAATAGATATGACCTCGTCCAAATAGTCACCCAAGAAATGAAATTTT
CTTTGGACTCAGAAGTGTACTCTGAATCTGAGCACGATGATCTTTTTATGAAGATTCCCCGGGTCTTTGACATCCCAAAACAATGCTTGACTCGTTCTTTTAGGTTGAGT
TATGATTTTAGTAATGCACATTCTTGTTGTTCAGGCAAATGGGGAGCCACTTTACTGCTGTCAAGTTTTCCGACTTGTGTGAAGCATGTAATTAATGCAGCTTTTGATCG
GCATGAACATGGTAAACAGCTAGCAGCTATGCATGCTCTTGGTAACATCTTTGGAGAAACTCGATCTGAGAATGATATTATTCTGAATGATAATGCAGAAGAAAATTTAC
TAGTTGTCCTTTTTCTAGTTGTCCTTCAACAGGATTCTGAGATTCGCTTCGCGACAAGTGAGCAACAGCGAGGAGGCAATGGCGAAGACGAGGAGGCGGCGGCGACCAAG
GCACGACAAGGAGGTTGGCGATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGGATTGGAATCTCAAGCCAGATCTGGTAACTTATATAAATTTGGTAGGTTGTTATGGTAAAGCTGGTATGATTGAAGGTATGAAGCGAATATACAGCCAGCTAAA
ATATGGAGAGATAGAGCCCAACAAATCATTGTTCTATGCAATCATAAATGCATTTAGAAGTGTCAATAGATATGACCTCGTCCAAATAGTCACCCAAGAAATGAAATTTT
CTTTGGACTCAGAAGTGTACTCTGAATCTGAGCACGATGATCTTTTTATGAAGATTCCCCGGGTCTTTGACATCCCAAAACAATGCTTGACTCGTTCTTTTAGGTTGAGT
TATGATTTTAGTAATGCACATTCTTGTTGTTCAGGCAAATGGGGAGCCACTTTACTGCTGTCAAGTTTTCCGACTTGTGTGAAGCATGTAATTAATGCAGCTTTTGATCG
GCATGAACATGGTAAACAGCTAGCAGCTATGCATGCTCTTGGTAACATCTTTGGAGAAACTCGATCTGAGAATGATATTATTCTGAATGATAATGCAGAAGAAAATTTAC
TAGTTGTCCTTTTTCTAGTTGTCCTTCAACAGGATTCTGAGATTCGCTTCGCGACAAGTGAGCAACAGCGAGGAGGCAATGGCGAAGACGAGGAGGCGGCGGCGACCAAG
GCACGACAAGGAGGTTGGCGATGA
Protein sequenceShow/hide protein sequence
MQDWNLKPDLVTYINLVGCYGKAGMIEGMKRIYSQLKYGEIEPNKSLFYAIINAFRSVNRYDLVQIVTQEMKFSLDSEVYSESEHDDLFMKIPRVFDIPKQCLTRSFRLS
YDFSNAHSCCSGKWGATLLLSSFPTCVKHVINAAFDRHEHGKQLAAMHALGNIFGETRSENDIILNDNAEENLLVVLFLVVLQQDSEIRFATSEQQRGGNGEDEEAAATK
ARQGGWR