; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg18102 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg18102
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCarg_Chr11:11644592..11645563
RNA-Seq ExpressionCarg18102
SyntenyCarg18102
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589065.1 hypothetical protein SDJN03_17630, partial [Cucurbita argyrosperma subsp. sororia]4.1e-11283.02Show/hide
Query:  MNKSSPERAFNNEFLGNISVADDSRSSVRVTERRWESMLPAARKGGGISESDSEPRNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSS
        MNKSSPERAFNNEFLGNISVADDSRSSVRVTERRWESMLPAARKGGGISESDSEPRNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSS
Subjt:  MNKSSPERAFNNEFLGNISVADDSRSSVRVTERRWESMLPAARKGGGISESDSEPRNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSS

Query:  RAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIFETSVFDDVRKEHWYKPQWIIRTKTDVVHDIDGFNALLRALVSNNL
        RAHS    YRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIFETSVFDDVRKEHWYKPQ                            
Subjt:  RAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIFETSVFDDVRKEHWYKPQWIIRTKTDVVHDIDGFNALLRALVSNNL

Query:  GELVIESYYLMKEDCHRRTGIIGRGYWFKNVKQDAQKIYGESLEFLEEEDPQLQPYLCIELLIGL
                     DCHRRTGIIGRGYWFKNVKQDAQKIYGESLEFLEEEDPQLQPYLCIELLIGL
Subjt:  GELVIESYYLMKEDCHRRTGIIGRGYWFKNVKQDAQKIYGESLEFLEEEDPQLQPYLCIELLIGL

KAG6589074.1 hypothetical protein SDJN03_17639, partial [Cucurbita argyrosperma subsp. sororia]3.6e-10880.75Show/hide
Query:  MNKSSPERAFNNEFLGNISVADDSRSSVRVTERRWESMLPAARKGGGISESDSEPRNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSS
        MNKSSPERAFNNEFLGN SVADDS SSVRVTERRWESMLPAARKGGGISESDSEPRNLSIEAIQAVQSLKRAKED QQLDRVYDSEVKALIEVRYDGCSS
Subjt:  MNKSSPERAFNNEFLGNISVADDSRSSVRVTERRWESMLPAARKGGGISESDSEPRNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSS

Query:  RAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIFETSVFDDVRKEHWYKPQWIIRTKTDVVHDIDGFNALLRALVSNNL
        RAHS    +RNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIFETSVF+DVRKEHWYKPQ                            
Subjt:  RAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIFETSVFDDVRKEHWYKPQWIIRTKTDVVHDIDGFNALLRALVSNNL

Query:  GELVIESYYLMKEDCHRRTGIIGRGYWFKNVKQDAQKIYGESLEFLEEEDPQLQPYLCIELLIGL
                     DC RRTGIIGRGYWFKNVKQDAQKIYGESLEFLEEEDPQLQPYLCIELLIGL
Subjt:  GELVIESYYLMKEDCHRRTGIIGRGYWFKNVKQDAQKIYGESLEFLEEEDPQLQPYLCIELLIGL

KAG7022781.1 hypothetical protein SDJN02_16517, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-146100Show/hide
Query:  MNKSSPERAFNNEFLGNISVADDSRSSVRVTERRWESMLPAARKGGGISESDSEPRNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSS
        MNKSSPERAFNNEFLGNISVADDSRSSVRVTERRWESMLPAARKGGGISESDSEPRNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSS
Subjt:  MNKSSPERAFNNEFLGNISVADDSRSSVRVTERRWESMLPAARKGGGISESDSEPRNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSS

Query:  RAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIFETSVFDDVRKEHWYKPQWIIRTKTDVVHDIDGFNALLRALVSNNL
        RAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIFETSVFDDVRKEHWYKPQWIIRTKTDVVHDIDGFNALLRALVSNNL
Subjt:  RAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIFETSVFDDVRKEHWYKPQWIIRTKTDVVHDIDGFNALLRALVSNNL

Query:  GELVIESYYLMKEDCHRRTGIIGRGYWFKNVKQDAQKIYGESLEFLEEEDPQLQPYLCIELLIGL
        GELVIESYYLMKEDCHRRTGIIGRGYWFKNVKQDAQKIYGESLEFLEEEDPQLQPYLCIELLIGL
Subjt:  GELVIESYYLMKEDCHRRTGIIGRGYWFKNVKQDAQKIYGESLEFLEEEDPQLQPYLCIELLIGL

XP_031744771.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucumis sativus]1.2e-2639.57Show/hide
Query:  RNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSSRAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIF
        RNLSIEAIQAVQSLKR K+DLQQLDRVYDS+++ L++                        D   +L+     +E                 C   L +F
Subjt:  RNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSSRAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIF

Query:  ETSVFDDVRKEHWYKPQ------------------------WIIRTKTDVVHDIDGFNALLRALVSNNLGELVIESYYLMKE-DCHRRTGII--------
        E     DVRKEHWYKPQ                          ++ + D+  +IDGFNALL+ALVS+NLGEL +ESYYLMK+  C               
Subjt:  ETSVFDDVRKEHWYKPQ------------------------WIIRTKTDVVHDIDGFNALLRALVSNNLGELVIESYYLMKE-DCHRRTGII--------

Query:  --GRGYWFKNVKQDAQKIYGESLEFLEEED
          G     + VKQDAQ++YGESLEFLEEE+
Subjt:  --GRGYWFKNVKQDAQKIYGESLEFLEEED

XP_038888189.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Benincasa hispida]1.8e-2740.43Show/hide
Query:  RNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSSRAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIF
        RNLSIEAIQAVQSLKR K+DLQQLDRVYDS+++ L++                        D   +L+     +E                 C   L +F
Subjt:  RNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSSRAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIF

Query:  ETSVFDDVRKEHWYKPQ------------------------WIIRTKTDVVHDIDGFNALLRALVSNNLGELVIESYYLMKE-DCHRRTGII--------
        E     DVR EHWYKPQ                          ++ +TD+  +IDGFNALL+ALVS+NLGEL +ESYYLMKE  C               
Subjt:  ETSVFDDVRKEHWYKPQ------------------------WIIRTKTDVVHDIDGFNALLRALVSNNLGELVIESYYLMKE-DCHRRTGII--------

Query:  --GRGYWFKNVKQDAQKIYGESLEFLEEED
          G     + VKQDAQK+YGESLEFLEEE+
Subjt:  --GRGYWFKNVKQDAQKIYGESLEFLEEED

TrEMBL top hitse value%identityAlignment
A0A0A0K6A9 Uncharacterized protein5.8e-2739.57Show/hide
Query:  RNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSSRAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIF
        RNLSIEAIQAVQSLKR K+DLQQLDRVYDS+++ L++                        D   +L+     +E                 C   L +F
Subjt:  RNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSSRAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIF

Query:  ETSVFDDVRKEHWYKPQ------------------------WIIRTKTDVVHDIDGFNALLRALVSNNLGELVIESYYLMKE-DCHRRTGII--------
        E     DVRKEHWYKPQ                          ++ + D+  +IDGFNALL+ALVS+NLGEL +ESYYLMK+  C               
Subjt:  ETSVFDDVRKEHWYKPQ------------------------WIIRTKTDVVHDIDGFNALLRALVSNNLGELVIESYYLMKE-DCHRRTGII--------

Query:  --GRGYWFKNVKQDAQKIYGESLEFLEEED
          G     + VKQDAQ++YGESLEFLEEE+
Subjt:  --GRGYWFKNVKQDAQKIYGESLEFLEEED

A0A1S3BRZ8 pentatricopeptide repeat-containing protein At3g46870-like2.2e-2639.13Show/hide
Query:  RNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSSRAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIF
        RNLSIEAIQAVQSLKR K+DLQQLDRVYDS+++ L++                        D   +L+     +E                 C   L +F
Subjt:  RNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSSRAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIF

Query:  ETSVFDDVRKEHWYKPQ------------------------WIIRTKTDVVHDIDGFNALLRALVSNNLGELVIESYYLMKE-DCH----------RRTG
        E     DVR EHWYKPQ                          ++ +TD+  +IDGFNALL+ALV +NLG+L +ESYYLMKE  C           +   
Subjt:  ETSVFDDVRKEHWYKPQ------------------------WIIRTKTDVVHDIDGFNALLRALVSNNLGELVIESYYLMKE-DCH----------RRTG

Query:  IIGRGYWFKNVKQDAQKIYGESLEFLEEED
        + G     + VKQDAQK+YGESLEFLEE +
Subjt:  IIGRGYWFKNVKQDAQKIYGESLEFLEEED

A0A5A7UZI6 Pentatricopeptide repeat-containing protein2.2e-2639.13Show/hide
Query:  RNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSSRAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIF
        RNLSIEAIQAVQSLKR K+DLQQLDRVYDS+++ L++                        D   +L+     +E                 C   L +F
Subjt:  RNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSSRAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIF

Query:  ETSVFDDVRKEHWYKPQ------------------------WIIRTKTDVVHDIDGFNALLRALVSNNLGELVIESYYLMKE-DCH----------RRTG
        E     DVR EHWYKPQ                          ++ +TD+  +IDGFNALL+ALV +NLG+L +ESYYLMKE  C           +   
Subjt:  ETSVFDDVRKEHWYKPQ------------------------WIIRTKTDVVHDIDGFNALLRALVSNNLGELVIESYYLMKE-DCH----------RRTG

Query:  IIGRGYWFKNVKQDAQKIYGESLEFLEEED
        + G     + VKQDAQK+YGESLEFLEE +
Subjt:  IIGRGYWFKNVKQDAQKIYGESLEFLEEED

A0A5D3CCT4 Pentatricopeptide repeat-containing protein6.4e-2638.7Show/hide
Query:  RNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSSRAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIF
        RNLSIEAIQAVQSLKR K+DLQQLDRVYDS+++ L++                        D   +L+     +E                 C   L +F
Subjt:  RNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSSRAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIF

Query:  ETSVFDDVRKEHWYKPQ------------------------WIIRTKTDVVHDIDGFNALLRALVSNNLGELVIESYYLMKE-DCH----------RRTG
        E     DVR EHWYKPQ                          ++ +TD+  +IDGFNALL+ LV +NLG+L +ESYYLMKE  C           +   
Subjt:  ETSVFDDVRKEHWYKPQ------------------------WIIRTKTDVVHDIDGFNALLRALVSNNLGELVIESYYLMKE-DCH----------RRTG

Query:  IIGRGYWFKNVKQDAQKIYGESLEFLEEED
        + G     + VKQDAQK+YGESLEFLEE +
Subjt:  IIGRGYWFKNVKQDAQKIYGESLEFLEEED

A0A6J1C1T5 pentatricopeptide repeat-containing protein At1g62350-like1.7e-2640.87Show/hide
Query:  RNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSSRAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIF
        RNLSIEAIQAVQSLKR K DLQQLDRVYDS++  L++                        D   +L+     +E                 C+  L +F
Subjt:  RNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSSRAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIF

Query:  ETSVFDDVRKEHWYKPQ------------------------WIIRTKTDVVHDIDGFNALLRALVSNNLGELVIESYYLMK----EDCHRRTGIIGRGYW
        E     DVR EHWYKPQ                          ++ +TD+  +IDGFNALLRALVS NLGEL +ESYYLMK    E       I+ +G  
Subjt:  ETSVFDDVRKEHWYKPQ------------------------WIIRTKTDVVHDIDGFNALLRALVSNNLGELVIESYYLMK----EDCHRRTGIIGRGYW

Query:  -------FKNVKQDAQKIYGESLEFLEEED
                + VKQDAQKIYG+ LEFLEEED
Subjt:  -------FKNVKQDAQKIYGESLEFLEEED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain6.4e-1027.97Show/hide
Query:  RNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSSRAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIF
        R LSIEAIQAVQ+LKRA                 L        S+ + SS  + R                    + IS + + L++ ++     LL   
Subjt:  RNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSSRAHSSERVYRNYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIF

Query:  ETS----VFDDVRKEHWYKPQWIIRTKTDVV--------------------------HDIDGFNALLRALVSNNLGELVIESYYLMKEDCHRR-------
        E S    VF+++RKE+WYKPQ  +R  TD++                           +I+ FN LL  L+++ L +LV++ Y  M+   +         
Subjt:  ETS----VFDDVRKEHWYKPQWIIRTKTDVV--------------------------HDIDGFNALLRALVSNNLGELVIESYYLMKEDCHRR-------

Query:  --TGIIGRGYWFKN--VKQDAQKIYGESLEFLEEED
           G+   G    +  V+QDA + YGESLEF+EE++
Subjt:  --TGIIGRGYWFKN--VKQDAQKIYGESLEFLEEED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAAGAGTTCGCCGGAGAGAGCATTTAATAATGAGTTTCTTGGCAACATCTCCGTCGCCGACGATTCTCGGTCCTCCGTACGAGTTACTGAGCGCCGGTGGGAAAG
CATGTTGCCTGCGGCTAGGAAGGGCGGAGGAATATCGGAGAGTGATAGTGAGCCAAGGAACCTTAGCATCGAAGCGATTCAAGCGGTACAGTCGCTGAAGCGAGCCAAGG
AAGATTTACAACAATTGGACCGAGTGTACGATTCCGAAGTTAAGGCGCTTATTGAAGTTCGATATGATGGCTGTTCTTCGCGAGCTCATTCGTCAGAACGAGTCTACCGG
AATTATTCCATATTGCACGATCAGTATCGATTGTTGAAGAATTATCGCAGCAATAGCGAGATATGGATTTCTGCTTATGGTAAATTCCTTCAATATTCCTTGCTGATATT
TTGTATTACTTTACTGTCGATATTTGAGACTTCGGTTTTCGATGATGTTAGGAAGGAACACTGGTACAAGCCTCAGTGGATTATTCGAACAAAAACTGACGTAGTGCATG
ATATTGACGGGTTTAATGCTCTTTTGAGAGCTTTGGTTAGTAATAATTTAGGTGAACTTGTGATTGAGTCCTATTACTTGATGAAAGAAGATTGTCATAGAAGGACTGGA
ATCATCGGGAGAGGCTACTGGTTTAAGAATGTGAAGCAGGATGCACAAAAGATTTATGGTGAATCGCTCGAGTTTCTAGAGGAAGAAGACCCGCAGCTACAGCCATATCT
ATGCATTGAGCTGCTGATCGGGCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATAAGAGTTCGCCGGAGAGAGCATTTAATAATGAGTTTCTTGGCAACATCTCCGTCGCCGACGATTCTCGGTCCTCCGTACGAGTTACTGAGCGCCGGTGGGAAAG
CATGTTGCCTGCGGCTAGGAAGGGCGGAGGAATATCGGAGAGTGATAGTGAGCCAAGGAACCTTAGCATCGAAGCGATTCAAGCGGTACAGTCGCTGAAGCGAGCCAAGG
AAGATTTACAACAATTGGACCGAGTGTACGATTCCGAAGTTAAGGCGCTTATTGAAGTTCGATATGATGGCTGTTCTTCGCGAGCTCATTCGTCAGAACGAGTCTACCGG
AATTATTCCATATTGCACGATCAGTATCGATTGTTGAAGAATTATCGCAGCAATAGCGAGATATGGATTTCTGCTTATGGTAAATTCCTTCAATATTCCTTGCTGATATT
TTGTATTACTTTACTGTCGATATTTGAGACTTCGGTTTTCGATGATGTTAGGAAGGAACACTGGTACAAGCCTCAGTGGATTATTCGAACAAAAACTGACGTAGTGCATG
ATATTGACGGGTTTAATGCTCTTTTGAGAGCTTTGGTTAGTAATAATTTAGGTGAACTTGTGATTGAGTCCTATTACTTGATGAAAGAAGATTGTCATAGAAGGACTGGA
ATCATCGGGAGAGGCTACTGGTTTAAGAATGTGAAGCAGGATGCACAAAAGATTTATGGTGAATCGCTCGAGTTTCTAGAGGAAGAAGACCCGCAGCTACAGCCATATCT
ATGCATTGAGCTGCTGATCGGGCTTTGA
Protein sequenceShow/hide protein sequence
MNKSSPERAFNNEFLGNISVADDSRSSVRVTERRWESMLPAARKGGGISESDSEPRNLSIEAIQAVQSLKRAKEDLQQLDRVYDSEVKALIEVRYDGCSSRAHSSERVYR
NYSILHDQYRLLKNYRSNSEIWISAYGKFLQYSLLIFCITLLSIFETSVFDDVRKEHWYKPQWIIRTKTDVVHDIDGFNALLRALVSNNLGELVIESYYLMKEDCHRRTG
IIGRGYWFKNVKQDAQKIYGESLEFLEEEDPQLQPYLCIELLIGL