; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g03810 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g03810
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr4:2416500..2418762
RNA-Seq ExpressionMoc04g03810
SyntenyMoc04g03810
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044190 - Protein THYLAKOID ASSEMBLY 8-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572189.1 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]4.2e-3883.33Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA  PS +TI S P+ LL SG K S LRLGR E YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALK
        RELLRQNEC LALK
Subjt:  RELLRQNECLLALK

XP_022135810.1 pentatricopeptide repeat-containing protein At1g62350-like [Momordica charantia]1.8e-52100Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALK
        RELLRQNECLLALK
Subjt:  RELLRQNECLLALK

XP_022952712.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita moschata]4.2e-3883.33Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA  PS +TI S P+ LL SG K S LRLGR E YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALK
        RELLRQNEC LALK
Subjt:  RELLRQNECLLALK

XP_023554588.1 protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Cucurbita pepo subsp. pepo]8.5e-3984.21Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA  PSP TI S P+ LL SG K S LRLGR E YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALK
        RELLRQNEC LALK
Subjt:  RELLRQNECLLALK

XP_038888189.1 protein THYLAKOID ASSEMBLY 8, chloroplastic-like [Benincasa hispida]1.3e-3984.21Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FL  PPSP TI S  +KLLSSG ++SCL LGRAE YR+VTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALK
        RELLRQNEC LALK
Subjt:  RELLRQNECLLALK

TrEMBL top hitse value%identityAlignment
A0A5A7UZI6 Pentatricopeptide repeat-containing protein6.6e-3781.58Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FL   PSP TI S P KL SS  K  CL+LGRAE Y RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDM+AVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALK
        RELLRQNEC LALK
Subjt:  RELLRQNECLLALK

A0A5D3CCT4 Pentatricopeptide repeat-containing protein6.6e-3781.58Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FL   PSP TI S P KL SS  K  CL+LGRAE Y RVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDM+AVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALK
        RELLRQNEC LALK
Subjt:  RELLRQNECLLALK

A0A6J1C1T5 pentatricopeptide repeat-containing protein At1g62350-like8.6e-53100Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALK
        RELLRQNECLLALK
Subjt:  RELLRQNECLLALK

A0A6J1GL50 protein THYLAKOID ASSEMBLY 8-like, chloroplastic2.1e-3883.33Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA  PS +TI S P+ LL SG K S LRLGR E YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALK
        RELLRQNEC LALK
Subjt:  RELLRQNECLLALK

A0A6J1HX85 protein THYLAKOID ASSEMBLY 8-like, chloroplastic2.1e-3883.33Show/hide
Query:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL
        M+FLA  PS +TI S P+ LL SG K S LRLGR E YRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR K DLQQLDRVYDSKI RLLKFDMMAVL
Subjt:  MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVL

Query:  RELLRQNECLLALK
        RELLRQNEC LALK
Subjt:  RELLRQNECLLALK

SwissProt top hitse value%identityAlignment
Q9LVW6 Protein THYLAKOID ASSEMBLY 8, chloroplastic9.0e-0750.72Show/hide
Query:  GGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLAL
        G  +NR PL KGR LS EAIQ++QSLKR       L       + RL+K D+++VLRELLRQ+ C LA+
Subjt:  GGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLAL

Q9STF9 Protein THYLAKOID ASSEMBLY 8-like, chloroplastic4.9e-0545.45Show/hide
Query:  RKPLQKGRNL-SIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALK
        R PL +G+ L   EA+  +  LKR+K D ++LD+   + + RLLK DM+AV+ EL RQ E  LA+K
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALK

Arabidopsis top hitse value%identityAlignment
AT1G62350.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-0442.86Show/hide
Query:  LSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALK
        +S E + A + LKR++    +LDR   S +SRLLK D+++VL E  RQN+  L +K
Subjt:  LSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALK

AT3G27750.1 FUNCTIONS IN: molecular_function unknown6.4e-0850.72Show/hide
Query:  GGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLAL
        G  +NR PL KGR LS EAIQ++QSLKR       L       + RL+K D+++VLRELLRQ+ C LA+
Subjt:  GGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLAL

AT3G46870.1 Pentatricopeptide repeat (PPR) superfamily protein3.5e-0645.45Show/hide
Query:  RKPLQKGRNL-SIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALK
        R PL +G+ L   EA+  +  LKR+K D ++LD+   + + RLLK DM+AV+ EL RQ E  LA+K
Subjt:  RKPLQKGRNL-SIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALK

AT5G09320.1 Vacuolar sorting protein 9 (VPS9) domain9.8e-1759.55Show/hide
Query:  VTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR---------------VKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALK
        + MR  S+NRKPLQ+GR LSIEAIQAVQ+LKR                 +    LDRV  SK  RLLKFDM+AVLRELLRQNEC LALK
Subjt:  VTMRGGSENRKPLQKGRNLSIEAIQAVQSLKR---------------VKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECLLALK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTTCTTAGCAATTCCTCCGTCGCCGACGACGATTTTCAGTTCGCCGCACAAGTTACTGAGCTCCGGCGAGAAATCATCTTGCCTCCGACTAGGTAGGGCGGAGGA
ATATCGGAGAGTGACAATGAGAGGCGGAAGCGAGAACCGGAAGCCGTTGCAGAAGGGGAGGAACCTCAGCATCGAAGCAATTCAAGCGGTACAGTCGTTGAAACGAGTCA
AGAACGATCTACAACAATTGGACCGAGTGTATGATTCCAAAATTAGCCGCTTATTGAAGTTCGATATGATGGCTGTCCTTCGCGAGCTGCTGCGCCAGAACGAGTGTCTT
TTGGCTCTCAAGCAAACGCTGCCTCTGACATCTCAAAGTGTTCTTGGGGAAAATTGCACCACCCACCATAATCAGAGGATAAGCCATAATTGGGAGGACAGAAGTCCGTC
GCTGTTAGAATCACGGTCGGGCTTCCTTGCAAGCACCATAAGATGTGGTCAACACATCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATTTCTTAGCAATTCCTCCGTCGCCGACGACGATTTTCAGTTCGCCGCACAAGTTACTGAGCTCCGGCGAGAAATCATCTTGCCTCCGACTAGGTAGGGCGGAGGA
ATATCGGAGAGTGACAATGAGAGGCGGAAGCGAGAACCGGAAGCCGTTGCAGAAGGGGAGGAACCTCAGCATCGAAGCAATTCAAGCGGTACAGTCGTTGAAACGAGTCA
AGAACGATCTACAACAATTGGACCGAGTGTATGATTCCAAAATTAGCCGCTTATTGAAGTTCGATATGATGGCTGTCCTTCGCGAGCTGCTGCGCCAGAACGAGTGTCTT
TTGGCTCTCAAGCAAACGCTGCCTCTGACATCTCAAAGTGTTCTTGGGGAAAATTGCACCACCCACCATAATCAGAGGATAAGCCATAATTGGGAGGACAGAAGTCCGTC
GCTGTTAGAATCACGGTCGGGCTTCCTTGCAAGCACCATAAGATGTGGTCAACACATCTAA
Protein sequenceShow/hide protein sequence
MNFLAIPPSPTTIFSSPHKLLSSGEKSSCLRLGRAEEYRRVTMRGGSENRKPLQKGRNLSIEAIQAVQSLKRVKNDLQQLDRVYDSKISRLLKFDMMAVLRELLRQNECL
LALKQTLPLTSQSVLGENCTTHHNQRISHNWEDRSPSLLESRSGFLASTIRCGQHI