CuGenDBv2

Moc07g00330 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g00330
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: protein FAR1-RELATED SEQUENCE 4-like
Genome location: chr7:164354..169481
RNA-Seq Expression: Moc07g00330
Synteny: Moc07g00330
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
XP_022148135.1 uncharacterized protein LOC111016888 [Momordica charantia] | e value: 6.3e-39 | % identity: 54.35
Query:  GCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFK----------------------------------------------
        GC+GLTG PNDEKLQ +VQSSGTNDV EG+VFD KKELSL+MHLVAMR NFQFK                                              
Subjt:  GCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFK----------------------------------------------

Query:  -------------------------VSRTYRPKDIIQDMRKEYGVNLSYDRAWHSSEEALRLIRGDPASSYGLLPAYGEALKIM
                                 VSRTYRPKDIIQDMRKEYGVNLSYD+AW SSEEALRLIR DPASSYGLL AYGEALKIM
Subjt:  -------------------------VSRTYRPKDIIQDMRKEYGVNLSYDRAWHSSEEALRLIRGDPASSYGLLPAYGEALKIM

XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia] | e value: 1.2e-50 | % identity: 53.74
Query:  EEGHFQAEYGNEEHDNALDYELGSDVKQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFK---
        EEG ++AE+ N+++D+ALD E   DV+QVH EI RDE AV+  GC+GLTG  N E LQLIVQSSGTNDV EG+VFD KKELSL+MHLV MR NFQFK   
Subjt:  EEGHFQAEYGNEEHDNALDYELGSDVKQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFK---

Query:  --------------------------------------------------------------------VSRTYRPKDIIQDMRKEYGVNLSYDRAWHSSE
                                                                            VSRTYRPKDIIQDMRKEYGVNLSYD+AW SSE
Subjt:  --------------------------------------------------------------------VSRTYRPKDIIQDMRKEYGVNLSYDRAWHSSE

Query:  EALRLIRGDPASSYGLLPAYGEALKIM
        EALRLIRGDPASSYGLLP YGEALKIM
Subjt:  EALRLIRGDPASSYGLLPAYGEALKIM

XP_022155970.1 uncharacterized protein LOC111022954 [Momordica charantia] | e value: 1.6e-55 | % identity: 51.64
Query:  NVPGVWNDNEDESGESYDPLAESEEGHFQAEYGNEEHDNALDYELGSDVKQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDN
        N+PG+WNDN+DES ESYD L +SEEG ++AE+ N+++D+A D +   DV+QV  EIRRDE  V   GC+GL G PNDEKLQLIVQSSGTNDV EG VFD 
Subjt:  NVPGVWNDNEDESGESYDPLAESEEGHFQAEYGNEEHDNALDYELGSDVKQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDN

Query:  KKELSLKMHLVAMRKNFQFKVSR-----------------------------------------------------------------------TYRPKD
        KKELSL+ HLVAM  NFQFKV +                                                                       TYRPKD
Subjt:  KKELSLKMHLVAMRKNFQFKVSR-----------------------------------------------------------------------TYRPKD

Query:  IIQDMRKEYGVNLSYDRAWHSSEEALRLIRGDPASSYGLLPAYG
        IIQDMRKEYGVNLSYD+AW S+EEALRLIRGDP +SYGLLPAYG
Subjt:  IIQDMRKEYGVNLSYDRAWHSSEEALRLIRGDPASSYGLLPAYG

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia] | e value: 5.1e-57 | % identity: 83.82
Query:  NVPGVWNDNEDESGESYDPLAESEEGHFQAEYGNEEHDNALDYELGSDVKQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDN
        +VPGVWNDNEDESGESYDPLAES+EGH QAEYGNEEHD+ALD EL  DV+QVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDN
Subjt:  NVPGVWNDNEDESGESYDPLAESEEGHFQAEYGNEEHDNALDYELGSDVKQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDN

Query:  KKELSLKMHLVAMRKNFQFKVSRTYRPKDIIQDMRK
        KKELSLKMHLVAMRKNFQFK+      K++ + + K
Subjt:  KKELSLKMHLVAMRKNFQFKVSRTYRPKDIIQDMRK

XP_022157017.1 uncharacterized protein LOC111023843 [Momordica charantia] | e value: 9.6e-80 | % identity: 67.2
Query:  NVPGVWNDNEDESGESYDPLAESEEGHFQAEYGNEEHDNALDYELGSDVKQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDN
        +VPGVWNDNEDESGESYDPLA SEEGH QAEYGNEEHD+ALD EL  DV+QVHTEIRRDEEAVR PGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDN
Subjt:  NVPGVWNDNEDESGESYDPLAESEEGHFQAEYGNEEHDNALDYELGSDVKQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDN

Query:  KKELSLKMHLVAMRKNFQFK-----------------------------------------------------------------------VSRTYRPKD
        KKELSLKMHLVAMRKNFQFK                                                                       VSRTYRPKD
Subjt:  KKELSLKMHLVAMRKNFQFK-----------------------------------------------------------------------VSRTYRPKD

Query:  IIQDMRKEYGVNLSYDRAWHSSEEALRLIRGDPASSYGLLPAYGEALKIM
        IIQDMRKEYGVNLSYDRAW SSEEALRLIRGDPASSYGLLPAYG+ALKIM
Subjt:  IIQDMRKEYGVNLSYDRAWHSSEEALRLIRGDPASSYGLLPAYGEALKIM

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A6J1D234 uncharacterized protein LOC111016888 | e value: 3.0e-39 | % identity: 54.35
Query:  GCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFK----------------------------------------------
        GC+GLTG PNDEKLQ +VQSSGTNDV EG+VFD KKELSL+MHLVAMR NFQFK                                              
Subjt:  GCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFK----------------------------------------------

Query:  -------------------------VSRTYRPKDIIQDMRKEYGVNLSYDRAWHSSEEALRLIRGDPASSYGLLPAYGEALKIM
                                 VSRTYRPKDIIQDMRKEYGVNLSYD+AW SSEEALRLIR DPASSYGLL AYGEALKIM
Subjt:  -------------------------VSRTYRPKDIIQDMRKEYGVNLSYDRAWHSSEEALRLIRGDPASSYGLLPAYGEALKIM

A0A6J1DJT1 uncharacterized protein LOC111020715 | e value: 5.9e-51 | % identity: 53.74
Query:  EEGHFQAEYGNEEHDNALDYELGSDVKQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFK---
        EEG ++AE+ N+++D+ALD E   DV+QVH EI RDE AV+  GC+GLTG  N E LQLIVQSSGTNDV EG+VFD KKELSL+MHLV MR NFQFK   
Subjt:  EEGHFQAEYGNEEHDNALDYELGSDVKQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFK---

Query:  --------------------------------------------------------------------VSRTYRPKDIIQDMRKEYGVNLSYDRAWHSSE
                                                                            VSRTYRPKDIIQDMRKEYGVNLSYD+AW SSE
Subjt:  --------------------------------------------------------------------VSRTYRPKDIIQDMRKEYGVNLSYDRAWHSSE

Query:  EALRLIRGDPASSYGLLPAYGEALKIM
        EALRLIRGDPASSYGLLP YGEALKIM
Subjt:  EALRLIRGDPASSYGLLPAYGEALKIM

A0A6J1DP00 uncharacterized protein LOC111022954 | e value: 8.0e-56 | % identity: 51.64
Query:  NVPGVWNDNEDESGESYDPLAESEEGHFQAEYGNEEHDNALDYELGSDVKQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDN
        N+PG+WNDN+DES ESYD L +SEEG ++AE+ N+++D+A D +   DV+QV  EIRRDE  V   GC+GL G PNDEKLQLIVQSSGTNDV EG VFD 
Subjt:  NVPGVWNDNEDESGESYDPLAESEEGHFQAEYGNEEHDNALDYELGSDVKQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDN

Query:  KKELSLKMHLVAMRKNFQFKVSR-----------------------------------------------------------------------TYRPKD
        KKELSL+ HLVAM  NFQFKV +                                                                       TYRPKD
Subjt:  KKELSLKMHLVAMRKNFQFKVSR-----------------------------------------------------------------------TYRPKD

Query:  IIQDMRKEYGVNLSYDRAWHSSEEALRLIRGDPASSYGLLPAYG
        IIQDMRKEYGVNLSYD+AW S+EEALRLIRGDP +SYGLLPAYG
Subjt:  IIQDMRKEYGVNLSYDRAWHSSEEALRLIRGDPASSYGLLPAYG

A0A6J1DQB9 Reverse transcriptase | e value: 2.5e-57 | % identity: 83.82
Query:  NVPGVWNDNEDESGESYDPLAESEEGHFQAEYGNEEHDNALDYELGSDVKQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDN
        +VPGVWNDNEDESGESYDPLAES+EGH QAEYGNEEHD+ALD EL  DV+QVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDN
Subjt:  NVPGVWNDNEDESGESYDPLAESEEGHFQAEYGNEEHDNALDYELGSDVKQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDN

Query:  KKELSLKMHLVAMRKNFQFKVSRTYRPKDIIQDMRK
        KKELSLKMHLVAMRKNFQFK+      K++ + + K
Subjt:  KKELSLKMHLVAMRKNFQFKVSRTYRPKDIIQDMRK

A0A6J1DTG5 uncharacterized protein LOC111023843 | e value: 4.6e-80 | % identity: 67.2
Query:  NVPGVWNDNEDESGESYDPLAESEEGHFQAEYGNEEHDNALDYELGSDVKQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDN
        +VPGVWNDNEDESGESYDPLA SEEGH QAEYGNEEHD+ALD EL  DV+QVHTEIRRDEEAVR PGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDN
Subjt:  NVPGVWNDNEDESGESYDPLAESEEGHFQAEYGNEEHDNALDYELGSDVKQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDN

Query:  KKELSLKMHLVAMRKNFQFK-----------------------------------------------------------------------VSRTYRPKD
        KKELSLKMHLVAMRKNFQFK                                                                       VSRTYRPKD
Subjt:  KKELSLKMHLVAMRKNFQFK-----------------------------------------------------------------------VSRTYRPKD

Query:  IIQDMRKEYGVNLSYDRAWHSSEEALRLIRGDPASSYGLLPAYGEALKIM
        IIQDMRKEYGVNLSYDRAW SSEEALRLIRGDPASSYGLLPAYG+ALKIM
Subjt:  IIQDMRKEYGVNLSYDRAWHSSEEALRLIRGDPASSYGLLPAYGEALKIM

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGCAATCATTGGAAATTTCTAGCCTTGAAGAGACGATGATAGATTTATGGCAAAGAAATGATGTAGCATTACGTAGGCTGCAGATCGAAATGGAGCAGATGATC
GATGAGTCGAGGAATAGTCCACTAGAAGCATTGCCTAACAATGAGCAAGGTGAGGCAATCAAGGCTACCCTAGCCTCTGCAGCAACCAAAAGCGGTAGTGAAAGA
GTAGAACTCAAATCCCAAGAAAAGTCAGGAATTGCGCCTGGTGCATTTTCCCAACATAGTGTGTTTTCCATGTTTTGCATCAAAACTGAGCAACAAGTCCCTGAG
GATTCGACCTTGGATTACTTCCGAGTATTACTTGTGACCGTGTGCGCTTGCACGAATGTGCCGGGAGTATGGAATGATAACGAAGATGAAAGTGGTGAATCATAT
GACCCGTTGGCAGAGTCTGAAGAAGGACACTTTCAAGCAGAATATGGGAACGAAGAGCATGACAATGCGCTTGATTATGAGCTTGGGTCTGATGTCAAACAAGTG
CACACTGAGATTCGCAGGGATGAAGAAGCGGTCCGGCCACCGGGATGTAATGGTCTCACCGGAGACCCTAATGATGAGAAATTGCAACTTATAGTACAGTCTTCT
GGGACAAATGATGTTAATGAGGGCGATGTATTTGATAATAAGAAGGAGTTGAGTTTGAAAATGCATTTAGTTGCAATGCGGAAGAATTTTCAGTTTAAAGTCTCC
CGCACGTATAGACCGAAGGACATTATACAAGACATGAGGAAGGAGTATGGTGTCAATTTAAGTTATGATAGAGCATGGCATTCTAGTGAAGAAGCACTCCGACTT
ATTAGAGGTGATCCAGCATCATCATATGGTCTACTTCCAGCTTATGGTGAAGCTTTGAAAATCATGATCCAGCATCGCATAAGCTCCTTCTCGAGTGAATCTAGT
GGAGTGGCCGATTGGAGTGAATCCCATACGGTTAAGTCACCTTCCACAAGATCAATTCAGAGCATCACCCAGTGGTTCCCACCTATGTGGACGCCTAACTCGTCG
TCTCGTGGGCTGTGTATCTAG
mRNA sequence
ATGCAATCATTGGAAATTTCTAGCCTTGAAGAGACGATGATAGATTTATGGCAAAGAAATGATGTAGCATTACGTAGGCTGCAGATCGAAATGGAGCAGATGATC
GATGAGTCGAGGAATAGTCCACTAGAAGCATTGCCTAACAATGAGCAAGGTGAGGCAATCAAGGCTACCCTAGCCTCTGCAGCAACCAAAAGCGGTAGTGAAAGA
GTAGAACTCAAATCCCAAGAAAAGTCAGGAATTGCGCCTGGTGCATTTTCCCAACATAGTGTGTTTTCCATGTTTTGCATCAAAACTGAGCAACAAGTCCCTGAG
GATTCGACCTTGGATTACTTCCGAGTATTACTTGTGACCGTGTGCGCTTGCACGAATGTGCCGGGAGTATGGAATGATAACGAAGATGAAAGTGGTGAATCATAT
GACCCGTTGGCAGAGTCTGAAGAAGGACACTTTCAAGCAGAATATGGGAACGAAGAGCATGACAATGCGCTTGATTATGAGCTTGGGTCTGATGTCAAACAAGTG
CACACTGAGATTCGCAGGGATGAAGAAGCGGTCCGGCCACCGGGATGTAATGGTCTCACCGGAGACCCTAATGATGAGAAATTGCAACTTATAGTACAGTCTTCT
GGGACAAATGATGTTAATGAGGGCGATGTATTTGATAATAAGAAGGAGTTGAGTTTGAAAATGCATTTAGTTGCAATGCGGAAGAATTTTCAGTTTAAAGTCTCC
CGCACGTATAGACCGAAGGACATTATACAAGACATGAGGAAGGAGTATGGTGTCAATTTAAGTTATGATAGAGCATGGCATTCTAGTGAAGAAGCACTCCGACTT
ATTAGAGGTGATCCAGCATCATCATATGGTCTACTTCCAGCTTATGGTGAAGCTTTGAAAATCATGATCCAGCATCGCATAAGCTCCTTCTCGAGTGAATCTAGT
GGAGTGGCCGATTGGAGTGAATCCCATACGGTTAAGTCACCTTCCACAAGATCAATTCAGAGCATCACCCAGTGGTTCCCACCTATGTGGACGCCTAACTCGTCG
TCTCGTGGGCTGTGTATCTAG
Protein sequence
MQSLEISSLEETMIDLWQRNDVALRRLQIEMEQMIDESRNSPLEALPNNEQGEAIKATLASAATKSGSERVELKSQEKSGIAPGAFSQHSVFSMFCIKTEQQVPE
DSTLDYFRVLLVTVCACTNVPGVWNDNEDESGESYDPLAESEEGHFQAEYGNEEHDNALDYELGSDVKQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSS
GTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVSRTYRPKDIIQDMRKEYGVNLSYDRAWHSSEEALRLIRGDPASSYGLLPAYGEALKIMIQHRISSFSSESS
GVADWSESHTVKSPSTRSIQSITQWFPPMWTPNSSSRGLCI
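
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the function name `translate_cds` and the variable names are hypothetical and not part of CuGenDB. It handles only the unambiguous bases A, C, G and T, and it stops at the first stop codon.

```python
# Standard genetic code (NCBI translation table 1), assumed here.
# Codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS (whitespace and line breaks allowed) into protein.

    Translation stops at the first stop codon, which is not included
    in the returned string.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 33 and last 21 nucleotides of the CDS sequence listed above.
cds_start = "ATGCAATCATTGGAAATTTCTAGCCTTGAAGAG"
cds_end = "TCTCGTGGGCTGTGTATCTAG"

print(translate_cds(cds_start))  # MQSLEISSLEE
print(translate_cds(cds_end))    # SRGLCI
```

The two printed fragments correspond to the first 11 and last 6 residues of the protein sequence listed above. Passing the full CDS (with its line breaks) to the same function would translate the complete record.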