; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g15520 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g15520
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCCHC-type domain-containing protein
Genome locationchr2:11681666..11682659
RNA-Seq ExpressionMoc02g15520
SyntenyMoc02g15520
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0000166 - nucleotide binding (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016301 - kinase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]5.1e-4755.9Show/hide
Query:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI
        M ++EW  LDR+VLG IRLTL+KNV  +VAKE T  GLM  L++MYEKPS NNKV+L  K F+LKM EG P+ TH+NEF+ ++N+L +V++EF DE  A+
Subjt:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI

Query:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNR---KSWKSLSC
        +L+ SLP+SWEPMRAA+SNS G +K+KF DVRD  L EE+RR D+G   TSS+     RGR+ NR   NRG     RSKSRN +   KS K + C
Subjt:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNR---KSWKSLSC

KAG7584790.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]1.5e-4655.9Show/hide
Query:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI
        M ++EW  LDR+VLG IRLTL+KNV  +VAKE T  GLM  L++MYEKPS NNKV+L  K F+LKM EG P+ TH+NEF+ ++N+L +V++EF DE  A+
Subjt:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI

Query:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNR---KSWKSLSC
        +LL SLP+SWEPMRAA+SNS G +K+KF DVRD  L EE+RR D+G    SS+     RGR+ NR   NRG     RSKSRN +   KS K + C
Subjt:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNR---KSWKSLSC

KAG7593230.1 Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa]3.0e-4756.41Show/hide
Query:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI
        M ++EW  LDR+VLG IRLTL+KNV  +VAKE T  GLM  L++MYEKPS NNKV+L  K F+LKM EG P+ TH+NEF+ ++N+L +V++EF DE  A+
Subjt:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI

Query:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNR---KSWKSLSC
        +LL SLP+SWEPMRAA+SNS G +K+KF DVRD  L EE+RR D+G   TSS+     RGR+ NR   NRG     RSKSRN +   KS K + C
Subjt:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNR---KSWKSLSC

POO03940.1 hypothetical protein TorRG33x02_002440, partial [Trema orientale]9.1e-4455.61Show/hide
Query:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI
        M + +W+ LDR+VLG IRLTLTKNV  +VA+  T   +M+ L++MYEKPS NNKV+L  K F LKM EG  + TH+NEF+ ++++L +V++ F DE  A+
Subjt:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI

Query:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVA-RGR----NNNRGYGNRGKLKNNRSKSRN
        +LL SLP SWEPMRAA+SNS GK K++F DVRD  LAEE+RR DSG   +SSS LN+  RGR    N+NRG G R K +N R KSR+
Subjt:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVA-RGR----NNNRGYGNRGKLKNNRSKSRN

XP_022152845.1 uncharacterized protein LOC111020469 [Momordica charantia]1.4e-8191.3Show/hide
Query:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI
        MGEKEWK LDRKVLGTIRLTLTKNVQSSVAK TT MGLMNALANMYEK SVNNKVYLATKFFNLKMAE TPIT HLNEFD LINKLVAVDLEFS E YAI
Subjt:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI

Query:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNR
        LLLRSLPDSWEPMRAAISNSC KEK+KFEDVRDAALAEEIRRKDSGIAPTS SVLNV RGRNNNRGYGNRGK KNNRS+SRN+R
Subjt:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNR

TrEMBL top hitse value%identityAlignment
A0A0D3BM55 Uncharacterized protein2.3e-4555.45Show/hide
Query:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI
        M + EW+ LDR+VLG IRLTL+KNV  +VAKE T  GLM  L++MYEKPS NNKV+L  K F+LKM EG  +  H+NEF+ ++N+L +V++EF DE  A+
Subjt:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI

Query:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNRKSWKSLSCRWRALGYH
        +LL SLP+SWEPMRAA+SNS G +K+KF DVRD  LAEE+RR DSG A TSS+     RGRN +R   NR    N RSKSRN    W     R  A    
Subjt:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNRKSWKSLSCRWRALGYH

Query:  CW
        CW
Subjt:  CW

A0A0D3CS45 Uncharacterized protein5.2e-4558.15Show/hide
Query:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI
        M + EW+ LDR+VLG IRLTL+KNV  +VAKE    GLM  L++MYEKPS NNKV+L  K F+LKM EG  +  H+NEF+ ++N+L +V++EF DE  A+
Subjt:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI

Query:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNR
        +LL SLP+SWEPMRAA+SNS G +K+KF DVRD  LAEE+RR DSG A TSS+     RGRN +R   NR    N RSKSRN R
Subjt:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNR

A0A0D3DMW7 CCHC-type domain-containing protein3.0e-4557.61Show/hide
Query:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI
        M + EW+ LDR+VLG IRLTL+KNV  ++AKE T  GLM  L++MYEKPSVNNKV+L  K F+LKM EG  +  H+NEF+ ++N+L +V++EF DE  A+
Subjt:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI

Query:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNR
        +LL SLP+SWEPMRAA++NS G +K+KF DVRD  LAEE+RR DSG   TSS+     RGRN +R   NR    N RSKSRN R
Subjt:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNR

A0A6J1DF43 uncharacterized protein LOC1110204696.9e-8291.3Show/hide
Query:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI
        MGEKEWK LDRKVLGTIRLTLTKNVQSSVAK TT MGLMNALANMYEK SVNNKVYLATKFFNLKMAE TPIT HLNEFD LINKLVAVDLEFS E YAI
Subjt:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI

Query:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNR
        LLLRSLPDSWEPMRAAISNSC KEK+KFEDVRDAALAEEIRRKDSGIAPTS SVLNV RGRNNNRGYGNRGK KNNRS+SRN+R
Subjt:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNR

A0A7N2LG47 Integrase catalytic domain-containing protein3.0e-4550.93Show/hide
Query:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI
        M  +EW  LDR+VLG IRLTL+++V  +V KE T + LM AL+ MYEKPS NNKV+L  K FNLKMAE   +  HLNEF+ + N+L +V+++F DE  A+
Subjt:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI

Query:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNV-ARGRNNNRGYGNRGKLKNNRSKSRNNRKSWKSLSCRWRALGY
        ++L SLP+SWE MR A+SNS GKEK+K+ D+RD  LAEEIRR+D+G +  S S LN+  RGR NNR   NRG+ K +R+ +RN  KS      +W   G 
Subjt:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNV-ARGRNNNRGYGNRGKLKNNRSKSRNNRKSWKSLSCRWRALGY

Query:  HCWIMKDVKYPLVK
         C      K P  K
Subjt:  HCWIMKDVKYPLVK

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.4e-1733.33Show/hide
Query:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI
        M  ++W +LD +    IRL L+ +V +++  E T  G+   L ++Y   ++ NK+YL  + + L M+EGT   +HLN F+ LI +L  + ++  +E  AI
Subjt:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAI

Query:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNRKS
        LLL SLP S++ +   I +  GK  ++ +DV  A L  E  RK         +++   RGR+  R   N G+    R KS+N  KS
Subjt:  LLLRSLPDSWEPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNRKS

Arabidopsis top hitse value%identityAlignment
AT3G29785.1 unknown protein5.0e-0849.09Show/hide
Query:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKV
        M + +W  L R+VL  IRLT++KN+  +VAKE +  GLM  L+++Y+KPS NN V
Subjt:  MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGAAAAAGAATGGAAGAATTTGGACAGGAAAGTGTTGGGTACGATTCGCCTGACATTAACTAAAAATGTTCAGAGCAGCGTGGCGAAGGAGACGACCATAATGGG
GTTGATGAATGCCCTGGCTAACATGTATGAAAAACCTTCGGTAAATAATAAGGTGTATCTTGCAACTAAATTTTTTAATTTGAAAATGGCTGAAGGTACACCTATTACTA
CCCATTTAAATGAGTTTGACGCGTTGATTAATAAACTGGTAGCTGTTGATTTAGAATTCAGTGATGAATTTTATGCTATTTTGTTATTAAGATCTTTGCCTGATAGTTGG
GAACCCATGCGAGCTGCTATTTCAAATTCTTGTGGGAAAGAGAAAATGAAATTTGAAGATGTTAGAGATGCAGCTCTTGCAGAAGAAATTCGCAGGAAGGATTCTGGTAT
CGCTCCTACTTCTAGTTCAGTATTGAATGTGGCTAGAGGAAGAAATAATAACAGAGGTTATGGGAATCGAGGCAAGTTGAAAAACAACAGAAGCAAGTCGAGAAACAACA
GGAAATCATGGAAAAGTCTATCTTGCCGATGGAGAGCTTTAGGATATCATTGTTGGATAATGAAGGATGTGAAATATCCTTTGGTCAAGGAAACTGGAAAGTTACAAAGG
GTGTCATGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGAAAAAGAATGGAAGAATTTGGACAGGAAAGTGTTGGGTACGATTCGCCTGACATTAACTAAAAATGTTCAGAGCAGCGTGGCGAAGGAGACGACCATAATGGG
GTTGATGAATGCCCTGGCTAACATGTATGAAAAACCTTCGGTAAATAATAAGGTGTATCTTGCAACTAAATTTTTTAATTTGAAAATGGCTGAAGGTACACCTATTACTA
CCCATTTAAATGAGTTTGACGCGTTGATTAATAAACTGGTAGCTGTTGATTTAGAATTCAGTGATGAATTTTATGCTATTTTGTTATTAAGATCTTTGCCTGATAGTTGG
GAACCCATGCGAGCTGCTATTTCAAATTCTTGTGGGAAAGAGAAAATGAAATTTGAAGATGTTAGAGATGCAGCTCTTGCAGAAGAAATTCGCAGGAAGGATTCTGGTAT
CGCTCCTACTTCTAGTTCAGTATTGAATGTGGCTAGAGGAAGAAATAATAACAGAGGTTATGGGAATCGAGGCAAGTTGAAAAACAACAGAAGCAAGTCGAGAAACAACA
GGAAATCATGGAAAAGTCTATCTTGCCGATGGAGAGCTTTAGGATATCATTGTTGGATAATGAAGGATGTGAAATATCCTTTGGTCAAGGAAACTGGAAAGTTACAAAGG
GTGTCATGGTGA
Protein sequenceShow/hide protein sequence
MGEKEWKNLDRKVLGTIRLTLTKNVQSSVAKETTIMGLMNALANMYEKPSVNNKVYLATKFFNLKMAEGTPITTHLNEFDALINKLVAVDLEFSDEFYAILLLRSLPDSW
EPMRAAISNSCGKEKMKFEDVRDAALAEEIRRKDSGIAPTSSSVLNVARGRNNNRGYGNRGKLKNNRSKSRNNRKSWKSLSCRWRALGYHCWIMKDVKYPLVKETGKLQR
VSW