CuGenDBv2

Moc03g16970 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g16970
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: CCHC-type domain-containing protein
Genome location: chr3:11266979..11272261
RNA-Seq Expression: Moc03g16970
Synteny: Moc03g16970
Gene Ontology terms:
GO:0003824 - catalytic activity (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAB2626078.1 hypothetical protein D8674_017738 [Pyrus ussuriensis x Pyrus communis] | e value: 1.9e-11 | % identity: 36.84
Query:  RYNEGTLVNSHINELTNILNKLEGMGVKIDEEVKAMRLLTSLPDSWETMKTAVSNSLGENNLKFTAICDATLSEEARRKLGKMSASTSGAENGVESALVA
        +Y E   V  H+N   N++N++  +G+ I+EE+ A+ LL SLPDSWET    +SNS     L   A+ D+  +EE RRK       TSGA  G     V 
Subjt:  RYNEGTLVNSHINELTNILNKLEGMGVKIDEEVKAMRLLTSLPDSWETMKTAVSNSLGENNLKFTAICDATLSEEARRKLGKMSASTSGAENGVESALVA

Query:  Q----NKGKTKIS----YNGKQ-------QKFKEDLKKGNTTANVVTEEEQI
        +    N G+T+ +    Y GK+       +K+K++ K GN TA VVT +E++
Subjt:  Q----NKGKTKIS----YNGKQ-------QKFKEDLKKGNTTANVVTEEEQI

KAG2715942.1 hypothetical protein I3760_03G102900 [Carya illinoinensis] | e value: 2.3e-12 | % identity: 43.36
Query:  FLGRYNEGTLVNSHINELTNILNKLEGMGVKIDEEVKAMRLLTSLPDSWETMKTAVSNSLGENNLKFTAICDATLSEEARRK---LGKMSASTSGAENGV
        F  R  EGTLV  H+NE   I+N+L  +G++ D+EV+A+  L  LP SWE M+TAVSNS G+  +K+  I D  LSEE RR+       S+ST   E   
Subjt:  FLGRYNEGTLVNSHINELTNILNKLEGMGVKIDEEVKAMRLLTSLPDSWETMKTAVSNSLGENNLKFTAICDATLSEEARRK---LGKMSASTSGAENGV

Query:  ESALVAQNKGKTK
          A  + N+G++K
Subjt:  ESALVAQNKGKTK

KAG6639444.1 hypothetical protein CIPAW_10G100900 [Carya illinoinensis] | e value: 4.1e-14 | % identity: 40.56
Query:  EGTLVNSHINELTNILNKLEGMGVKIDEEVKAMRLLTSLPDSWETMKTAVSNSLGENNLKFTAICDATLSEEARRKLGKMSASTSGAENGVESALVA---
        +GT V  H+N+   I N+L  + ++ D+E++A+ LL SLP+SWE M+ AVSNS G++ LK+  I D  L+EE R+K        SG  +G+ SAL A   
Subjt:  EGTLVNSHINELTNILNKLEGMGVKIDEEVKAMRLLTSLPDSWETMKTAVSNSLGENNLKFTAICDATLSEEARRKLGKMSASTSGAENGVESALVA---

Query:  -------QNKGKTKISYNGKQQKFKEDLKKGNTTANVVTEEEQ
                N+G++K  Y G+  +    LK  N +ANVVTEE Q
Subjt:  -------QNKGKTKISYNGKQQKFKEDLKKGNTTANVVTEEEQ

XP_022152155.1 cinnamoyl-CoA reductase-like SNL6 [Momordica charantia] | e value: 8.8e-41 | % identity: 77.1
Query:  VVATGRKRSSVYVSEFEVAKGSLRQTMHRVAADGSGRDLRGPAAMMARTDQKNLPSAQVKQLRSTEKGNGNLIGHRVHSSTVRRSSELMKSHRRISALKG
        VVATG KRS VYVSEFEVAKGSLRQTMH+V A GS R LR PAA+MA+TDQKNLPS QVKQLRST+KGN NLIGHRVH+S VR   EL+KSHRRISALKG
Subjt:  VVATGRKRSSVYVSEFEVAKGSLRQTMHRVAADGSGRDLRGPAAMMARTDQKNLPSAQVKQLRSTEKGNGNLIGHRVHSSTVRRSSELMKSHRRISALKG

Query:  TSSVSSVATDLGGSAKLLGESSFKGRSVRSR
        T S+SSVAT L GSAK  GESSF+   +RSR
Subjt:  TSSVSSVATDLGGSAKLLGESSFKGRSVRSR

XP_022157059.1 uncharacterized protein LOC111023870 [Momordica charantia] | e value: 3.8e-36 | % identity: 82.08
Query:  GSLRQTMHRVAADGSGRDLRGPAAMMARTDQKNLPSAQVKQLRSTEKGNGNLIGHRVHSSTVRRSSELMKSHRRISALKGTSSVSSVATDLGGSAKLLGE
        GSLRQTMHRVA D SGRDL+GP  +MARTDQKNLPSA VKQLRSTEKGN NLIGH+VH+S VRRS EL+KSHRRISA KGT  VSSV TDLGGSAK  GE
Subjt:  GSLRQTMHRVAADGSGRDLRGPAAMMARTDQKNLPSAQVKQLRSTEKGNGNLIGHRVHSSTVRRSSELMKSHRRISALKGTSSVSSVATDLGGSAKLLGE

Query:  SSFKGR
        SSFKGR
Subjt:  SSFKGR

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EUG1 Uncharacterized protein | e value: 4.2e-12 | % identity: 39.13
Query:  CGRFAGRLSNGPCLEMVKFLG-RYNEGTLVNSHINELTNILNKLEGMGVKIDEEVKAMRLLTSLPDSWETMKTAVSNSLGENNLKFTAICDATLSEEARR
        CG +    +N     M K    +  EGT V  H+NE   I N+L  + ++ D+E++A+ +L SLP+SWE M+ AVSNS G+  LK+  I D  L EE RR
Subjt:  CGRFAGRLSNGPCLEMVKFLG-RYNEGTLVNSHINELTNILNKLEGMGVKIDEEVKAMRLLTSLPDSWETMKTAVSNSLGENNLKFTAICDATLSEEARR

Query:  KLGKMSASTSGAENGVES---ALVAQNKGKTKISYNGKQQKFKEDLKK--GNTTANVVTEE
        +      S+SG+   +E+    L   N GKT     G  +K   +LKK   N +ANVVTEE
Subjt:  KLGKMSASTSGAENGVES---ALVAQNKGKTKISYNGKQQKFKEDLKK--GNTTANVVTEE

A0A2N9GHB5 Uncharacterized protein | e value: 4.2e-12 | % identity: 39.13
Query:  CGRFAGRLSNGPCLEMVKFLG-RYNEGTLVNSHINELTNILNKLEGMGVKIDEEVKAMRLLTSLPDSWETMKTAVSNSLGENNLKFTAICDATLSEEARR
        CG +    +N     M K    +  EGT V  H+NE   I N+L  + ++ D+E++A+ +L SLP+SWE M+ AVSNS G+  LK+  I D  L EE RR
Subjt:  CGRFAGRLSNGPCLEMVKFLG-RYNEGTLVNSHINELTNILNKLEGMGVKIDEEVKAMRLLTSLPDSWETMKTAVSNSLGENNLKFTAICDATLSEEARR

Query:  KLGKMSASTSGAENGVES---ALVAQNKGKTKISYNGKQQKFKEDLKK--GNTTANVVTEE
        +      S+SG+   +E+    L   N GKT     G  +K   +LKK   N +ANVVTEE
Subjt:  KLGKMSASTSGAENGVES---ALVAQNKGKTKISYNGKQQKFKEDLKK--GNTTANVVTEE

A0A2N9HSF0 gag_pre-integrs domain-containing protein | e value: 2.4e-12 | % identity: 35.66
Query:  CGRFAGRLSNGPCLEMVKFLG-RYNEGTLVNSHINELTNILNKLEGMGVKIDEEVKAMRLLTSLPDSWETMKTAVSNSLGENNLKFTAICDATLSEEARR
        CG +    +N     M K    +  EGT V  H+NE   I N+L  + ++ D+E++A+ +L SLP+SWE M+ AVSNS G+  LK+  I D  L EE RR
Subjt:  CGRFAGRLSNGPCLEMVKFLG-RYNEGTLVNSHINELTNILNKLEGMGVKIDEEVKAMRLLTSLPDSWETMKTAVSNSLGENNLKFTAICDATLSEEARR

Query:  KLGKMSASTSGAENGVESALVAQNKGKTKISYNGKQQKFKEDL
        +    ++S+  A N     L A+ +GK +    G+ +  KE++
Subjt:  KLGKMSASTSGAENGVESALVAQNKGKTKISYNGKQQKFKEDL

A0A6J1DD48 cinnamoyl-CoA reductase-like SNL6 | e value: 4.3e-41 | % identity: 77.1
Query:  VVATGRKRSSVYVSEFEVAKGSLRQTMHRVAADGSGRDLRGPAAMMARTDQKNLPSAQVKQLRSTEKGNGNLIGHRVHSSTVRRSSELMKSHRRISALKG
        VVATG KRS VYVSEFEVAKGSLRQTMH+V A GS R LR PAA+MA+TDQKNLPS QVKQLRST+KGN NLIGHRVH+S VR   EL+KSHRRISALKG
Subjt:  VVATGRKRSSVYVSEFEVAKGSLRQTMHRVAADGSGRDLRGPAAMMARTDQKNLPSAQVKQLRSTEKGNGNLIGHRVHSSTVRRSSELMKSHRRISALKG

Query:  TSSVSSVATDLGGSAKLLGESSFKGRSVRSR
        T S+SSVAT L GSAK  GESSF+   +RSR
Subjt:  TSSVSSVATDLGGSAKLLGESSFKGRSVRSR

A0A6J1DVE9 uncharacterized protein LOC111023870 | e value: 1.9e-36 | % identity: 82.08
Query:  GSLRQTMHRVAADGSGRDLRGPAAMMARTDQKNLPSAQVKQLRSTEKGNGNLIGHRVHSSTVRRSSELMKSHRRISALKGTSSVSSVATDLGGSAKLLGE
        GSLRQTMHRVA D SGRDL+GP  +MARTDQKNLPSA VKQLRSTEKGN NLIGH+VH+S VRRS EL+KSHRRISA KGT  VSSV TDLGGSAK  GE
Subjt:  GSLRQTMHRVAADGSGRDLRGPAAMMARTDQKNLPSAQVKQLRSTEKGNGNLIGHRVHSSTVRRSSELMKSHRRISALKGTSSVSSVATDLGGSAKLLGE

Query:  SSFKGR
        SSFKGR
Subjt:  SSFKGR

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.6e-05 | % identity: 33.33
Query:  LSNGPCLEMVKFLGRYNEGTLVNSHINELTNILNKLEGMGVKIDEEVKAMRLLTSLPDSWETMKTAVSNSLGENNLKFTAICDATLSEEARRK
        L+N   L+   +    +EGT   SH+N    ++ +L  +GVKI+EE KA+ LL SLP S++ + T + +  G+  ++   +  A L  E  RK
Subjt:  LSNGPCLEMVKFLGRYNEGTLVNSHINELTNILNKLEGMGVKIDEEVKAMRLLTSLPDSWETMKTAVSNSLGENNLKFTAICDATLSEEARRK

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGGTGACAGATATCAAGCACACCATTATTATGAAGGGGTCCTGAAGTTCAATGGAGAGAATTTCAGTTTTTGGAAGATGCAGGTAAAGGATCTTCTTACATATACTAC
AGCGAAAGATTTGTTGAAGGTCTTGCAAGACAGCCGCCAGCCCCCCCCCCCCTCGAACTTCTTCTCCGGCGACTCCACTGTGGCGACAACTCACAACCCGAACAGCGACA
CACCTTCAGTCTGCACCAGATCTGTTCGACGATCGGCGACCACGGAGAAAGCTTCTTCTGGCGACCACGAGCACGGCGACGCTGCGCTTCAACTGCGTCCTACTGACGGC
ATCTACGACGGATCGGTCAAACCCACAGCGGGCCGGCGTTCCTTGAACCAACAGCGACGGCGTTTCCTCGAACAGCAGCGGCGGCTCTCAATCCCGAACGGCGGCGCAAC
TTGGAACTTCTTCGGCAGTCGTCGACGGCAGGAGCGGCGTGCCTTCAGCCCTTTTCCGGCTTCTCACACGGGTCTGGAACCCAAACCCGGGGCGATTCCGGTTTTCTACA
GTGGCTGTGCCACGCTTACCCACACCTGTCCGGATTTCGATTCGCGGTACCCACCCCTATTTAGACTCAAGCCAGCATTACCCATCGCTTTTGACATCGGAACAGCAAGC
ATAAGGACATTCAAACTCAGTTTTGGATGCCCGAACCTCCTCGGCGTCGACCCACTCCTTACCCAAGAGGTTTTATGGAACCCTTCGGTGAACTTGGGCCTCGAGTATAA
ATGGTCGAGGGCTGATACGTCACTAATTGGGTATCGAGGCCTCGGGTATAAATGGTCGGGGGTCGATACGCCAATATTGGATAAAGATGAGCGTCAAGGCCTCAGGGTCG
GGGCTGTGGGTATAAATGGTCAGGGGCCGACTCTATTTGAAGGAGATAGCTGTGGAAGGTTCGCAGGTAGGCTTAGTAACGGTCCCTGTTTGGAAATGGTGAAATTCCTG
GGGCGTTACAATGAGGGAACCTTGGTGAATTCCCACATTAATGAACTCACCAATATCTTGAACAAGTTAGAAGGGATGGGCGTCAAGATTGACGAGGAGGTGAAAGCTAT
GAGGCTGTTGACGTCTTTACCTGACAGTTGGGAGACGATGAAGACCGCAGTGTCGAATTCGCTAGGGGAAAATAACTTGAAATTTACAGCTATTTGTGATGCCACCTTAT
CTGAGGAAGCCCGGAGAAAATTAGGGAAAATGTCTGCATCTACTTCAGGGGCAGAAAACGGGGTTGAATCAGCTTTGGTAGCTCAGAACAAAGGGAAGACAAAGATTAGT
TACAATGGGAAGCAGCAGAAGTTTAAAGAAGATCTTAAGAAGGGGAATACTACTGCAAACGTTGTAACAGAAGAAGAACAGATTGAAGAGGTGGTGGCAACAGGCCGCAA
GAGATCTTCTGTTTATGTGTCAGAATTTGAGGTTGCCAAGGGTTCACTGAGACAGACGATGCATAGAGTAGCTGCAGATGGTTCAGGGCGAGACCTTAGAGGACCGGCAG
CAATGATGGCCAGAACAGATCAGAAGAATCTGCCATCAGCTCAAGTTAAACAGTTGAGAAGTACAGAAAAGGGAAACGGGAATTTGATAGGCCATCGAGTTCATTCCTCA
ACTGTCAGACGTTCAAGCGAGCTGATGAAGTCGCATAGGCGAATTAGTGCATTGAAGGGTACGAGTTCTGTTTCTAGTGTGGCGACAGACTTGGGTGGGAGTGCCAAGTT
ATTAGGGGAATCTTCCTTCAAAGGTCGTTCGGTTCGATCGAGAAAGAAAGCGACGGGGACCACTTAG
mRNA sequence
ATGGGTGACAGATATCAAGCACACCATTATTATGAAGGGGTCCTGAAGTTCAATGGAGAGAATTTCAGTTTTTGGAAGATGCAGGTAAAGGATCTTCTTACATATACTAC
AGCGAAAGATTTGTTGAAGGTCTTGCAAGACAGCCGCCAGCCCCCCCCCCCCTCGAACTTCTTCTCCGGCGACTCCACTGTGGCGACAACTCACAACCCGAACAGCGACA
CACCTTCAGTCTGCACCAGATCTGTTCGACGATCGGCGACCACGGAGAAAGCTTCTTCTGGCGACCACGAGCACGGCGACGCTGCGCTTCAACTGCGTCCTACTGACGGC
ATCTACGACGGATCGGTCAAACCCACAGCGGGCCGGCGTTCCTTGAACCAACAGCGACGGCGTTTCCTCGAACAGCAGCGGCGGCTCTCAATCCCGAACGGCGGCGCAAC
TTGGAACTTCTTCGGCAGTCGTCGACGGCAGGAGCGGCGTGCCTTCAGCCCTTTTCCGGCTTCTCACACGGGTCTGGAACCCAAACCCGGGGCGATTCCGGTTTTCTACA
GTGGCTGTGCCACGCTTACCCACACCTGTCCGGATTTCGATTCGCGGTACCCACCCCTATTTAGACTCAAGCCAGCATTACCCATCGCTTTTGACATCGGAACAGCAAGC
ATAAGGACATTCAAACTCAGTTTTGGATGCCCGAACCTCCTCGGCGTCGACCCACTCCTTACCCAAGAGGTTTTATGGAACCCTTCGGTGAACTTGGGCCTCGAGTATAA
ATGGTCGAGGGCTGATACGTCACTAATTGGGTATCGAGGCCTCGGGTATAAATGGTCGGGGGTCGATACGCCAATATTGGATAAAGATGAGCGTCAAGGCCTCAGGGTCG
GGGCTGTGGGTATAAATGGTCAGGGGCCGACTCTATTTGAAGGAGATAGCTGTGGAAGGTTCGCAGGTAGGCTTAGTAACGGTCCCTGTTTGGAAATGGTGAAATTCCTG
GGGCGTTACAATGAGGGAACCTTGGTGAATTCCCACATTAATGAACTCACCAATATCTTGAACAAGTTAGAAGGGATGGGCGTCAAGATTGACGAGGAGGTGAAAGCTAT
GAGGCTGTTGACGTCTTTACCTGACAGTTGGGAGACGATGAAGACCGCAGTGTCGAATTCGCTAGGGGAAAATAACTTGAAATTTACAGCTATTTGTGATGCCACCTTAT
CTGAGGAAGCCCGGAGAAAATTAGGGAAAATGTCTGCATCTACTTCAGGGGCAGAAAACGGGGTTGAATCAGCTTTGGTAGCTCAGAACAAAGGGAAGACAAAGATTAGT
TACAATGGGAAGCAGCAGAAGTTTAAAGAAGATCTTAAGAAGGGGAATACTACTGCAAACGTTGTAACAGAAGAAGAACAGATTGAAGAGGTGGTGGCAACAGGCCGCAA
GAGATCTTCTGTTTATGTGTCAGAATTTGAGGTTGCCAAGGGTTCACTGAGACAGACGATGCATAGAGTAGCTGCAGATGGTTCAGGGCGAGACCTTAGAGGACCGGCAG
CAATGATGGCCAGAACAGATCAGAAGAATCTGCCATCAGCTCAAGTTAAACAGTTGAGAAGTACAGAAAAGGGAAACGGGAATTTGATAGGCCATCGAGTTCATTCCTCA
ACTGTCAGACGTTCAAGCGAGCTGATGAAGTCGCATAGGCGAATTAGTGCATTGAAGGGTACGAGTTCTGTTTCTAGTGTGGCGACAGACTTGGGTGGGAGTGCCAAGTT
ATTAGGGGAATCTTCCTTCAAAGGTCGTTCGGTTCGATCGAGAAAGAAAGCGACGGGGACCACTTAG
Protein sequence
MGDRYQAHHYYEGVLKFNGENFSFWKMQVKDLLTYTTAKDLLKVLQDSRQPPPPSNFFSGDSTVATTHNPNSDTPSVCTRSVRRSATTEKASSGDHEHGDAALQLRPTDG
IYDGSVKPTAGRRSLNQQRRRFLEQQRRLSIPNGGATWNFFGSRRRQERRAFSPFPASHTGLEPKPGAIPVFYSGCATLTHTCPDFDSRYPPLFRLKPALPIAFDIGTAS
IRTFKLSFGCPNLLGVDPLLTQEVLWNPSVNLGLEYKWSRADTSLIGYRGLGYKWSGVDTPILDKDERQGLRVGAVGINGQGPTLFEGDSCGRFAGRLSNGPCLEMVKFL
GRYNEGTLVNSHINELTNILNKLEGMGVKIDEEVKAMRLLTSLPDSWETMKTAVSNSLGENNLKFTAICDATLSEEARRKLGKMSASTSGAENGVESALVAQNKGKTKIS
YNGKQQKFKEDLKKGNTTANVVTEEEQIEEVVATGRKRSSVYVSEFEVAKGSLRQTMHRVAADGSGRDLRGPAAMMARTDQKNLPSAQVKQLRSTEKGNGNLIGHRVHSS
TVRRSSELMKSHRRISALKGTSSVSSVATDLGGSAKLLGESSFKGRSVRSRKKATGTT