; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g20090 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g20090
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCCHC-type domain-containing protein
Genome locationchr7:14470637..14472781
RNA-Seq ExpressionMoc07g20090
SyntenyMoc07g20090
Gene Ontology termsGO:0009059 - macromolecule biosynthetic process (biological process)
GO:0010467 - gene expression (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0044267 - cellular protein metabolic process (biological process)
GO:0044271 - cellular nitrogen compound biosynthetic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0000166 - nucleotide binding (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004672 - protein kinase activity (molecular function)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5758504.1 putative RNA-directed DNA polymerase [Helianthus annuus]2.0e-2368.48Show/hide
Query:  TKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKDTLNFSEATSKLLSEERRLKSEGRTSHEDSALVASNWKKK
        TKISDHLS LN+I+ ELEAI VK++DEDK LRLILSL  SYEHMKPILMYGK+TL +++ T KLLSEE+RL S G TS E + L+  N KKK
Subjt:  TKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKDTLNFSEATSKLLSEERRLKSEGRTSHEDSALVASNWKKK

KAF5765959.1 putative RNA-directed DNA polymerase [Helianthus annuus]2.0e-2368.48Show/hide
Query:  TKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKDTLNFSEATSKLLSEERRLKSEGRTSHEDSALVASNWKKK
        TKISDHLS LN+I+ ELEAI VK++DEDK LRLILSL  SYEHMKPILMYGK+TL +++ T KLLSEE+RL S G TS E + L+  N KKK
Subjt:  TKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKDTLNFSEATSKLLSEERRLKSEGRTSHEDSALVASNWKKK

PON72378.1 Zinc finger, CCHC-type [Trema orientale]8.1e-2568.57Show/hide
Query:  SRGTKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKDTLNFSEATSKLLSEERRLKSEGRTSHEDSALVASNWKKKENSVQ
        S GT ISDHLS LN I+FELEAI VKIDDEDK LRLI SLP SYEHMKPIL+YGK+T+ FSE TS LLSEERRL   G  S E S L   N KKK NS +
Subjt:  SRGTKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKDTLNFSEATSKLLSEERRLKSEGRTSHEDSALVASNWKKKENSVQ

Query:  KKDCC
        K   C
Subjt:  KKDCC

XP_022139673.1 uncharacterized protein LOC111010521 [Momordica charantia]6.6e-5152.61Show/hide
Query:  CDDEKKFLEGDIPVEVGDF-----FPSSGIRALGCAVQSGLHKALKGRPSEGTSEKLSSDGGPMESSGGSSRGT--------------------------
        C+ +  F    + ++V  F     F    ++     +QS LHKALKGRPSEG SEKLS DGGPMESSGGSSRG+                          
Subjt:  CDDEKKFLEGDIPVEVGDF-----FPSSGIRALGCAVQSGLHKALKGRPSEGTSEKLSSDGGPMESSGGSSRGT--------------------------

Query:  ------------------------------------------------KISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKD
                                                        KISDHLSNLNSIIFELEAIEVKIDDEDK LRLILSLP SYEHMKPILMYGKD
Subjt:  ------------------------------------------------KISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKD

Query:  TLNFSEATSKLLSEERRLKSEGRTSHEDSALVASNWKKKENSVQKKDCC
        TLNF+E TSKLLSEERRLKSEGRTSHEDSALV SNWKKK++SVQKK CC
Subjt:  TLNFSEATSKLLSEERRLKSEGRTSHEDSALVASNWKKKENSVQKKDCC

XP_022145135.1 uncharacterized protein LOC111014651 [Momordica charantia]3.0e-2786.75Show/hide
Query:  GTKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKDTLNFSEATSKLLSEERRLKSEGRTSHEDS
        GTKI DHLSNLNSIIFELEAIEVKIDDEDK LRLILSL  SYEHMKPILMYGKD LNF+EATSKLL E+RRLKSEGRTS  D+
Subjt:  GTKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKDTLNFSEATSKLLSEERRLKSEGRTSHEDS

TrEMBL top hitse value%identityAlignment
A0A2P5DGF9 Zinc finger, CCHC-type3.9e-2568.57Show/hide
Query:  SRGTKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKDTLNFSEATSKLLSEERRLKSEGRTSHEDSALVASNWKKKENSVQ
        S GT ISDHLS LN I+FELEAI VKIDDEDK LRLI SLP SYEHMKPIL+YGK+T+ FSE TS LLSEERRL   G  S E S L   N KKK NS +
Subjt:  SRGTKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKDTLNFSEATSKLLSEERRLKSEGRTSHEDSALVASNWKKKENSVQ

Query:  KKDCC
        K   C
Subjt:  KKDCC

A0A2P5EHD2 Zinc finger, CCHC-type2.2e-2366.67Show/hide
Query:  SRGTKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKDTLNFSEATSKLLSEERRLKSEGRTSHEDSALVASNWKKKENSVQ
        S GT ISDHL  LN I+ ELEAI VKIDDEDK LRLI SLP SYEHMK IL+YGK+T+ FSE TS LLSEERRL   G  S E SAL+  N KKK NS +
Subjt:  SRGTKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKDTLNFSEATSKLLSEERRLKSEGRTSHEDSALVASNWKKKENSVQ

Query:  KKDCC
        K   C
Subjt:  KKDCC

A0A6J1CG82 uncharacterized protein LOC1110105213.2e-5152.61Show/hide
Query:  CDDEKKFLEGDIPVEVGDF-----FPSSGIRALGCAVQSGLHKALKGRPSEGTSEKLSSDGGPMESSGGSSRGT--------------------------
        C+ +  F    + ++V  F     F    ++     +QS LHKALKGRPSEG SEKLS DGGPMESSGGSSRG+                          
Subjt:  CDDEKKFLEGDIPVEVGDF-----FPSSGIRALGCAVQSGLHKALKGRPSEGTSEKLSSDGGPMESSGGSSRGT--------------------------

Query:  ------------------------------------------------KISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKD
                                                        KISDHLSNLNSIIFELEAIEVKIDDEDK LRLILSLP SYEHMKPILMYGKD
Subjt:  ------------------------------------------------KISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKD

Query:  TLNFSEATSKLLSEERRLKSEGRTSHEDSALVASNWKKKENSVQKKDCC
        TLNF+E TSKLLSEERRLKSEGRTSHEDSALV SNWKKK++SVQKK CC
Subjt:  TLNFSEATSKLLSEERRLKSEGRTSHEDSALVASNWKKKENSVQKKDCC

A0A6J1CTL7 uncharacterized protein LOC1110146511.4e-2786.75Show/hide
Query:  GTKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKDTLNFSEATSKLLSEERRLKSEGRTSHEDS
        GTKI DHLSNLNSIIFELEAIEVKIDDEDK LRLILSL  SYEHMKPILMYGKD LNF+EATSKLL E+RRLKSEGRTS  D+
Subjt:  GTKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKDTLNFSEATSKLLSEERRLKSEGRTSHEDS

A0A7J7LXP2 CCHC-type domain-containing protein1.3e-2331.2Show/hide
Query:  TKISDHLSNLNSIIFELEAIEVKIDDEDEALRLILSLPPSYEHMKPILDFGH---------MKKDCPNKA---DSSKGSRRDADSVYLIRETMNSSFEER
        T + DHL  LN I+ ELE+I VK++DE +AL+LI SLP S++H++P L +G          +  +  N+A    ++ GSR ++  V+    T ++  ++ 
Subjt:  TKISDHLSNLNSIIFELEAIEVKIDDEDEALRLILSLPPSYEHMKPILDFGH---------MKKDCPNKA---DSSKGSRRDADSVYLIRETMNSSFEER

Query:  ENASSWYVRFTMIED------------VILAVPQVYTWALAWRLCKVCVVEVMLMAEE-LPGEPSRWKLHHKSQQDLSTCDDEKKFLEGDIPVEVGDFFP
            +W+  +   E              I  V  +       R+  + VV+ +LMA++ +  E   W       +DL                   D + 
Subjt:  ENASSWYVRFTMIED------------VILAVPQVYTWALAWRLCKVCVVEVMLMAEE-LPGEPSRWKLHHKSQQDLSTCDDEKKFLEGDIPVEVGDFFP

Query:  SSGIRALGCAVQSGLHKALKGRPSEGTSEKLSSDGGPMESSG-----------GSSRGTKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYE
         S +R   C  ++ L     G+  +G  +KL S       S              + GT + DHL +LN I+ ELE+I VK++DE K L+LI  +P S++
Subjt:  SSGIRALGCAVQSGLHKALKGRPSEGTSEKLSSDGGPMESSG-----------GSSRGTKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYE

Query:  HMKPILMYGKDTLNFSEATSKLLSEERRLKSEGRTSHEDSALVASNWKKKENSVQKKDC
        H++P L+YGK+TL+F E TS LLSEERRLK   ++  E+SA+V S  KK  N  +K  C
Subjt:  HMKPILMYGKDTLNFSEATSKLLSEERRLKSEGRTSHEDSALVASNWKKKENSVQKKDC

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-0738.67Show/hide
Query:  SRGTKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKDTLNFSEATSKLLSEERRLK
        S GT    HL+  N +I +L  + VKI++EDK + L+ SLP SY+++   +++GK T+   + TS LL  E+  K
Subjt:  SRGTKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKDTLNFSEATSKLLSEERRLK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAAGGTACGAAAATTTCAGATCATCTGAGTAATCTCAATAGCATCATCTTTGAGCTGGAGGCGATCGAAGTGAAGATAGATGACGAAGATGAAGCACTCAGGCT
CATCTTATCACTTCCACCTTCTTATGAACACATGAAGCCGATACTTGATTTTGGGCACATGAAGAAAGATTGTCCCAATAAAGCTGATTCGTCAAAGGGCTCTAGGCGGG
ATGCTGATAGTGTTTATCTCATCAGGGAGACGATGAACTCTTCCTTCGAAGAGAGAGAGAATGCATCCTCATGGTATGTCCGCTTTACCATGATAGAGGATGTGATATTA
GCGGTTCCACAAGTTTACACATGGGCATTGGCTTGGCGGTTATGTAAGGTGTGTGTGGTGGAAGTTATGTTGATGGCTGAAGAACTTCCAGGTGAGCCAAGTAGGTGGAA
GTTGCACCATAAATCTCAGCAAGATTTATCGACATGTGACGATGAAAAAAAATTCTTGGAAGGTGACATTCCAGTAGAAGTAGGGGATTTTTTCCCATCAAGTGGTATCA
GAGCTTTAGGATGTGCTGTTCAATCTGGGTTACACAAGGCGTTGAAGGGAAGACCGAGTGAAGGTACTTCTGAAAAGCTAAGCAGTGATGGTGGTCCAATGGAGTCTAGT
GGTGGTTCCAGCAGAGGTACGAAAATTTCAGACCATCTGAGTAATCTCAATAGCATCATCTTTGAGCTGGAGGCGATCGAAGTGAAGATAGATGACGAAGATAAAGTACT
CAGGCTCATCTTATCACTTCCACTTTCTTATGAACACATGAAGCCGATCTTGATGTATGGGAAGGATACTTTGAATTTTTCCGAGGCTACTAGTAAACTTCTTTCAGAGG
AAAGAAGGCTGAAGAGTGAAGGGCGTACTTCACATGAAGATTCGGCACTGGTAGCTAGCAATTGGAAGAAGAAGGAAAACTCCGTACAAAAGAAAGATTGTTGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAAGGTACGAAAATTTCAGATCATCTGAGTAATCTCAATAGCATCATCTTTGAGCTGGAGGCGATCGAAGTGAAGATAGATGACGAAGATGAAGCACTCAGGCT
CATCTTATCACTTCCACCTTCTTATGAACACATGAAGCCGATACTTGATTTTGGGCACATGAAGAAAGATTGTCCCAATAAAGCTGATTCGTCAAAGGGCTCTAGGCGGG
ATGCTGATAGTGTTTATCTCATCAGGGAGACGATGAACTCTTCCTTCGAAGAGAGAGAGAATGCATCCTCATGGTATGTCCGCTTTACCATGATAGAGGATGTGATATTA
GCGGTTCCACAAGTTTACACATGGGCATTGGCTTGGCGGTTATGTAAGGTGTGTGTGGTGGAAGTTATGTTGATGGCTGAAGAACTTCCAGGTGAGCCAAGTAGGTGGAA
GTTGCACCATAAATCTCAGCAAGATTTATCGACATGTGACGATGAAAAAAAATTCTTGGAAGGTGACATTCCAGTAGAAGTAGGGGATTTTTTCCCATCAAGTGGTATCA
GAGCTTTAGGATGTGCTGTTCAATCTGGGTTACACAAGGCGTTGAAGGGAAGACCGAGTGAAGGTACTTCTGAAAAGCTAAGCAGTGATGGTGGTCCAATGGAGTCTAGT
GGTGGTTCCAGCAGAGGTACGAAAATTTCAGACCATCTGAGTAATCTCAATAGCATCATCTTTGAGCTGGAGGCGATCGAAGTGAAGATAGATGACGAAGATAAAGTACT
CAGGCTCATCTTATCACTTCCACTTTCTTATGAACACATGAAGCCGATCTTGATGTATGGGAAGGATACTTTGAATTTTTCCGAGGCTACTAGTAAACTTCTTTCAGAGG
AAAGAAGGCTGAAGAGTGAAGGGCGTACTTCACATGAAGATTCGGCACTGGTAGCTAGCAATTGGAAGAAGAAGGAAAACTCCGTACAAAAGAAAGATTGTTGCTAG
Protein sequenceShow/hide protein sequence
MEEGTKISDHLSNLNSIIFELEAIEVKIDDEDEALRLILSLPPSYEHMKPILDFGHMKKDCPNKADSSKGSRRDADSVYLIRETMNSSFEERENASSWYVRFTMIEDVIL
AVPQVYTWALAWRLCKVCVVEVMLMAEELPGEPSRWKLHHKSQQDLSTCDDEKKFLEGDIPVEVGDFFPSSGIRALGCAVQSGLHKALKGRPSEGTSEKLSSDGGPMESS
GGSSRGTKISDHLSNLNSIIFELEAIEVKIDDEDKVLRLILSLPLSYEHMKPILMYGKDTLNFSEATSKLLSEERRLKSEGRTSHEDSALVASNWKKKENSVQKKDCC