; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g19060 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g19060
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCCHC-type domain-containing protein
Genome locationchr4:13962321..13974301
RNA-Seq ExpressionMoc04g19060
SyntenyMoc04g19060
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022153198.1 uncharacterized protein LOC111020753 [Momordica charantia]3.8e-3137.66Show/hide
Query:  DGVPQVPQDPNMVILQAIQGMMEMMMEDRQERRAQQQREERALQEDDA----------------------------------------------------
        DGVPQVPQDPNMVILQAIQGMMEMMMEDRQERRAQQQREERALQEDDA                                                    
Subjt:  DGVPQVPQDPNMVILQAIQGMMEMMMEDRQERRAQQQREERALQEDDA----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------NPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFV
                                              NPPPYLEDMYHY VKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPK KFV
Subjt:  --------------------------------------NPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFV

Query:  AAKRMEGE
        AAKRMEGE
Subjt:  AAKRMEGE

XP_022158838.1 uncharacterized protein LOC111025303 [Momordica charantia]1.6e-3240.88Show/hide
Query:  DGVPQVPQDPNMVILQAIQGMMEMMMEDRQERRAQQQREERALQEDDA----------------------------------------------------
        DGVPQVPQDPN VILQAIQGMMEMMMEDRQERRAQQQREERALQEDDA                                                    
Subjt:  DGVPQVPQDPNMVILQAIQGMMEMMMEDRQERRAQQQREERALQEDDA----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----NPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFVAAKRMEGE
            NPP YLEDMYHYAVKIEDQ KEEKEYSKRYT KANTYSNSNAW KAGFVNRNES+QPKGKFVAAKRMEGE
Subjt:  ----NPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFVAAKRMEGE

XP_022932136.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111438459, partial [Cucurbita moschata]1.8e-2060.95Show/hide
Query:  MEMMMEDRQERRAQQQREERAL--------QEDDANPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFVAAK
        ME MME    R  ++    R L           D NPPPYLEDM HYA+KIEDQLKEEKE+SKRYTS+ NT+SNS  WNK  FVNRNES+ PK KFVAAK
Subjt:  MEMMMEDRQERRAQQQREERAL--------QEDDANPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFVAAK

Query:  RMEGE
        R+E E
Subjt:  RMEGE

XP_023520950.1 uncharacterized protein LOC111784506, partial [Cucurbita pepo subsp. pepo]1.0e-2380.56Show/hide
Query:  DANPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFVAAKRMEGE
        D NPPPYLEDMYHYA+KIEDQLKEEKE+SKRYTS+ NT+SNSN WNK  FVNRNES+ PK KFVAAKR+E E
Subjt:  DANPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFVAAKRMEGE

XP_023544048.1 uncharacterized protein LOC111803745 [Cucurbita pepo subsp. pepo]8.0e-2160.95Show/hide
Query:  MEMMMEDRQERRAQQQREERAL--------QEDDANPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFVAAK
        ME MME    R  ++    R L           D NPPPYLEDMYHYA+KIEDQLKEEKE+SKRYTS+ N +SNS  WNK  FVNRNES+ PK KFVAAK
Subjt:  MEMMMEDRQERRAQQQREERAL--------QEDDANPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFVAAK

Query:  RMEGE
        R+E E
Subjt:  RMEGE

TrEMBL top hitse value%identityAlignment
A0A5A7TLJ3 Reverse transcriptase6.2e-1970.83Show/hide
Query:  DANPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFVAAKRMEGE
        D NPP  +EDMYHYA+KIE QLKEEKE SKRY SK +T+S+SN WNK GFVNRNES+Q +GKFVAA+++E E
Subjt:  DANPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFVAAKRMEGE

A0A5D3C8C6 CCHC-type domain-containing protein6.2e-1970.83Show/hide
Query:  DANPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFVAAKRMEGE
        D NPP  +EDMYHYA+KIE QLKEEKE SKRY SK +T+S+SN WNK GFVNRNES+Q +GKFVAA+++E E
Subjt:  DANPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFVAAKRMEGE

A0A6J1DGU9 uncharacterized protein LOC1110207531.9e-3137.66Show/hide
Query:  DGVPQVPQDPNMVILQAIQGMMEMMMEDRQERRAQQQREERALQEDDA----------------------------------------------------
        DGVPQVPQDPNMVILQAIQGMMEMMMEDRQERRAQQQREERALQEDDA                                                    
Subjt:  DGVPQVPQDPNMVILQAIQGMMEMMMEDRQERRAQQQREERALQEDDA----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------NPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFV
                                              NPPPYLEDMYHY VKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPK KFV
Subjt:  --------------------------------------NPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFV

Query:  AAKRMEGE
        AAKRMEGE
Subjt:  AAKRMEGE

A0A6J1DX75 uncharacterized protein LOC1110253037.5e-3340.88Show/hide
Query:  DGVPQVPQDPNMVILQAIQGMMEMMMEDRQERRAQQQREERALQEDDA----------------------------------------------------
        DGVPQVPQDPN VILQAIQGMMEMMMEDRQERRAQQQREERALQEDDA                                                    
Subjt:  DGVPQVPQDPNMVILQAIQGMMEMMMEDRQERRAQQQREERALQEDDA----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----NPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFVAAKRMEGE
            NPP YLEDMYHYAVKIEDQ KEEKEYSKRYT KANTYSNSNAW KAGFVNRNES+QPKGKFVAAKRMEGE
Subjt:  ----NPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFVAAKRMEGE

A0A6J1EVI6 LOW QUALITY PROTEIN: uncharacterized protein LOC1114384598.7e-2160.95Show/hide
Query:  MEMMMEDRQERRAQQQREERAL--------QEDDANPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFVAAK
        ME MME    R  ++    R L           D NPPPYLEDM HYA+KIEDQLKEEKE+SKRYTS+ NT+SNS  WNK  FVNRNES+ PK KFVAAK
Subjt:  MEMMMEDRQERRAQQQREERAL--------QEDDANPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPKGKFVAAK

Query:  RMEGE
        R+E E
Subjt:  RMEGE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGTTTGATTCCGATGTTCTACAGAGCAATGAAGAAAAACAGGATCCGGAGTCAATACTCAGTGCTTTCGCCGGACGTCGCCGCCGCTTCTCAATCTAATTACAA
CATTGCTGACTTCTACGTCAAACCGCCCCCCTGCCACGTCTACCCCAAAACTTCTTCTGCTTCTGCAGCACCAGATCTCGGCCCCCGCGGCGGGGCTCACCGGAGATACA
ACTCCTTCGGCGCTTTCTCGGCGGAGGATCAGGCCAACACCCGCAATGGGCCACCGCCACAGCTTGTCAGGTTTGATGTTAAACCGATTCATTGGATTAGTCTCGATCAA
GCTAATTCTGGAGATGGAGTACCACAAGTACCGCAAGATCCTAACATGGTGATTCTTCAAGCAATTCAAGGAATGATGGAAATGATGATGGAAGATAGACAAGAAAGGAG
AGCGCAACAACAAAGAGAAGAACGAGCCTTACAAGAAGATGATGCAAATCCACCGCCTTACCTAGAAGATATGTACCATTATGCTGTCAAAATTGAAGATCAATTGAAGG
AAGAAAAGGAGTATTCAAAAAGGTACACATCTAAAGCTAATACATATTCCAATTCTAATGCTTGGAATAAGGCTGGTTTTGTGAATAGGAATGAATCATTGCAGCCAAAA
GGAAAGTTCGTAGCTGCCAAAAGAATGGAGGGAGAAAGATTTGAAAGACTTTTGGTGTTCCTCATGTTGAACGACCCTTTGGCCTATTTAAAGGCTTGTAATCACCTTGT
TTTGGAGGATTCAAAACATATTAAAATTCAAGAACTTTTGTTCCTTAACACTTTTGGTGTTGCTTCTTTCAAATCTTGCTTTAGCCAGTTGGTTGAGGTGGCTGAGATTG
AGGTGATGTTAGTCGTGGTTAGGCTCGAGGTGGTGTTGGTTATGGTGAAGTTGGCCCAGGTGGCTAAGACCTGGGATCAAGGTGATGTTGGCCGAGGATCAAGAGGAGGG
AGAATGTATAAAAAAAAAAATGGAATTGGGATCCCGGACCGGGATTGGAGGATCCCGGTCCGGGTTAGAAGGGATCCCGGTCCAGGATTGAATCATCCCGGACCAGGTTA
CTATTCTATCCGGGATTCTGCAATCTCGGTCCGGGATGGTTCAATCCCGGACCGGGATCCTTCTAACCCGGACCGGGATCCTCCATCTCGTCTTCAGATGCCTCTCTTGG
ACTCATATGTCTCCTCTGTTGTGGCACTCTTTCAAGTCTTTCCAATCCTTGAACGCGAGTGTTCATCATTTGTAAGGATTTGCCATACTGCCTCTTTTGCTGTTTCACAA
GCTGCAATATACTTGACCTTCATTGTGGAGTCGGCGATGCATCTTTCTTTGACGCTTCATCATACTATAGCTCCTCCATTAAGAGTGAACACTGATCCCGACATGGATTT
CTTAGAATCCTTGTCAGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGTTTGATTCCGATGTTCTACAGAGCAATGAAGAAAAACAGGATCCGGAGTCAATACTCAGTGCTTTCGCCGGACGTCGCCGCCGCTTCTCAATCTAATTACAA
CATTGCTGACTTCTACGTCAAACCGCCCCCCTGCCACGTCTACCCCAAAACTTCTTCTGCTTCTGCAGCACCAGATCTCGGCCCCCGCGGCGGGGCTCACCGGAGATACA
ACTCCTTCGGCGCTTTCTCGGCGGAGGATCAGGCCAACACCCGCAATGGGCCACCGCCACAGCTTGTCAGGTTTGATGTTAAACCGATTCATTGGATTAGTCTCGATCAA
GCTAATTCTGGAGATGGAGTACCACAAGTACCGCAAGATCCTAACATGGTGATTCTTCAAGCAATTCAAGGAATGATGGAAATGATGATGGAAGATAGACAAGAAAGGAG
AGCGCAACAACAAAGAGAAGAACGAGCCTTACAAGAAGATGATGCAAATCCACCGCCTTACCTAGAAGATATGTACCATTATGCTGTCAAAATTGAAGATCAATTGAAGG
AAGAAAAGGAGTATTCAAAAAGGTACACATCTAAAGCTAATACATATTCCAATTCTAATGCTTGGAATAAGGCTGGTTTTGTGAATAGGAATGAATCATTGCAGCCAAAA
GGAAAGTTCGTAGCTGCCAAAAGAATGGAGGGAGAAAGATTTGAAAGACTTTTGGTGTTCCTCATGTTGAACGACCCTTTGGCCTATTTAAAGGCTTGTAATCACCTTGT
TTTGGAGGATTCAAAACATATTAAAATTCAAGAACTTTTGTTCCTTAACACTTTTGGTGTTGCTTCTTTCAAATCTTGCTTTAGCCAGTTGGTTGAGGTGGCTGAGATTG
AGGTGATGTTAGTCGTGGTTAGGCTCGAGGTGGTGTTGGTTATGGTGAAGTTGGCCCAGGTGGCTAAGACCTGGGATCAAGGTGATGTTGGCCGAGGATCAAGAGGAGGG
AGAATGTATAAAAAAAAAAATGGAATTGGGATCCCGGACCGGGATTGGAGGATCCCGGTCCGGGTTAGAAGGGATCCCGGTCCAGGATTGAATCATCCCGGACCAGGTTA
CTATTCTATCCGGGATTCTGCAATCTCGGTCCGGGATGGTTCAATCCCGGACCGGGATCCTTCTAACCCGGACCGGGATCCTCCATCTCGTCTTCAGATGCCTCTCTTGG
ACTCATATGTCTCCTCTGTTGTGGCACTCTTTCAAGTCTTTCCAATCCTTGAACGCGAGTGTTCATCATTTGTAAGGATTTGCCATACTGCCTCTTTTGCTGTTTCACAA
GCTGCAATATACTTGACCTTCATTGTGGAGTCGGCGATGCATCTTTCTTTGACGCTTCATCATACTATAGCTCCTCCATTAAGAGTGAACACTGATCCCGACATGGATTT
CTTAGAATCCTTGTCAGTTTAG
Protein sequenceShow/hide protein sequence
MEGLIPMFYRAMKKNRIRSQYSVLSPDVAAASQSNYNIADFYVKPPPCHVYPKTSSASAAPDLGPRGGAHRRYNSFGAFSAEDQANTRNGPPPQLVRFDVKPIHWISLDQ
ANSGDGVPQVPQDPNMVILQAIQGMMEMMMEDRQERRAQQQREERALQEDDANPPPYLEDMYHYAVKIEDQLKEEKEYSKRYTSKANTYSNSNAWNKAGFVNRNESLQPK
GKFVAAKRMEGERFERLLVFLMLNDPLAYLKACNHLVLEDSKHIKIQELLFLNTFGVASFKSCFSQLVEVAEIEVMLVVVRLEVVLVMVKLAQVAKTWDQGDVGRGSRGG
RMYKKKNGIGIPDRDWRIPVRVRRDPGPGLNHPGPGYYSIRDSAISVRDGSIPDRDPSNPDRDPPSRLQMPLLDSYVSSVVALFQVFPILERECSSFVRICHTASFAVSQ
AAIYLTFIVESAMHLSLTLHHTIAPPLRVNTDPDMDFLESLSV