; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g10200 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g10200
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr7:7820955..7823665
RNA-Seq ExpressionMoc07g10200
SyntenyMoc07g10200
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046201.1 gag/pol protein [Cucumis melo var. makuwa]6.6e-4244.87Show/hide
Query:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN
        M+ SI+ LL +QKLNG+NY  WK NLNT LV++DLRFVL E+C QA A  A   VR AYDRW+KAN+KA+VYI+A++S+VLAKKHE   TAKEIMDSL  
Subjt:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN

Query:  MFGQPSSQARHEALK------VERGCHRRA---------KSGQLYSGISFEEFPAIPQQCGSTEVRPLEPSLSSSG-SKTIKKKKAAGKGFKPDSATAAP
        MFGQPS   +HEA+K      ++ G   R             + +  ++  +   +      TE   +  S S +    ++ K K  GK          P
Subjt:  MFGQPSSQARHEALK------VERGCHRRA---------KSGQLYSGISFEEFPAIPQQCGSTEVRPLEPSLSSSG-SKTIKKKKAAGKGFKPDSATAAP

Query:  KKGKAKVAEKGKCFHCNMDGHRKRNCPKYLPNRR
           K K   KGKC+HCN DGH  RNCPKYL  ++
Subjt:  KKGKAKVAEKGKCFHCNMDGHRKRNCPKYLPNRR

TYK14981.1 DNA-binding protein HEXBP-like [Cucumis melo var. makuwa]1.5e-4150.68Show/hide
Query:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN
        MS+SIIALL   +L  ENY  WKS LN  LVI DLRFVL E+C      NA  +VR+AYDRW KANDKA++YILA +SN+L+KKHE  VTA++IMDSL+ 
Subjt:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN

Query:  MFGQPSSQARHEALKVERGCHRRAKSGQLYSGISFEEFPAIPQQCGSTEVRPLEPSLSSSGSKTIKKKKAAGKGFKPDSATAAPKKGKAKVAEKGKCFHC
        MFGQPS Q                                I Q+      R   P  SSSGSK I+K+K  GKG  P    A   KGKAKVA KGKCFHC
Subjt:  MFGQPSSQARHEALKVERGCHRRAKSGQLYSGISFEEFPAIPQQCGSTEVRPLEPSLSSSGSKTIKKKKAAGKGFKPDSATAAPKKGKAKVAEKGKCFHC

Query:  NMDGHRKRNCPKYLPNRRK
        N++ H KRNCPKYL  +++
Subjt:  NMDGHRKRNCPKYLPNRRK

XP_022152352.1 uncharacterized protein LOC111020095 [Momordica charantia]4.3e-4989.57Show/hide
Query:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN
        MS SIIALL AQ+LNGENYKQWKSNLNT LVIDDL+FVLQEDC QA APNATVAVR AYDRWIKANDKAKVYILAS+S+VLAKKHEDT+TAKEIMDSLQ+
Subjt:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN

Query:  MFGQPSSQARHEALK
        MFGQPSSQARHEALK
Subjt:  MFGQPSSQARHEALK

XP_022158062.1 uncharacterized protein LOC111024637 [Momordica charantia]2.4e-4484.21Show/hide
Query:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN
        MS SII LL AQKLN ENYKQWKSN+NT L+IDDLRFVLQEDC QA APNATVAVRN YDRWIKANDKAKV ILAS+S+VLAKKHE++V  KEIMDSLQ+
Subjt:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN

Query:  MFGQPSSQARHEAL
        MFGQPSSQARHEAL
Subjt:  MFGQPSSQARHEAL

XP_022158197.1 uncharacterized protein LOC111024734 [Momordica charantia]4.7e-4888.7Show/hide
Query:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN
        MSASIIALL AQKLNGENY+QWKSNLNT LVIDDLRFVLQEDC QA   NATVAVRNAYDRWIK+NDKAKVYILAS+S+VLAKKHEDTVT KEIMDSLQ+
Subjt:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN

Query:  MFGQPSSQARHEALK
        MFGQPS QARHEALK
Subjt:  MFGQPSSQARHEALK

TrEMBL top hitse value%identityAlignment
A0A5A7TXW7 Gag/pol protein3.2e-4244.87Show/hide
Query:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN
        M+ SI+ LL +QKLNG+NY  WK NLNT LV++DLRFVL E+C QA A  A   VR AYDRW+KAN+KA+VYI+A++S+VLAKKHE   TAKEIMDSL  
Subjt:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN

Query:  MFGQPSSQARHEALK------VERGCHRRA---------KSGQLYSGISFEEFPAIPQQCGSTEVRPLEPSLSSSG-SKTIKKKKAAGKGFKPDSATAAP
        MFGQPS   +HEA+K      ++ G   R             + +  ++  +   +      TE   +  S S +    ++ K K  GK          P
Subjt:  MFGQPSSQARHEALK------VERGCHRRA---------KSGQLYSGISFEEFPAIPQQCGSTEVRPLEPSLSSSG-SKTIKKKKAAGKGFKPDSATAAP

Query:  KKGKAKVAEKGKCFHCNMDGHRKRNCPKYLPNRR
           K K   KGKC+HCN DGH  RNCPKYL  ++
Subjt:  KKGKAKVAEKGKCFHCNMDGHRKRNCPKYLPNRR

A0A5D3CT18 DNA-binding protein HEXBP-like7.1e-4250.68Show/hide
Query:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN
        MS+SIIALL   +L  ENY  WKS LN  LVI DLRFVL E+C      NA  +VR+AYDRW KANDKA++YILA +SN+L+KKHE  VTA++IMDSL+ 
Subjt:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN

Query:  MFGQPSSQARHEALKVERGCHRRAKSGQLYSGISFEEFPAIPQQCGSTEVRPLEPSLSSSGSKTIKKKKAAGKGFKPDSATAAPKKGKAKVAEKGKCFHC
        MFGQPS Q                                I Q+      R   P  SSSGSK I+K+K  GKG  P    A   KGKAKVA KGKCFHC
Subjt:  MFGQPSSQARHEALKVERGCHRRAKSGQLYSGISFEEFPAIPQQCGSTEVRPLEPSLSSSGSKTIKKKKAAGKGFKPDSATAAPKKGKAKVAEKGKCFHC

Query:  NMDGHRKRNCPKYLPNRRK
        N++ H KRNCPKYL  +++
Subjt:  NMDGHRKRNCPKYLPNRRK

A0A6J1DFZ2 uncharacterized protein LOC1110200952.1e-4989.57Show/hide
Query:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN
        MS SIIALL AQ+LNGENYKQWKSNLNT LVIDDL+FVLQEDC QA APNATVAVR AYDRWIKANDKAKVYILAS+S+VLAKKHEDT+TAKEIMDSLQ+
Subjt:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN

Query:  MFGQPSSQARHEALK
        MFGQPSSQARHEALK
Subjt:  MFGQPSSQARHEALK

A0A6J1DW68 uncharacterized protein LOC1110246371.2e-4484.21Show/hide
Query:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN
        MS SII LL AQKLN ENYKQWKSN+NT L+IDDLRFVLQEDC QA APNATVAVRN YDRWIKANDKAKV ILAS+S+VLAKKHE++V  KEIMDSLQ+
Subjt:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN

Query:  MFGQPSSQARHEAL
        MFGQPSSQARHEAL
Subjt:  MFGQPSSQARHEAL

A0A6J1DWL0 uncharacterized protein LOC1110247342.3e-4888.7Show/hide
Query:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN
        MSASIIALL AQKLNGENY+QWKSNLNT LVIDDLRFVLQEDC QA   NATVAVRNAYDRWIK+NDKAKVYILAS+S+VLAKKHEDTVT KEIMDSLQ+
Subjt:  MSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATVAVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQN

Query:  MFGQPSSQARHEALK
        MFGQPS QARHEALK
Subjt:  MFGQPSSQARHEALK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGCAGCGCCATGGCGCTGCAGGGATAGCACACGGCGCCATGACGTTGCACTGTAGCTCCGCGGCGCTGTGCAGCCTCATGGCGCCATCCCTGGGCGCCGCG
GCATTGCTGGTGCGGCATTTTGCTGCAGCAGCGCCGAGGCGCTGTCCCGGCATGTCTGCTTCCATTATTGCACTCCTACCCGCTCAAAAACTTAACGGCGAGAAT
TACAAACAATGGAAATCAAACCTAAATACTACTCTCGTGATAGATGATCTTAGGTTTGTCTTGCAAGAGGATTGTTCTCAAGCTCTTGCGCCTAACGCCACTGTG
GCGGTGCGCAACGCCTATGACAGGTGGATCAAGGCCAATGACAAGGCCAAGGTCTACATCTTGGCGAGCGTATCTAATGTGCTTGCCAAGAAGCATGAGGACACG
GTCACCGCTAAGGAGATCATGGACTCACTGCAGAACATGTTTGGACAACCGTCCTCACAGGCTAGACATGAAGCCCTTAAAGTTGAACGGGGCTGTCATAGACGA
GCAAAGTCAGGTCAGCTTTATTCTGGAATCTCTTTCGAAGAGTTTCCTGCCATTCCGCAGCAATGTGGTTCAACCGAGGTTCGTCCTTTGGAACCAAGTCTCTCT
TCTTCTGGAAGTAAGACTATTAAGAAGAAGAAGGCTGCTGGTAAGGGGTTTAAACCTGACTCCGCTACTGCCGCTCCCAAGAAAGGCAAGGCCAAGGTTGCAGAG
AAAGGAAAGTGTTTCCACTGCAATATGGACGGGCATCGGAAGCGCAACTGCCCAAAGTACTTGCCGAACAGAAGAAAGCCAACGAAGGAGCCACTAATCACGTTT
GTTCTTCATTTCAGGGAATTAGTTCCTAGAGGCAGCTTGACGCCGGAGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGCAGCGCCATGGCGCTGCAGGGATAGCACACGGCGCCATGACGTTGCACTGTAGCTCCGCGGCGCTGTGCAGCCTCATGGCGCCATCCCTGGGCGCCGCG
GCATTGCTGGTGCGGCATTTTGCTGCAGCAGCGCCGAGGCGCTGTCCCGGCATGTCTGCTTCCATTATTGCACTCCTACCCGCTCAAAAACTTAACGGCGAGAAT
TACAAACAATGGAAATCAAACCTAAATACTACTCTCGTGATAGATGATCTTAGGTTTGTCTTGCAAGAGGATTGTTCTCAAGCTCTTGCGCCTAACGCCACTGTG
GCGGTGCGCAACGCCTATGACAGGTGGATCAAGGCCAATGACAAGGCCAAGGTCTACATCTTGGCGAGCGTATCTAATGTGCTTGCCAAGAAGCATGAGGACACG
GTCACCGCTAAGGAGATCATGGACTCACTGCAGAACATGTTTGGACAACCGTCCTCACAGGCTAGACATGAAGCCCTTAAAGTTGAACGGGGCTGTCATAGACGA
GCAAAGTCAGGTCAGCTTTATTCTGGAATCTCTTTCGAAGAGTTTCCTGCCATTCCGCAGCAATGTGGTTCAACCGAGGTTCGTCCTTTGGAACCAAGTCTCTCT
TCTTCTGGAAGTAAGACTATTAAGAAGAAGAAGGCTGCTGGTAAGGGGTTTAAACCTGACTCCGCTACTGCCGCTCCCAAGAAAGGCAAGGCCAAGGTTGCAGAG
AAAGGAAAGTGTTTCCACTGCAATATGGACGGGCATCGGAAGCGCAACTGCCCAAAGTACTTGCCGAACAGAAGAAAGCCAACGAAGGAGCCACTAATCACGTTT
GTTCTTCATTTCAGGGAATTAGTTCCTAGAGGCAGCTTGACGCCGGAGAGATGA
Protein sequenceShow/hide protein sequence
MQQRHGAAGIAHGAMTLHCSSAALCSLMAPSLGAAALLVRHFAAAAPRRCPGMSASIIALLPAQKLNGENYKQWKSNLNTTLVIDDLRFVLQEDCSQALAPNATV
AVRNAYDRWIKANDKAKVYILASVSNVLAKKHEDTVTAKEIMDSLQNMFGQPSSQARHEALKVERGCHRRAKSGQLYSGISFEEFPAIPQQCGSTEVRPLEPSLS
SSGSKTIKKKKAAGKGFKPDSATAAPKKGKAKVAEKGKCFHCNMDGHRKRNCPKYLPNRRKPTKEPLITFVLHFRELVPRGSLTPER