; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g02210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g02210
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCCHC-type domain-containing protein
Genome locationchr7:1734281..1740697
RNA-Seq ExpressionMoc07g02210
SyntenyMoc07g02210
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5758504.1 putative RNA-directed DNA polymerase [Helianthus annuus]2.3e-3454.07Show/hide
Query:  GGPMESSGGSSRGSKKSSMSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR---------------------------I
        G P   S   S G+ K    DE+WE++DLRAA+AIR  LAKN+L NV+G+STAK+LWEKLE LYQ KGISNR                           I
Subjt:  GGPMESSGGSSRGSKKSSMSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR---------------------------I

Query:  IFELEAIEVKIDDEDKALRLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEGRTSRFDTELV
        + ELEAI VK++DEDKALRLILSL  SYEHMKPILMYGK+ L +A+ T KLL E++RL S G TS   T L+
Subjt:  IFELEAIEVKIDDEDKALRLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEGRTSRFDTELV

QHN81458.1 Retrovirus-related Pol polyprotein [Arachis hypogaea]3.5e-3559.06Show/hide
Query:  SSMSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR---------------------------IIFELEAIEVKIDDEDK
        S M DE+WEE+DLRAA+AIR  LAKN+L NV G+ TAKELW+KLE LYQ+KGISNR                           I+ ELEAI VKIDDEDK
Subjt:  SSMSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR---------------------------IIFELEAIEVKIDDEDK

Query:  ALRLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEGRTS
        ALRLILSL  SYE++KP+LMYGK+ LNF E  SKL+ E+RR+K+EG TS
Subjt:  ALRLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEGRTS

QHO24915.1 Retrovirus-related Pol polyprotein [Arachis hypogaea]1.3e-3458.94Show/hide
Query:  KKSSMSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR---------------------------IIFELEAIEVKIDDE
        K S M DE+WEE+DLRAA+AIR  LAKN+L NV GM TAKELW KLE LYQAK ISNR                           I+ ELEAI VKIDDE
Subjt:  KKSSMSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR---------------------------IIFELEAIEVKIDDE

Query:  DKALRLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEGRTS
        DKALRLILSL  SYE++KP+LMYGK+ LNF E  SKL+ E+RR+K++G TS
Subjt:  DKALRLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEGRTS

XP_022139673.1 uncharacterized protein LOC111010521 [Momordica charantia]2.4e-6074.21Show/hide
Query:  QGVEGRPSEVASEKLSSDGGPMESSGGSSRGSKKSSMSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR----------
        + ++GRPSE ASEKLS DGGPMESSGGSSRGSKKSSMS EDWEEMDLRAA+AIRTSLAKNIL NV+ +STAKELWEKLEALYQAKGISNR          
Subjt:  QGVEGRPSEVASEKLSSDGGPMESSGGSSRGSKKSSMSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR----------

Query:  -----------------IIFELEAIEVKIDDEDKALRLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEGRTSRFDTELV
                         IIFELEAIEVKIDDEDKALRLILSL  SYEHMKPILMYGKD LNFAE TSKLL E+RRLKSEGRTS  D+ LV
Subjt:  -----------------IIFELEAIEVKIDDEDKALRLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEGRTSRFDTELV

XP_025611318.1 LOW QUALITY PROTEIN: uncharacterized protein LOC112703916 [Arachis hypogaea]1.3e-3459.86Show/hide
Query:  MSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR---------------------------IIFELEAIEVKIDDEDKAL
        M DE+WEE+DLRA +AIR  LAKN+L NV GM TAKELW+KLE L+QAKGISNR                           II ELEAIEVKIDDEDKAL
Subjt:  MSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR---------------------------IIFELEAIEVKIDDEDKAL

Query:  RLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEGRTS
         LILSL  SYE++KP+LMYGK+ LNF E  SKL+ E+RR+K+EG TS
Subjt:  RLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEGRTS

TrEMBL top hitse value%identityAlignment
A0A2K3L7F8 Cytochrome p4504.8e-3049.09Show/hide
Query:  SSMSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR---------------------------IIFELEAIEVKIDDEDK
        S M  + WEE+DLRAA+AIR  LAKN+L NVY +S+AKELWE+LE LYQAK ISNR                           I+ ELE+I+V+IDDEDK
Subjt:  SSMSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR---------------------------IIFELEAIEVKIDDEDK

Query:  ALRLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEGRTSRFDTELVRQKVLVKGS
         LRLI SL  SY H+KP+L YGK+ LNF E  +K++ E+RR+KS+  TS     L R  ++ + S
Subjt:  ALRLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEGRTSRFDTELVRQKVLVKGS

A0A2P5C765 Uncharacterized protein3.7e-3054.86Show/hide
Query:  MSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR---------------------------IIFELEAIEVKIDDEDKAL
        MSD DW+++D RAA+AIR  LAKN+L NV G++TAK+LW KLE LYQAKG+SNR                           I+ ELEAI VKI+DEDKAL
Subjt:  MSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR---------------------------IIFELEAIEVKIDDEDKAL

Query:  RLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEG
        R I S+ PSYEHMKPIL++GK+ + F+E TSKLL E+RRL   G
Subjt:  RLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEG

A0A444XD23 Uncharacterized protein2.7e-3358.28Show/hide
Query:  KKSSMSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR---------------------------IIFELEAIEVKIDDE
        K S M DE+ EE+DLRAA+AI   LAKN+L NV GM TAKELW+KLE LYQAKGISNR                           I+ ELEAI VKIDDE
Subjt:  KKSSMSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR---------------------------IIFELEAIEVKIDDE

Query:  DKALRLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEGRTS
        DKALRLILSL  SYE++K +LMYGK+ LNF E  SKL+ E+RR+K+EG TS
Subjt:  DKALRLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEGRTS

A0A6A2ZV50 Scarecrow-like protein 325.7e-3158.27Show/hide
Query:  SDGGPMESSGGSSRGSKKSSMSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR----IIFELEAIEVKIDDEDKALRLI
        S+G   +    SS    KS MS+E+WEE+D+RAA+ IR  LAKN+L NV   S+ KELWEKLE +YQAK +SN     I+ ELE+I V+IDDEDKALRLI
Subjt:  SDGGPMESSGGSSRGSKKSSMSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR----IIFELEAIEVKIDDEDKALRLI

Query:  LSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKS
         SL  SYEHM+ +LMYGK+ +NF E TSKL+ E+RRLK+
Subjt:  LSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKS

A0A6J1CG82 uncharacterized protein LOC1110105211.2e-6074.21Show/hide
Query:  QGVEGRPSEVASEKLSSDGGPMESSGGSSRGSKKSSMSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR----------
        + ++GRPSE ASEKLS DGGPMESSGGSSRGSKKSSMS EDWEEMDLRAA+AIRTSLAKNIL NV+ +STAKELWEKLEALYQAKGISNR          
Subjt:  QGVEGRPSEVASEKLSSDGGPMESSGGSSRGSKKSSMSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR----------

Query:  -----------------IIFELEAIEVKIDDEDKALRLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEGRTSRFDTELV
                         IIFELEAIEVKIDDEDKALRLILSL  SYEHMKPILMYGKD LNFAE TSKLL E+RRLKSEGRTS  D+ LV
Subjt:  -----------------IIFELEAIEVKIDDEDKALRLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEGRTSRFDTELV

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.7e-1432.48Show/hide
Query:  KKSSMSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR---------------------------IIFELEAIEVKIDDE
        K  +M  EDW ++D RAA+AIR  L+ +++ N+    TA+ +W +LE+LY +K ++N+                           +I +L  + VKI++E
Subjt:  KKSSMSDEDWEEMDLRAANAIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNR---------------------------IIFELEAIEVKIDDE

Query:  DKALRLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLL-EKRRLKSEGRTSRFDTE
        DKA+ L+ SL  SY+++   +++GK  +   + TS LLL EK R K E +     TE
Subjt:  DKALRLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLL-EKRRLKSEGRTSRFDTE

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATTTGCACAGTGTTGCGACGCTATGTTGTAACTCTGTGGTGCTGCCTCCTCCGCCAAAAGGCCCCTGCAGCCCGCTTCTTCGATCCTCTTTCATTCTGTCTTCGTC
TTGTCTCCGTTTTGTTGGATTGACGGCAAGTGCAAGTCAAGGATGTGCTGATACAATCTGGGTTACACAAGGCGTTGAAGGAAGACCGAGTGAAGTTGCTTCTGAAAAGC
TAAGCAGTGATGGTGGTCCAATGGAATCTAGTGGTGGTTCCAGTAGAGGTTCTAAAAAGTCCAGCATGAGTGATGAAGATTGGGAGGAAATGGATTTGAGGGCTGCAAAT
GCAATACGAACAAGTTTGGCTAAGAATATTCTTCTGAATGTGTATGGAATGTCGACAGCCAAAGAACTTTGGGAGAAGCTCGAAGCATTGTATCAGGCAAAGGGCATCTC
AAATCGCATCATCTTTGAGCTGGAGGCGATCGAAGTGAAGATAGATGACGAAGATAAAGCACTCAGGCTCATCTTATCACTTCTACCTTCTTATGAACACATGAAGCCGA
TCTTGATGTATGGTAAGGATCCTTTGAATTTTGCTGAGGCTACTAGTAAACTGTTGTTAGAGAAAAGAAGACTGAAGAGTGAAGGGCGTACTTCAAGATTTGACACTGAG
CTGGTTCGTCAAAAGGTTCTGGTGAAAGGATCGAATCCCAAGCGGAAGTGGTTGATCGCTTGTGATTCAACTTGGGTTAACACAACTCCGAACTCAGGCGTGCACTCGCT
GGAGCCGGATGTAATCGAAGAAGTTGCTCAGAAATTCTATCAAAAGCTCATAATATTCCCCATGCATAAGGTTGTTCTCTTTCTTATATATGGAGATAGAGACATGTTCG
AGAGAGCTTCCCCTCGCCTCAAAGGTGTATGTGCGAGTACTGATATGTCGTGCCACGGGTCTGGGGAATCACCAACATTCTCTGCTTCTCACAGTACTACGGATAGGGGG
GCTTGGTGGCATCCTAAGTGGTCTCGTGAGGTATATCACAGTTTGGATACAGAGTGTGTCAACACCATCTTGGCCACTAATTGTCCTCTGTATAGAGGACATCCAAGAGC
ACTCGACCTCATTCTCAATGTGGTCCGCGTGCCCAGTTGTTTTCTCGACTCAATCCATGTGGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCATTTGCACAGTGTTGCGACGCTATGTTGTAACTCTGTGGTGCTGCCTCCTCCGCCAAAAGGCCCCTGCAGCCCGCTTCTTCGATCCTCTTTCATTCTGTCTTCGTC
TTGTCTCCGTTTTGTTGGATTGACGGCAAGTGCAAGTCAAGGATGTGCTGATACAATCTGGGTTACACAAGGCGTTGAAGGAAGACCGAGTGAAGTTGCTTCTGAAAAGC
TAAGCAGTGATGGTGGTCCAATGGAATCTAGTGGTGGTTCCAGTAGAGGTTCTAAAAAGTCCAGCATGAGTGATGAAGATTGGGAGGAAATGGATTTGAGGGCTGCAAAT
GCAATACGAACAAGTTTGGCTAAGAATATTCTTCTGAATGTGTATGGAATGTCGACAGCCAAAGAACTTTGGGAGAAGCTCGAAGCATTGTATCAGGCAAAGGGCATCTC
AAATCGCATCATCTTTGAGCTGGAGGCGATCGAAGTGAAGATAGATGACGAAGATAAAGCACTCAGGCTCATCTTATCACTTCTACCTTCTTATGAACACATGAAGCCGA
TCTTGATGTATGGTAAGGATCCTTTGAATTTTGCTGAGGCTACTAGTAAACTGTTGTTAGAGAAAAGAAGACTGAAGAGTGAAGGGCGTACTTCAAGATTTGACACTGAG
CTGGTTCGTCAAAAGGTTCTGGTGAAAGGATCGAATCCCAAGCGGAAGTGGTTGATCGCTTGTGATTCAACTTGGGTTAACACAACTCCGAACTCAGGCGTGCACTCGCT
GGAGCCGGATGTAATCGAAGAAGTTGCTCAGAAATTCTATCAAAAGCTCATAATATTCCCCATGCATAAGGTTGTTCTCTTTCTTATATATGGAGATAGAGACATGTTCG
AGAGAGCTTCCCCTCGCCTCAAAGGTGTATGTGCGAGTACTGATATGTCGTGCCACGGGTCTGGGGAATCACCAACATTCTCTGCTTCTCACAGTACTACGGATAGGGGG
GCTTGGTGGCATCCTAAGTGGTCTCGTGAGGTATATCACAGTTTGGATACAGAGTGTGTCAACACCATCTTGGCCACTAATTGTCCTCTGTATAGAGGACATCCAAGAGC
ACTCGACCTCATTCTCAATGTGGTCCGCGTGCCCAGTTGTTTTCTCGACTCAATCCATGTGGCTTAA
Protein sequenceShow/hide protein sequence
MHLHSVATLCCNSVVLPPPPKGPCSPLLRSSFILSSSCLRFVGLTASASQGCADTIWVTQGVEGRPSEVASEKLSSDGGPMESSGGSSRGSKKSSMSDEDWEEMDLRAAN
AIRTSLAKNILLNVYGMSTAKELWEKLEALYQAKGISNRIIFELEAIEVKIDDEDKALRLILSLLPSYEHMKPILMYGKDPLNFAEATSKLLLEKRRLKSEGRTSRFDTE
LVRQKVLVKGSNPKRKWLIACDSTWVNTTPNSGVHSLEPDVIEEVAQKFYQKLIIFPMHKVVLFLIYGDRDMFERASPRLKGVCASTDMSCHGSGESPTFSASHSTTDRG
AWWHPKWSREVYHSLDTECVNTILATNCPLYRGHPRALDLILNVVRVPSCFLDSIHVA