; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g20030 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g20030
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr6:15677476..15679529
RNA-Seq ExpressionMoc06g20030
SyntenyMoc06g20030
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GEV82879.1 reverse transcriptase domain-containing protein [Tanacetum cinerariifolium]1.8e-2440.26Show/hide
Query:  VGVEKLRLPKPSIEDPPVLELKTLPQHLKYAYRGLSETLPIIIAANLPLEKEHMLFNVFKTHQRAIGWTIEDINGISP---NEQVLAIGVRNEDEEPWFA
        + VE+++  K SI++PP LELK LP HL+Y +   ++ LP+II+  L  E++  L  V K+H+ AI W I DI GI P     ++L          PWF 
Subjt:  VGVEKLRLPKPSIEDPPVLELKTLPQHLKYAYRGLSETLPIIIAANLPLEKEHMLFNVFKTHQRAIGWTIEDINGISP---NEQVLAIGVRNEDEEPWFA

Query:  DLANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKVIRRCIAKAE
        D+A Y         M+ QQ K+F  + K Y WD PYL++I  D+VIRRC+   E
Subjt:  DLANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKVIRRCIAKAE

GEZ15114.1 reverse transcriptase domain-containing protein [Tanacetum cinerariifolium]3.3e-2339.61Show/hide
Query:  KPSIEDPPVLELKTLPQHLKYAYRGLSETLPIIIAANLPLEKEHMLFNVFKTHQRAIGWTIEDINGISP---NEQVLAIGVRNEDE--------EPWFAD
        K SI++PPV+ELK LP +L+YA+  + +  PII A +L  E++  +  V K+H+RA+ W + DI GI+P     ++L     N  E         PWFA 
Subjt:  KPSIEDPPVLELKTLPQHLKYAYRGLSETLPIIIAANLPLEKEHMLFNVFKTHQRAIGWTIEDINGISP---NEQVLAIGVRNEDE--------EPWFAD

Query:  LANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKVIRRCIAKAEA
         ANY A  ++   M+ QQ  +F  + K Y WD P+L++I  D+VIRRC+   EA
Subjt:  LANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKVIRRCIAKAEA

GEZ19255.1 reverse transcriptase domain-containing protein [Tanacetum cinerariifolium]2.6e-2339.49Show/hide
Query:  KPSIEDPPVLELKTLPQHLKYAYRGLSETLPIIIAANLPLEKEHMLFNVFKTHQRAIGWTIED--------------INGISPNEQVLAIGVRNEDEEPW
        K SI++PP +ELK LP HL+YA+      LP IIA  L  E++  L  V K+H+RAI W + D              IN   P E +  +  R   + PW
Subjt:  KPSIEDPPVLELKTLPQHLKYAYRGLSETLPIIIAANLPLEKEHMLFNVFKTHQRAIGWTIED--------------INGISPNEQVLAIGVRNEDEEPW

Query:  FADLANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKVIRRCIAKAEA
        F D ANY A   L   M+ QQ  +F  + K Y WD P+L+++  D+VIRRC+   EA
Subjt:  FADLANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKVIRRCIAKAEA

KAB2613120.1 hypothetical protein D8674_035436 [Pyrus ussuriensis x Pyrus communis]1.1e-2640.91Show/hide
Query:  MKFPAESEEFSVLKI---LDEALMEELETEVMLERLEAVGVEKLRLPK----------------PSIEDPPVLELKTLPQHLKYAYRGLSETLPIIIAAN
        +K PAE EE S +++   +D A   E E E  ++ + A+    +  PK                PSIE  P LELK LP+HLKY + G S+TLP+IIAA+
Subjt:  MKFPAESEEFSVLKI---LDEALMEELETEVMLERLEAVGVEKLRLPK----------------PSIEDPPVLELKTLPQHLKYAYRGLSETLPIIIAAN

Query:  LPLEKEHMLFNVFKTHQRAIGWTIEDINGISPN--EQVLAIGVRNEDEEPWFADLANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKV
        L   +E  L +V K ++  +G TI DI GISP   EQ+ AI    +D  PW+AD+ANY+   +LP  ++ QQ K+FL   KFY WD  YLY+   D++
Subjt:  LPLEKEHMLFNVFKTHQRAIGWTIEDINGISPN--EQVLAIGVRNEDEEPWFADLANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKV

XP_022159162.1 uncharacterized protein LOC111025586 [Momordica charantia]2.9e-2774.39Show/hide
Query:  DINGISPNEQVLAIGVRNEDEEPWFADLANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKVIRRCIAKAEAR
        +I G  P+EQVL +    E+EEPWFADLANYI+S ILPA MNKQQLKRFLY+SK YLWD PYLYRI QDKVIRRC+A AEAR
Subjt:  DINGISPNEQVLAIGVRNEDEEPWFADLANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKVIRRCIAKAEAR

TrEMBL top hitse value%identityAlignment
A0A5N5GIH9 Uncharacterized protein5.4e-2740.91Show/hide
Query:  MKFPAESEEFSVLKI---LDEALMEELETEVMLERLEAVGVEKLRLPK----------------PSIEDPPVLELKTLPQHLKYAYRGLSETLPIIIAAN
        +K PAE EE S +++   +D A   E E E  ++ + A+    +  PK                PSIE  P LELK LP+HLKY + G S+TLP+IIAA+
Subjt:  MKFPAESEEFSVLKI---LDEALMEELETEVMLERLEAVGVEKLRLPK----------------PSIEDPPVLELKTLPQHLKYAYRGLSETLPIIIAAN

Query:  LPLEKEHMLFNVFKTHQRAIGWTIEDINGISPN--EQVLAIGVRNEDEEPWFADLANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKV
        L   +E  L +V K ++  +G TI DI GISP   EQ+ AI    +D  PW+AD+ANY+   +LP  ++ QQ K+FL   KFY WD  YLY+   D++
Subjt:  LPLEKEHMLFNVFKTHQRAIGWTIEDINGISPN--EQVLAIGVRNEDEEPWFADLANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKV

A0A699IXS7 Reverse transcriptase domain-containing protein (Fragment)3.0e-2236.81Show/hide
Query:  KPSIEDPPVLELKTLPQHLKYAYRGLSETLPIIIAANLPLEKEHMLFNVFKTHQRAIGWTIEDINGISPNEQVLAIGVRNE-------------------
        K SI+  P +ELK LP HL+Y +    + LP+IIA +L +E++  L  V K+H++AI W + DINGI        I + ++                   
Subjt:  KPSIEDPPVLELKTLPQHLKYAYRGLSETLPIIIAANLPLEKEHMLFNVFKTHQRAIGWTIEDINGISPNEQVLAIGVRNE-------------------

Query:  -DEEPWFADLANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKVIRRCIAKAEA
            PWFAD ANY A   +   M+ QQ  +F  + K Y WD P+L++I  D+VIRRC+   EA
Subjt:  -DEEPWFADLANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKVIRRCIAKAEA

A0A6J1DZ22 uncharacterized protein LOC1110255861.4e-2774.39Show/hide
Query:  DINGISPNEQVLAIGVRNEDEEPWFADLANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKVIRRCIAKAEAR
        +I G  P+EQVL +    E+EEPWFADLANYI+S ILPA MNKQQLKRFLY+SK YLWD PYLYRI QDKVIRRC+A AEAR
Subjt:  DINGISPNEQVLAIGVRNEDEEPWFADLANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKVIRRCIAKAEAR

A0A6L2K3Z6 Retrovirus-related Pol polyprotein from transposon 17.63.4e-2139.16Show/hide
Query:  KPSIEDPPVLELKTLPQHLKYAYRGLSETLPIIIAANLPLEKEHMLFNVFKTHQRAIGWTIEDINGISPNEQVLAIGVRNEDEEPWFADLANYIASVILP
        K SIE+P  LELK LP HL+Y +   ++ LP+IIA  L   +   L  V K+H+RAI W I DI            G  +    PWFAD+AN+ +   + 
Subjt:  KPSIEDPPVLELKTLPQHLKYAYRGLSETLPIIIAANLPLEKEHMLFNVFKTHQRAIGWTIEDINGISPNEQVLAIGVRNEDEEPWFADLANYIASVILP

Query:  ASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKVIRRCIAKAEA
          +  QQ K+F  + K Y WD PYL+RI  +++I +C+   EA
Subjt:  ASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKVIRRCIAKAEA

A0A6L2NIP5 Reverse transcriptase domain-containing protein4.0e-2232.18Show/hide
Query:  LKILDEALMEELETEVMLERLEAVGVEKLRLPKPSIEDPPVLELKTLPQHLKYAYRGLSETLPIIIAANLPLEKEHMLFNVFKTHQRAIGWTIEDINGIS
        +++ +E L ++  + + L+ L+ V   + +  K  I++PP LELK LP HL+YA+   +  L +II+ NL  +++  L  V K+H+ AI W + DI G  
Subjt:  LKILDEALMEELETEVMLERLEAVGVEKLRLPKPSIEDPPVLELKTLPQHLKYAYRGLSETLPIIIAANLPLEKEHMLFNVFKTHQRAIGWTIEDINGIS

Query:  ---------------------------PNEQVLAIGVRNEDEEPWFADLANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKVIRRCIAKA
                                   P + +  +  R +   PWF + ANY A   +   M+ QQ K+F  + K Y WD PYL+RIG D+VI+RC+   
Subjt:  ---------------------------PNEQVLAIGVRNEDEEPWFADLANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKVIRRCIAKA

Query:  EA
        EA
Subjt:  EA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATTTCCTGCTGAATCTGAAGAGTTCTCAGTACTAAAGATCTTAGATGAAGCATTAATGGAGGAGCTGGAAACAGAAGTCATGTTGGAACGTCTAGAAGCAGTTGG
CGTCGAAAAGCTTAGATTACCCAAGCCATCTATTGAGGACCCGCCAGTGCTGGAGCTTAAAACATTGCCACAACATCTGAAATATGCTTACCGGGGTTTATCAGAGACAT
TGCCAATCATCATAGCAGCGAACTTACCCTTAGAAAAGGAACATATGCTGTTCAATGTATTCAAGACACATCAGAGAGCAATTGGATGGACTATTGAGGATATCAATGGG
ATAAGCCCCAATGAGCAAGTGTTGGCTATTGGAGTTCGAAATGAAGATGAGGAACCATGGTTTGCAGATCTAGCCAACTATATTGCTAGTGTAATTCTGCCAGCAAGCAT
GAACAAACAACAACTGAAAAGGTTTTTGTATGAGAGTAAGTTTTATTTGTGGGATGGCCCATATCTCTATAGGATTGGCCAAGATAAAGTGATCAGAAGATGCATCGCCA
AGGCTGAAGCACGATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAATTTCCTGCTGAATCTGAAGAGTTCTCAGTACTAAAGATCTTAGATGAAGCATTAATGGAGGAGCTGGAAACAGAAGTCATGTTGGAACGTCTAGAAGCAGTTGG
CGTCGAAAAGCTTAGATTACCCAAGCCATCTATTGAGGACCCGCCAGTGCTGGAGCTTAAAACATTGCCACAACATCTGAAATATGCTTACCGGGGTTTATCAGAGACAT
TGCCAATCATCATAGCAGCGAACTTACCCTTAGAAAAGGAACATATGCTGTTCAATGTATTCAAGACACATCAGAGAGCAATTGGATGGACTATTGAGGATATCAATGGG
ATAAGCCCCAATGAGCAAGTGTTGGCTATTGGAGTTCGAAATGAAGATGAGGAACCATGGTTTGCAGATCTAGCCAACTATATTGCTAGTGTAATTCTGCCAGCAAGCAT
GAACAAACAACAACTGAAAAGGTTTTTGTATGAGAGTAAGTTTTATTTGTGGGATGGCCCATATCTCTATAGGATTGGCCAAGATAAAGTGATCAGAAGATGCATCGCCA
AGGCTGAAGCACGATAA
Protein sequenceShow/hide protein sequence
MKFPAESEEFSVLKILDEALMEELETEVMLERLEAVGVEKLRLPKPSIEDPPVLELKTLPQHLKYAYRGLSETLPIIIAANLPLEKEHMLFNVFKTHQRAIGWTIEDING
ISPNEQVLAIGVRNEDEEPWFADLANYIASVILPASMNKQQLKRFLYESKFYLWDGPYLYRIGQDKVIRRCIAKAEAR