CuGenDBv2

Moc11g26100 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc11g26100
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr11:19166987..19168537
RNA-Seq Expression: Moc11g26100
Synteny: Moc11g26100
Gene Ontology terms:
  GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
  GO:0004190 - aspartic-type endopeptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
  IPR043502 - DNA/RNA polymerase superfamily
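The genome location string in the record can be split into a chromosome name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for GFF-derived annotations, but not stated on this page). The function names `parse_location` and `span_length` are hypothetical and only serve this illustration.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return chrom, start, end


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr11:19166987..19168537")
print(chrom, start, end, span_length(start, end))
# prints: chr11 19166987 19168537 1551
```

Under that assumption the gene spans 1551 bp on chr11. With 0-based half-open coordinates the length would instead be `end - start`.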


Homology
GenBank top hits
KAA0045217.1 uncharacterized protein E6C27_scaffold30G002260 [Cucumis melo var. makuwa] | e value: 1.7e-28 | % identity: 64.08
Query:  WDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKD
        W+KD R+MKAVNSA LPI+ + KR  ++L  WSG VDFV+V+MDDFDVVL MEFLLEH+VIP+PL K LV+TG  P+VV T ++QP G+KMIS +QLKK 
Subjt:  WDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKD

Query:  VAR
        ++R
Subjt:  VAR

KAA0045217.1 uncharacterized protein E6C27_scaffold30G002260 [Cucumis melo var. makuwa] | e value: 2.1e-05 | % identity: 76.32
Query:  DVARLFEVETDASDFALGGVLLQDDHPIAYESWMLNDA
        DV + FEVETDASD+ALGGVLLQ+ HPIAYES  LN A
Subjt:  DVARLFEVETDASDFALGGVLLQDDHPIAYESWMLNDA

KAA0045217.1 uncharacterized protein E6C27_scaffold30G002260 [Cucumis melo var. makuwa] | e value: 2.3e-28 | % identity: 66.67
Query:  DKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKDV
        +KDT KMKAVNS  LPI+ V+KRV++KL  W+G  DFV+VRMDDFDVVL MEFL+EHKVIP+PL K ++VT + PTVV  SIKQP G++MIS LQLKK +
Subjt:  DKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKDV

Query:  AR
         R
Subjt:  AR

KAA0066677.1 uncharacterized protein E6C27_scaffold271G00020 [Cucumis melo var. makuwa] | e value: 1.7e-28 | % identity: 64.08
Query:  WDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKD
        W+KD  +MKAVNSA LPI+ + KR  ++LE WSG VDFV+V+MDDFDVVL MEFLLEH+VIP+PL K LV+TG  P+VV T ++QP G+KMIS +QLKK 
Subjt:  WDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKD

Query:  VAR
        ++R
Subjt:  VAR

XP_022154605.1 uncharacterized protein LOC111021829 [Momordica charantia] | e value: 1.3e-15 | % identity: 77.59
Query:  DVARLFEVETDASDFALGGVLLQDDHPIAYESWMLNDAERKYAASEKKMLAVVHCLRA
        DV + FEVETDASD+ALGGVLLQD HPIAYES  LN+AER+Y  SEK+MLAVVHCLR+
Subjt:  DVARLFEVETDASDFALGGVLLQDDHPIAYESWMLNDAERKYAASEKKMLAVVHCLRA

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia] | e value: 2.8e-34 | % identity: 80.58
Query:  WDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKD
        WDKD  KMKAVNSA LPIM VAKRVSVKL  WSG VDFVIVRMDDFDVVL ++FLLEHKVIP+PL K LVVT SDP VV TSIKQPSGVKMIS LQLKK 
Subjt:  WDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKD

Query:  VAR
        +A+
Subjt:  VAR

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia] | e value: 2.8e-18 | % identity: 87.93
Query:  DVARLFEVETDASDFALGGVLLQDDHPIAYESWMLNDAERKYAASEKKMLAVVHCLRA
        DV R FEVETDASDFALGGVLLQD HPIAYES  LNDAER+YAASEK+MLAVVHCLRA
Subjt:  DVARLFEVETDASDFALGGVLLQDDHPIAYESWMLNDAERKYAASEKKMLAVVHCLRA

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia] | e value: 1.3e-28 | % identity: 62.14
Query:  WDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKD
        W+KD  +MKAVNSA+LPI+ + K+  ++L  WSG VDF++V+MDDFDVVL MEFLLEH+VIP+PL K LV+TG  P+VV T ++QP G+KMIST+QLKK 
Subjt:  WDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKD

Query:  VAR
        ++R
Subjt:  VAR

TrEMBL top hits
A0A5A7TTJ4 Reverse transcriptase domain-containing protein | e value: 8.4e-29 | % identity: 64.08
Query:  WDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKD
        W+KD R+MKAVNSA LPI+ + KR  ++L  WSG VDFV+V+MDDFDVVL MEFLLEH+VIP+PL K LV+TG  P+VV T ++QP G+KMIS +QLKK 
Subjt:  WDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKD

Query:  VAR
        ++R
Subjt:  VAR

A0A5A7TTJ4 Reverse transcriptase domain-containing protein | e value: 1.0e-05 | % identity: 76.32
Query:  DVARLFEVETDASDFALGGVLLQDDHPIAYESWMLNDA
        DV + FEVETDASD+ALGGVLLQ+ HPIAYES  LN A
Subjt:  DVARLFEVETDASDFALGGVLLQDDHPIAYESWMLNDA

A0A5A7TTJ4 Reverse transcriptase domain-containing protein | e value: 8.4e-29 | % identity: 64.08
Query:  WDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKD
        W+KD  +MKAVNSA LPI+ + KR  ++LE WSG VDFV+V+MDDFDVVL MEFLLEH+VIP+PL K LV+TG  P+VV T ++QP G+KMIS +QLKK 
Subjt:  WDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKD

Query:  VAR
        ++R
Subjt:  VAR

A0A5A7U3I4 Reverse transcriptase domain-containing protein | e value: 1.1e-28 | % identity: 64.08
Query:  WDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKD
        W+KD  +MKAVNSA LPI+ + KR+ ++L  WSG VDFV+V+MDDFDVVL MEFLLEH+VIP+PL K LV+TGS P+VV T ++QP G+KMIS +QLKK 
Subjt:  WDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKD

Query:  VAR
        ++R
Subjt:  VAR

A0A6J1DLQ6 uncharacterized protein LOC111022320 | e value: 1.3e-34 | % identity: 80.58
Query:  WDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKD
        WDKD  KMKAVNSA LPIM VAKRVSVKL  WSG VDFVIVRMDDFDVVL ++FLLEHKVIP+PL K LVVT SDP VV TSIKQPSGVKMIS LQLKK 
Subjt:  WDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKD

Query:  VAR
        +A+
Subjt:  VAR

A0A6J1DLQ6 uncharacterized protein LOC111022320 | e value: 1.3e-18 | % identity: 87.93
Query:  DVARLFEVETDASDFALGGVLLQDDHPIAYESWMLNDAERKYAASEKKMLAVVHCLRA
        DV R FEVETDASDFALGGVLLQD HPIAYES  LNDAER+YAASEK+MLAVVHCLRA
Subjt:  DVARLFEVETDASDFALGGVLLQDDHPIAYESWMLNDAERKYAASEKKMLAVVHCLRA

A0A6J1DLQ6 uncharacterized protein LOC111022320 | e value: 6.5e-29 | % identity: 62.14
Query:  WDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKD
        W+KD  +MKAVNSA+LPI+ + K+  ++L  WSG VDF++V+MDDFDVVL MEFLLEH+VIP+PL K LV+TG  P+VV T ++QP G+KMIST+QLKK 
Subjt:  WDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKD

Query:  VAR
        ++R
Subjt:  VAR

SwissProt top hits
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | e value: 3.9e-07 | % identity: 50.94
Query:  DVARLFEVETDASDFALGGVLLQDDHPIAYESWMLNDAERKYAASEKKMLAVV
        D  + F + TDASD ALG VL QD HP++Y S  LN+ E  Y+  EK++LA+V
Subjt:  DVARLFEVETDASDFALGGVLLQDDHPIAYESWMLNDAERKYAASEKKMLAVV

P0CT41 Transposon Tf2-12 polyprotein | e value: 4.1e-04 | % identity: 40.32
Query:  DVARLFEVETDASDFALGGVLLQ---DD--HPIAYESWMLNDAERKYAASEKKMLAVVHCLR
        D ++   +ETDASD A+G VL Q   DD  +P+ Y S  ++ A+  Y+ S+K+MLA++  L+
Subjt:  DVARLFEVETDASDFALGGVLLQ---DD--HPIAYESWMLNDAERKYAASEKKMLAVVHCLR

P20825 Retrovirus-related Pol polyprotein from transposon 297 | e value: 1.1e-06 | % identity: 50.94
Query:  DVARLFEVETDASDFALGGVLLQDDHPIAYESWMLNDAERKYAASEKKMLAVV
        D  + F + TDAS+ ALG VL Q+ HPI++ S  LND E  Y+A EK++LA+V
Subjt:  DVARLFEVETDASDFALGGVLLQDDHPIAYESWMLNDAERKYAASEKKMLAVV

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus | e value: 9.7e-06 | % identity: 50.91
Query:  FEVETDASDFALGGVLLQD----DHPIAYESWMLNDAERKYAASEKKMLAVVHCL
        F + TDAS++A+G VL QD    D PIAY S  LN  E  YA  EK+MLA++  L
Subjt:  FEVETDASDFALGGVLLQD----DHPIAYESWMLNDAERKYAASEKKMLAVVHCL

Q9UR07 Transposon Tf2-11 polyprotein | e value: 4.1e-04 | % identity: 40.32
Query:  DVARLFEVETDASDFALGGVLLQ---DD--HPIAYESWMLNDAERKYAASEKKMLAVVHCLR
        D ++   +ETDASD A+G VL Q   DD  +P+ Y S  ++ A+  Y+ S+K+MLA++  L+
Subjt:  DVARLFEVETDASDFALGGVLLQ---DD--HPIAYESWMLNDAERKYAASEKKMLAVVHCLR

Arabidopsis top hits
No hits found
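
The % identity values listed for the hits above can be related to the alignment midline, where a letter marks an identical residue, `+` marks a similar residue, and a space marks a mismatch or gap. The sketch below assumes identity is computed as identical columns divided by the total number of alignment columns (gaps included), which is the usual BLAST convention but is not stated on this page. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query, midline):
    """Percent identity from a BLAST-style midline.

    ASSUMPTION: identity = identical columns / alignment columns * 100,
    where the alignment length is the length of the aligned query row
    (gap characters included) and letters in the midline mark identities.
    """
    if not query:
        raise ValueError("empty alignment")
    if len(midline) > len(query):
        raise ValueError("midline is longer than the aligned query")
    # Trailing spaces in the midline are often stripped in text dumps,
    # so pad it back to the query length before counting.
    padded = midline.ljust(len(query))
    identical = sum(1 for ch in padded if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# Query row and midline transcribed from the P04323 hit above.
QUERY = "DVARLFEVETDASDFALGGVLLQDDHPIAYESWMLNDAERKYAASEKKMLAVV"
MIDLINE = "D  + F + TDASD ALG VL QD HP++Y S  LN+ E  Y+  EK++LA+V"
print(percent_identity(QUERY, MIDLINE))
```

If the midline spacing is transcribed intact, this should agree with the % identity listed for that hit. The subject rows are not needed for the calculation, since the midline already encodes which columns match.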

Sequences
CDS sequence
ATGTATGTTGAAGCATGGGATAAGGACACAAGAAAGATGAAAGCTGTCAACTCGGCAATCCTCCCCATCATGAGAGTTGCTAAGAGAGTCTCAGTAAAACTAGAGGTATG
GAGTGGACAAGTCGATTTTGTGATAGTGCGGATGGATGACTTCGACGTAGTATTGGAAATGGAATTTCTGCTGGAACACAAAGTCATCCCTGTGCCTCTTGTTAAGTACC
TGGTTGTAACGGGTTCCGACCCCACAGTTGTTTGGACAAGCATCAAACAACCAAGCGGAGTGAAGATGATCTCAACACTCCAACTAAAGAAAGATGTTGCAAGACTTTTT
GAAGTCGAGACTGATGCTTCAGACTTTGCCCTGGGAGGAGTGCTTCTCCAAGACGACCACCCTATTGCATACGAGAGTTGGATGTTGAATGATGCGGAAAGAAAGTATGC
TGCCTCCGAGAAAAAGATGTTAGCAGTAGTCCACTGCTTGAGGGCCTGA
mRNA sequence
ATGTATGTTGAAGCATGGGATAAGGACACAAGAAAGATGAAAGCTGTCAACTCGGCAATCCTCCCCATCATGAGAGTTGCTAAGAGAGTCTCAGTAAAACTAGAGGTATG
GAGTGGACAAGTCGATTTTGTGATAGTGCGGATGGATGACTTCGACGTAGTATTGGAAATGGAATTTCTGCTGGAACACAAAGTCATCCCTGTGCCTCTTGTTAAGTACC
TGGTTGTAACGGGTTCCGACCCCACAGTTGTTTGGACAAGCATCAAACAACCAAGCGGAGTGAAGATGATCTCAACACTCCAACTAAAGAAAGATGTTGCAAGACTTTTT
GAAGTCGAGACTGATGCTTCAGACTTTGCCCTGGGAGGAGTGCTTCTCCAAGACGACCACCCTATTGCATACGAGAGTTGGATGTTGAATGATGCGGAAAGAAAGTATGC
TGCCTCCGAGAAAAAGATGTTAGCAGTAGTCCACTGCTTGAGGGCCTGA
Protein sequence
MYVEAWDKDTRKMKAVNSAILPIMRVAKRVSVKLEVWSGQVDFVIVRMDDFDVVLEMEFLLEHKVIPVPLVKYLVVTGSDPTVVWTSIKQPSGVKMISTLQLKKDVARLF
EVETDASDFALGGVLLQDDHPIAYESWMLNDAERKYAASEKKMLAVVHCLRA
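
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame at its first base. The names `CODON_TABLE` and `translate` are hypothetical and written for this illustration only.

```python
from itertools import product

# Standard genetic code, built in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped input
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGTATGTTGAAGCATGGGATAAGGACACA"))
# prints: MYVEAWDKDT
```

Passing the full CDS (line breaks are ignored) and comparing the result with the listed protein sequence is a quick consistency check on the record.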