; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g16770 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g16770
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr6:13191083..13196486
RNA-Seq ExpressionMoc06g16770
SyntenyMoc06g16770
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
MCI18708.1 hypothetical protein [Trifolium medium]2.7e-0741.94Show/hide
Query:  GDQLWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILPPPTRV-RNYQKPFRFKECWVANPACEQIISEQGNWSNIDLYYSFSDSI
        GD +   LDR L +D FL  F    V HLP   SDH A+ + +  P  RV R  ++PFRF+E W +N  CE +I     WS   L  SFSD +
Subjt:  GDQLWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILPPPTRV-RNYQKPFRFKECWVANPACEQIISEQGNWSNIDLYYSFSDSI

XP_022158772.1 uncharacterized protein LOC111025237 [Momordica charantia]2.0e-1861.04Show/hide
Query:  FIGDQLWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILP-PPTRVRNYQKPFRFKECWVANPACEQIIS
        F GDQLWK LDRFL NDSF   FPDA++ HLPWSKSDH AI L++   P ++++   KP RF+E WV NP CEQ+IS
Subjt:  FIGDQLWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILP-PPTRVRNYQKPFRFKECWVANPACEQIIS

XP_023881794.1 uncharacterized protein LOC111994169 [Quercus suber]1.2e-0737.65Show/hide
Query:  IGDQLWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILPPPTRVRNYQKPFRFKECWVANPACEQIISEQ-GNWSNID
        I + + + +DRF  N  +   +PDA VTHLP   SDHC + +   PP  +  N  +PFRF+E W+++ +   I+S   GN  N+D
Subjt:  IGDQLWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILPPPTRVRNYQKPFRFKECWVANPACEQIISEQ-GNWSNID

XP_023888364.1 uncharacterized protein LOC112000452 [Quercus suber]9.3e-0833.94Show/hide
Query:  YLEFIGDQLWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILPPPTRVRNYQKPFRFKECWVANPACEQIIS---EQGNWSNIDLYYSFSDSI
        Y +F G Q+ + LDR L    +L  FP A   HL  S SDHC +AL+ +  P + R Y KP RF+  W+ N  C++++    E+G   N    +  +  +
Subjt:  YLEFIGDQLWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILPPPTRVRNYQKPFRFKECWVANPACEQIIS---EQGNWSNIDLYYSFSDSI

Query:  HACSSALRD
         +C   L D
Subjt:  HACSSALRD

XP_023911264.1 uncharacterized protein LOC112022870, partial [Quercus suber]7.8e-0735.58Show/hide
Query:  GDQLWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILPPPTRVRNYQKPFRFKECWVANPACEQII----SEQGNWSNIDLYYSFSDSIHACS
        G  LW+ LDR L N+ +L  +    V HL  S SDHC   L I+P      N +KPFRF+E W+A   C + +    S+QG+   +         I +C 
Subjt:  GDQLWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILPPPTRVRNYQKPFRFKECWVANPACEQII----SEQGNWSNIDLYYSFSDSIHACS

Query:  SALR
         AL+
Subjt:  SALR

TrEMBL top hitse value%identityAlignment
A0A2N9F6L9 Reverse transcriptase domain-containing protein3.8e-0746.48Show/hide
Query:  WKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILPPPTRVRNYQKPFRFKECWVANPACEQIISE
        W  LDR + N  +L  FP A V HL   KSDH  + LN   PP+  R  +KPFRF+E W+++  CEQ I E
Subjt:  WKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILPPPTRVRNYQKPFRFKECWVANPACEQIISE

A0A2N9FQ30 Uncharacterized protein2.9e-0741.43Show/hide
Query:  LWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILPPPTRVRNYQKPFRFKECWVANPACEQII
        +W+ LDR L N  ++  +P+A V HL    SDH  I L + P P + R +QKPFRF+E W+ N  C   +
Subjt:  LWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILPPPTRVRNYQKPFRFKECWVANPACEQII

A0A2N9H1U1 Reverse transcriptase domain-containing protein6.5e-0734.69Show/hide
Query:  LWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILPPPTRVRNYQKPFRFKECWVANPAC-EQIISEQGNWSNIDLYYSFSDSIHACSSALR
        +W+ LDR L    +  +FP+A + HL  + SDH  I L   P  T  R   +PFRF+E W++NP C E +++      N    +   D I  C   LR
Subjt:  LWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILPPPTRVRNYQKPFRFKECWVANPAC-EQIISEQGNWSNIDLYYSFSDSIHACSSALR

A0A2N9HW04 Reverse transcriptase domain-containing protein6.5e-0734.69Show/hide
Query:  LWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILPPPTRVRNYQKPFRFKECWVANPAC-EQIISEQGNWSNIDLYYSFSDSIHACSSALR
        +W+ LDR L    +  +FP+A + HL  + SDH  I L   P  T  R   +PFRF+E W++NP C E +++      N    +   D I  C   LR
Subjt:  LWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILPPPTRVRNYQKPFRFKECWVANPAC-EQIISEQGNWSNIDLYYSFSDSIHACSSALR

A0A6J1DY29 uncharacterized protein LOC1110252379.6e-1961.04Show/hide
Query:  FIGDQLWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILP-PPTRVRNYQKPFRFKECWVANPACEQIIS
        F GDQLWK LDRFL NDSF   FPDA++ HLPWSKSDH AI L++   P ++++   KP RF+E WV NP CEQ+IS
Subjt:  FIGDQLWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILP-PPTRVRNYQKPFRFKECWVANPACEQIIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAAGTATGAGGGCCGAGGTCCGCCTAAGTATCCAGAACGGTCCGGAGGACGAGTTCGAGCTGCAATCTGAAATACACTGTTGTGCATATCCTTGCATAAACAAAAT
CAAACCAAACGGTCCATTCTTGAGTTTAGGTCGAGTCGGGGACCGGAGGAAGAGCTCCATTTCAACTCATCCCCGCCAAGAGGAAGAGCTTCATTTCAGCTCATTCCCAT
CTAGAGGAAGAGCGCTTCATCTTAGCTCATCCCCGGCAAGAGAAAGAGCTGTCATCATCATAACTCAACCTCTCCAAGAGAAAGAGCTCGATCTCAGCTCATTCCCGCCT
AGAGGAAGAGCTACATCTCAGCTCATCCCGACAAGAGGAAGAGCTGCATCTCAGCTCATCCTCACTTATCCCCGCCAAGCGGAAGAGCTTAATCTCAGCTCATCTCCGGC
AAGAAAAAGAGCTGCCATCATCATAACTCAACCTCGCCAAGAGGAGGAGCTTGATCTCAACTCATCCCGGTTAGAGGAAGTATTGCTATCAGATAACTCAACCTCGCCAA
GAGGAAGAGTTTCGTCTCAGCTCATCCCACTCATCTCGCCTAGAGGAAAAGCTTCATATCAGATAACTCAACCTCACCAAGAGGAAGAGTTTTATCTCAGCTCATCCCCG
CCTTGTGGAAGAGCTTCATCTCAGCTCATCCCCGGCAAGAGAAAGAGCTGCTATCATCATAGCTCATCCCGGCAAGAGGAAGAGCTCAATCTCAGCTCATCCTCGCTAGA
AAAAGAATCAAACTTAGGGGCTGCCGACCTACATTCCAATGGGGGCCAAGTACTCTCACATGGCATAAGTGCCAAGGTCTTTCCTCGCCAAGTGGCCTTTCAGCTCATTC
CCGACACCAGTTCATCCCAATCAGCCGACCTTGACCCTTTGAGCTGGTCACCCCTTTCCAAGGAAACCCAAGCTCTTCCTGCGGTCATACCTGAAAAGGAGGTCATGCTC
TCCCGACGGTCATACCTGGAGTTTATAGGTGACCAGCTTTGGAAAATGTTAGACCGCTTTCTACGTAATGATTCCTTTCTCTTTTCTTTTCCAGATGCTGCTGTTACTCA
TCTTCCTTGGTCAAAATCGGACCATTGTGCAATTGCTCTGAATATCTTGCCCCCACCAACTCGAGTGAGGAATTATCAGAAACCATTTCGGTTTAAAGAATGCTGGGTTG
CAAATCCAGCCTGTGAACAAATAATTTCGGAACAGGGAAATTGGTCCAACATAGATCTCTACTATTCTTTCTCTGATAGTATACACGCCTGCTCATCGGCTCTTCGGGAT
TGA
mRNA sequenceShow/hide mRNA sequence
ATGCTAAGTATGAGGGCCGAGGTCCGCCTAAGTATCCAGAACGGTCCGGAGGACGAGTTCGAGCTGCAATCTGAAATACACTGTTGTGCATATCCTTGCATAAACAAAAT
CAAACCAAACGGTCCATTCTTGAGTTTAGGTCGAGTCGGGGACCGGAGGAAGAGCTCCATTTCAACTCATCCCCGCCAAGAGGAAGAGCTTCATTTCAGCTCATTCCCAT
CTAGAGGAAGAGCGCTTCATCTTAGCTCATCCCCGGCAAGAGAAAGAGCTGTCATCATCATAACTCAACCTCTCCAAGAGAAAGAGCTCGATCTCAGCTCATTCCCGCCT
AGAGGAAGAGCTACATCTCAGCTCATCCCGACAAGAGGAAGAGCTGCATCTCAGCTCATCCTCACTTATCCCCGCCAAGCGGAAGAGCTTAATCTCAGCTCATCTCCGGC
AAGAAAAAGAGCTGCCATCATCATAACTCAACCTCGCCAAGAGGAGGAGCTTGATCTCAACTCATCCCGGTTAGAGGAAGTATTGCTATCAGATAACTCAACCTCGCCAA
GAGGAAGAGTTTCGTCTCAGCTCATCCCACTCATCTCGCCTAGAGGAAAAGCTTCATATCAGATAACTCAACCTCACCAAGAGGAAGAGTTTTATCTCAGCTCATCCCCG
CCTTGTGGAAGAGCTTCATCTCAGCTCATCCCCGGCAAGAGAAAGAGCTGCTATCATCATAGCTCATCCCGGCAAGAGGAAGAGCTCAATCTCAGCTCATCCTCGCTAGA
AAAAGAATCAAACTTAGGGGCTGCCGACCTACATTCCAATGGGGGCCAAGTACTCTCACATGGCATAAGTGCCAAGGTCTTTCCTCGCCAAGTGGCCTTTCAGCTCATTC
CCGACACCAGTTCATCCCAATCAGCCGACCTTGACCCTTTGAGCTGGTCACCCCTTTCCAAGGAAACCCAAGCTCTTCCTGCGGTCATACCTGAAAAGGAGGTCATGCTC
TCCCGACGGTCATACCTGGAGTTTATAGGTGACCAGCTTTGGAAAATGTTAGACCGCTTTCTACGTAATGATTCCTTTCTCTTTTCTTTTCCAGATGCTGCTGTTACTCA
TCTTCCTTGGTCAAAATCGGACCATTGTGCAATTGCTCTGAATATCTTGCCCCCACCAACTCGAGTGAGGAATTATCAGAAACCATTTCGGTTTAAAGAATGCTGGGTTG
CAAATCCAGCCTGTGAACAAATAATTTCGGAACAGGGAAATTGGTCCAACATAGATCTCTACTATTCTTTCTCTGATAGTATACACGCCTGCTCATCGGCTCTTCGGGAT
TGA
Protein sequenceShow/hide protein sequence
MLSMRAEVRLSIQNGPEDEFELQSEIHCCAYPCINKIKPNGPFLSLGRVGDRRKSSISTHPRQEEELHFSSFPSRGRALHLSSSPARERAVIIITQPLQEKELDLSSFPP
RGRATSQLIPTRGRAASQLILTYPRQAEELNLSSSPARKRAAIIITQPRQEEELDLNSSRLEEVLLSDNSTSPRGRVSSQLIPLISPRGKASYQITQPHQEEEFYLSSSP
PCGRASSQLIPGKRKSCYHHSSSRQEEELNLSSSSLEKESNLGAADLHSNGGQVLSHGISAKVFPRQVAFQLIPDTSSSQSADLDPLSWSPLSKETQALPAVIPEKEVML
SRRSYLEFIGDQLWKMLDRFLRNDSFLFSFPDAAVTHLPWSKSDHCAIALNILPPPTRVRNYQKPFRFKECWVANPACEQIISEQGNWSNIDLYYSFSDSIHACSSALRD