CuGenDBv2

Moc05g22700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc05g22700
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr5:16237554..16238324
RNA-Seq Expression: Moc05g22700
Synteny: Moc05g22700
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
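
The genome location string in this record can be split into chromosome, start and end for downstream use. The following is a minimal sketch, not part of the database itself; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("chr5:16237554..16238324")
print(chrom, start, end, span_length(start, end))
```

If the coordinates were instead 0-based and half-open, the `+ 1` in `span_length` would need to be dropped.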


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039476.1 uncharacterized protein E6C27_scaffold64G002900 [Cucumis melo var. makuwa], e value 2.7e-29, identity 41.43%
Query:  DQDERVMARIAALGI-KMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWMK---------------------------------------------N
        D +    AR AA+G  +  + D +K + IERLKALGAT F+GTT+P D + W+                                              +
Subjt:  DQDERVMARIAALGI-KMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWMK---------------------------------------------N

Query:  WGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESV-WQEFRPLVEVAARVEKSLLS
        W E K  F + +YPRS+ DAKR EFL+L Q SMTVAEY+KKY ELSKYAT +IEDE +R +RFE+GL EEI++  T    W +F  LVE A RV KS L+
Subjt:  WGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESV-WQEFRPLVEVAARVEKSLLS

Query:  EHDRKTEADR
        E  R+ E  +
Subjt:  EHDRKTEADR

KAA0060484.1 Gag protease polyprotein-like protein [Cucumis melo var. makuwa], e value 7.5e-32, identity 42.45%
Query:  DQDERVMARIAALGI-KMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWMK---------------------------------------------N
        D +    AR AA+G  +  + D EK + IERLKALGAT F+GTT+PADA+ W+                                              +
Subjt:  DQDERVMARIAALGI-KMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWMK---------------------------------------------N

Query:  WGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESV-WQEFRPLVEVAARVEKSLLS
        W E K  F + +YPRS+ DAKR EFL+L Q SMT+AEY+KKY ELS YAT +IEDE +RC+RFE+GL EEI++  T    W +F  LVE A RVEKSL  
Subjt:  WGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESV-WQEFRPLVEVAARVEKSLLS

Query:  EHDRKTEADRAQ
         ++RK E + ++
Subjt:  EHDRKTEADRAQ

KAF8378904.1 hypothetical protein HHK36_030253 [Tetracentron sinense], e value 4.1e-30, identity 43.86%
Query:  AALGIKMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWMK------------NWGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVE
        A   +   R   E++  IE    L AT+FSG++DPADA+ W++             W + +  F E YY R+Y D K+REFL+LVQ  MTVA+Y++K+ E
Subjt:  AALGIKMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWMK------------NWGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVE

Query:  LSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESVWQEFRPLVEVAARVEKSLLSEHDRKTEADRAQKCSS
        LS++A+T++ +E DRCRRFEDGLH EI+S  T + W E+  LV+ A RVE+S ++EH R+ E  R +   S
Subjt:  LSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESVWQEFRPLVEVAARVEKSLLSEHDRKTEADRAQKCSS

TYJ95881.1 retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa], e value 1.9e-30, identity 40%
Query:  LQEDIVDQDERVMARIAALGIKMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWM------------------------------------------
        ++E++ +Q    +A+    GI+  + D EK +  ERLKALGAT F+GTT+P D + W+                                          
Subjt:  LQEDIVDQDERVMARIAALGIKMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWM------------------------------------------

Query:  ---KNWGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESV-WQEFRPLVEVAARVE
            +W E K  F + +YPRS+ DAK  EF++L Q +MTVAEY+KKY ELSKYAT +I DE +RC+RFE+GL EEI++  T    W +F  LVEVA RVE
Subjt:  ---KNWGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESV-WQEFRPLVEVAARVE

Query:  KSLLSEHDRKTEADR
        KS L+E  R+ EA +
Subjt:  KSLLSEHDRKTEADR

TYK15233.1 uncharacterized protein E5676_scaffold892G00030 [Cucumis melo var. makuwa], e value 2.7e-29, identity 41.43%
Query:  DQDERVMARIAALGI-KMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWMK---------------------------------------------N
        D +    AR AA+G  +  + D +K + IERLKALGAT F+GTT+P D + W+                                              +
Subjt:  DQDERVMARIAALGI-KMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWMK---------------------------------------------N

Query:  WGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESV-WQEFRPLVEVAARVEKSLLS
        W E K  F + +YPRS+ DAKR EFL+L Q SMTVAEY+KKY ELSKYAT +IEDE +R +RFE+GL EEI++  T    W +F  LVE A RV KS L+
Subjt:  WGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESV-WQEFRPLVEVAARVEKSLLS

Query:  EHDRKTEADR
        E  R+ E  +
Subjt:  EHDRKTEADR

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TBS0 CCHC-type domain-containing protein, e value 1.3e-29, identity 41.43%
Query:  DQDERVMARIAALGI-KMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWMK---------------------------------------------N
        D +    AR AA+G  +  + D +K + IERLKALGAT F+GTT+P D + W+                                              +
Subjt:  DQDERVMARIAALGI-KMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWMK---------------------------------------------N

Query:  WGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESV-WQEFRPLVEVAARVEKSLLS
        W E K  F + +YPRS+ DAKR EFL+L Q SMTVAEY+KKY ELSKYAT +IEDE +R +RFE+GL EEI++  T    W +F  LVE A RV KS L+
Subjt:  WGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESV-WQEFRPLVEVAARVEKSLLS

Query:  EHDRKTEADR
        E  R+ E  +
Subjt:  EHDRKTEADR

A0A5A7TL70 Reverse transcriptase, e value 5.4e-28, identity 39.71%
Query:  DQDERVMARIAALG-IKMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWM---------------------------------------------KN
        D +    AR AA+G I+  + + EK + I+RLKALGAT F+GTT+P DA+  +                                              +
Subjt:  DQDERVMARIAALG-IKMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWM---------------------------------------------KN

Query:  WGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESVWQEFRPLVEVAARVEKSLLSE
        W E K  F + +YP  + DAKR EFL+L+Q SMTV EY+KKY ELSKYAT +IEDE +RC+RFE+GL E     T+ + W +F  LVE A RV+KS L+E
Subjt:  WGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESVWQEFRPLVEVAARVEKSLLSE

Query:  HDRKTEADR
          R+ E  +
Subjt:  HDRKTEADR

A0A5A7UZM6 Gag protease polyprotein-like protein, e value 3.6e-32, identity 42.45%
Query:  DQDERVMARIAALGI-KMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWMK---------------------------------------------N
        D +    AR AA+G  +  + D EK + IERLKALGAT F+GTT+PADA+ W+                                              +
Subjt:  DQDERVMARIAALGI-KMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWMK---------------------------------------------N

Query:  WGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESV-WQEFRPLVEVAARVEKSLLS
        W E K  F + +YPRS+ DAKR EFL+L Q SMT+AEY+KKY ELS YAT +IEDE +RC+RFE+GL EEI++  T    W +F  LVE A RVEKSL  
Subjt:  WGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESV-WQEFRPLVEVAARVEKSLLS

Query:  EHDRKTEADRAQ
         ++RK E + ++
Subjt:  EHDRKTEADRAQ

A0A5D3BB91 Reverse transcriptase, e value 9.0e-31, identity 40%
Query:  LQEDIVDQDERVMARIAALGIKMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWM------------------------------------------
        ++E++ +Q    +A+    GI+  + D EK +  ERLKALGAT F+GTT+P D + W+                                          
Subjt:  LQEDIVDQDERVMARIAALGIKMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWM------------------------------------------

Query:  ---KNWGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESV-WQEFRPLVEVAARVE
            +W E K  F + +YPRS+ DAK  EF++L Q +MTVAEY+KKY ELSKYAT +I DE +RC+RFE+GL EEI++  T    W +F  LVEVA RVE
Subjt:  ---KNWGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESV-WQEFRPLVEVAARVE

Query:  KSLLSEHDRKTEADR
        KS L+E  R+ EA +
Subjt:  KSLLSEHDRKTEADR

A0A5D3CTK6 CCHC-type domain-containing protein, e value 1.3e-29, identity 41.43%
Query:  DQDERVMARIAALGI-KMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWMK---------------------------------------------N
        D +    AR AA+G  +  + D +K + IERLKALGAT F+GTT+P D + W+                                              +
Subjt:  DQDERVMARIAALGI-KMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWMK---------------------------------------------N

Query:  WGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESV-WQEFRPLVEVAARVEKSLLS
        W E K  F + +YPRS+ DAKR EFL+L Q SMTVAEY+KKY ELSKYAT +IEDE +R +RFE+GL EEI++  T    W +F  LVE A RV KS L+
Subjt:  WGELKNLFNEAYYPRSYTDAKRREFLKLVQRSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESV-WQEFRPLVEVAARVEKSLLS

Query:  EHDRKTEADR
        E  R+ E  +
Subjt:  EHDRKTEADR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCAGAAGACGTAGTTAGTAAGTCTAGCCACCGACTAGTAGATTTACAAGAAGATATAGTTGATCAAGATGAACGTGTAATGGCAAGAATTGCTGCATTGGGA
ATAAAAATGTTGAGGAGAGATCTCGAGAAAGACTTTAAGATTGAGCGGCTCAAAGCTTTAGGGGCGACTAATTTTTCTGGGACGACAGATCCAGCTGATGCAGAT
CCCTGGATGAAAAACTGGGGAGAATTGAAAAACTTGTTTAATGAAGCATACTACCCACGATCGTACACAGATGCAAAAAGAAGAGAGTTCTTAAAGCTGGTACAG
AGGTCGATGACAGTAGCAGAATATCAGAAAAAATATGTAGAACTTTCAAAGTATGCCACCACTATCATTGAAGATGAAACTGACCGATGTAGACGATTTGAGGAT
GGGTTGCATGAGGAGATTCAAAGCTGTACTACTGAATCTGTGTGGCAAGAATTTAGGCCCTTGGTGGAAGTTGCAGCGAGGGTTGAGAAGAGTTTGTTATCTGAA
CATGACCGGAAAACAGAAGCAGACAGAGCACAAAAATGCTCCAGTCGAGTATCGACTCGCACCACCATACACGAAGCGGCTACAAAAGAAATAGCAGGATGTGCA
ATTTAG
mRNA sequence
ATGGCAGAAGACGTAGTTAGTAAGTCTAGCCACCGACTAGTAGATTTACAAGAAGATATAGTTGATCAAGATGAACGTGTAATGGCAAGAATTGCTGCATTGGGA
ATAAAAATGTTGAGGAGAGATCTCGAGAAAGACTTTAAGATTGAGCGGCTCAAAGCTTTAGGGGCGACTAATTTTTCTGGGACGACAGATCCAGCTGATGCAGAT
CCCTGGATGAAAAACTGGGGAGAATTGAAAAACTTGTTTAATGAAGCATACTACCCACGATCGTACACAGATGCAAAAAGAAGAGAGTTCTTAAAGCTGGTACAG
AGGTCGATGACAGTAGCAGAATATCAGAAAAAATATGTAGAACTTTCAAAGTATGCCACCACTATCATTGAAGATGAAACTGACCGATGTAGACGATTTGAGGAT
GGGTTGCATGAGGAGATTCAAAGCTGTACTACTGAATCTGTGTGGCAAGAATTTAGGCCCTTGGTGGAAGTTGCAGCGAGGGTTGAGAAGAGTTTGTTATCTGAA
CATGACCGGAAAACAGAAGCAGACAGAGCACAAAAATGCTCCAGTCGAGTATCGACTCGCACCACCATACACGAAGCGGCTACAAAAGAAATAGCAGGATGTGCA
ATTTAG
Protein sequence
MAEDVVSKSSHRLVDLQEDIVDQDERVMARIAALGIKMLRRDLEKDFKIERLKALGATNFSGTTDPADADPWMKNWGELKNLFNEAYYPRSYTDAKRREFLKLVQ
RSMTVAEYQKKYVELSKYATTIIEDETDRCRRFEDGLHEEIQSCTTESVWQEFRPLVEVAARVEKSLLSEHDRKTEADRAQKCSSRVSTRTTIHEAATKEIAGCA
I
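
The relationship between the CDS and protein sequences listed in this record can be checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (assumed), laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS sequence in this record.
print(translate("ATGGCAGAAGACGTAGTT"))  # MAEDVV
```

Passing the full CDS, with its line breaks left in, and comparing the result with the protein sequence gives a quick consistency check on a downloaded record.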