CuGenDBv2

MS028535 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS028535
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold29:292982..293452
RNA-Seq Expression: MS028535
Synteny: MS028535
Gene Ontology terms: GO:0005739 - mitochondrion (cellular component)
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
CAN80093.1 hypothetical protein VITISV_010721 [Vitis vinifera] (e value: 1.4e-11; % identity: 39.2)
Query:  YLAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASD
        Y  WC+ G  N     S   G   L        ++DF D + DC+L+DLPL  ASFTWSNM   +E  +  RLD FL  ++WE     +IQ V   + SD
Subjt:  YLAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASD

Query:  HDLILLFYFEQLPSTVPFSFELMWL
        H  I+L          PF FE MWL
Subjt:  HDLILLFYFEQLPSTVPFSFELMWL

CAN83313.1 hypothetical protein VITISV_001463 [Vitis vinifera] (e value: 1.9e-11; % identity: 38.4)
Query:  YLAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASD
        Y  WC+ G  N     S   G   L        ++DF D + DC+L+DLPL  ASFTWSNM   +E  +  RLD FL  ++WE     ++Q V   + SD
Subjt:  YLAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASD

Query:  HDLILLFYFEQLPSTVPFSFELMWL
        H  I+L         +PF FE MWL
Subjt:  HDLILLFYFEQLPSTVPFSFELMWL

KAF3961445.1 hypothetical protein CMV_013935 [Castanea mollissima] (e value: 3.4e-13; % identity: 38.71)
Query:  LAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDH
        + WC  G  N     S  +G   L        +E+F++ V D +LVDLPL G SFTWSN   G +  +  R+D  L+  DWEDH  D  Q++     SDH
Subjt:  LAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDH

Query:  DLILLFYFEQLPSTVPFSFELMWL
          ILL          PF FE MWL
Subjt:  DLILLFYFEQLPSTVPFSFELMWL

KAF4349952.1 hypothetical protein G4B88_002374 [Cannabis sativa] (e value: 3.8e-41; % identity: 84.76)
Query:  LQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDHDLILLFYFEQLPSTVPFSFELMW
        L AEN+R G+EDFAD VADCD VDLPLYGASFTWSNMRKGKEMEIRCRLDLFLI SD ED LQDTI+K GTTFASDHDLILLF+FEQLPSTVPFSFELM 
Subjt:  LQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDHDLILLFYFEQLPSTVPFSFELMW

Query:  LSLSK
         + SK
Subjt:  LSLSK

RVX12865.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] (e value: 1.9e-11; % identity: 38.4)
Query:  YLAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASD
        Y  WC+ G  N     S   G   L        ++DF D + DC+L+DLPL  ASFTWSNM   +E  +  RLD FL  ++WE     ++Q V   + SD
Subjt:  YLAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASD

Query:  HDLILLFYFEQLPSTVPFSFELMWL
        H  I+L         +PF FE MWL
Subjt:  HDLILLFYFEQLPSTVPFSFELMWL

TrEMBL top hits (e value, % identity, alignment)
A0A2N9FQM2 Reverse transcriptase domain-containing protein (e value: 8.2e-13; % identity: 38.71)
Query:  LAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDH
        L WC+ G  N     S   G   + ++     + DF+D + + +LVDLPL G S+TWS+   G +     RLD FL+ SDWED   D  QK+     SDH
Subjt:  LAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDH

Query:  DLILLFYFEQLPSTVPFSFELMWL
          +LL     L    PF FE MWL
Subjt:  DLILLFYFEQLPSTVPFSFELMWL

A0A2N9HTW1 Reverse transcriptase domain-containing protein (e value: 4.0e-12; % identity: 37.1)
Query:  WCISGKSNECLMQSN--MKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDH
        WCI G  N     S   M+G LT       R + +F+D +++ +L+DLPL+   FTWSN    ++   + R+D FL+ +DWED     +QK    F SDH
Subjt:  WCISGKSNECLMQSN--MKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDH

Query:  DLILLFYFEQLPSTVPFSFELMWL
         LI L     +     F FE MWL
Subjt:  DLILLFYFEQLPSTVPFSFELMWL

A0A2N9II52 Uncharacterized protein (e value: 2.4e-12; % identity: 40.16)
Query:  WCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDHDL
        WCI G  N     S   G + L        ++DF+D ++DC L+D PL G  FTWSN R+   M    RLD FL   DW DHL    Q+      SDH  
Subjt:  WCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDHDL

Query:  ILLFYFEQLPSTVPFSFELMWL
        ILL     +    PF FE MWL
Subjt:  ILLFYFEQLPSTVPFSFELMWL

A0A7J6DV31 Uncharacterized protein (e value: 1.9e-41; % identity: 84.76)
Query:  LQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDHDLILLFYFEQLPSTVPFSFELMW
        L AEN+R G+EDFAD VADCD VDLPLYGASFTWSNMRKGKEMEIRCRLDLFLI SD ED LQDTI+K GTTFASDHDLILLF+FEQLPSTVPFSFELM 
Subjt:  LQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDHDLILLFYFEQLPSTVPFSFELMW

Query:  LSLSK
         + SK
Subjt:  LSLSK

A5AI05 Reverse transcriptase domain-containing protein (e value: 6.9e-12; % identity: 39.2)
Query:  YLAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASD
        Y  WC+ G  N     S   G   L        ++DF D + DC+L+DLPL  ASFTWSNM   +E  +  RLD FL  ++WE     +IQ V   + SD
Subjt:  YLAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASD

Query:  HDLILLFYFEQLPSTVPFSFELMWL
        H  I+L          PF FE MWL
Subjt:  HDLILLFYFEQLPSTVPFSFELMWL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G40390.1 DNAse I-like superfamily protein (e value: 4.0e-04; % identity: 30.38)
Query:  WCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDW
        W + G  N+  + S  +    + +    +GLED    + D DLVDLP  G  +TWSN +  ++  I  +LD  +++  W
Subjt:  WCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDW

AT1G43760.1 DNAse I-like superfamily protein (e value: 1.1e-06; % identity: 33.71)
Query:  LQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDHDLILLFYFEQLP
        LQ     RGLE+F + + D DLVD+P  G  +TWSN +   +  I  +LD  + + DW       I     +  SDH   ++   E LP
Subjt:  LQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDHDLILLFYFEQLP


Sequences
CDS sequence
ATGAGCTCAGATTCATCCATACCGTTTCCAACCTACCTTGCTTGGTGCATTAGTGGAAAGTCAAATGAATGCCTTATGCAATCAAATATGAAAGGGGTTCTGACTTTACA
GGCCGAAAATCAGAGAAGAGGTCTTGAAGATTTCGCGGATCCCGTTGCTGATTGTGATTTGGTTGACCTCCCACTCTATGGCGCTTCTTTCACTTGGTCTAACATGAGGA
AAGGGAAGGAAATGGAAATCCGATGCAGACTGGATCTTTTCCTAATCGACTCGGATTGGGAAGACCATCTGCAGGACACTATTCAAAAAGTGGGGACCACATTCGCTTCC
GATCATGACCTCATTCTTCTTTTCTATTTCGAACAACTTCCCTCCACGGTACCCTTCTCTTTCGAACTGATGTGGTTGTCCCTATCGAAATCTAGAACTCAAAGTTGTGA
AGCAAAAGAATGCAACATTCTATTGACTTGA
mRNA sequence
ATGAGCTCAGATTCATCCATACCGTTTCCAACCTACCTTGCTTGGTGCATTAGTGGAAAGTCAAATGAATGCCTTATGCAATCAAATATGAAAGGGGTTCTGACTTTACA
GGCCGAAAATCAGAGAAGAGGTCTTGAAGATTTCGCGGATCCCGTTGCTGATTGTGATTTGGTTGACCTCCCACTCTATGGCGCTTCTTTCACTTGGTCTAACATGAGGA
AAGGGAAGGAAATGGAAATCCGATGCAGACTGGATCTTTTCCTAATCGACTCGGATTGGGAAGACCATCTGCAGGACACTATTCAAAAAGTGGGGACCACATTCGCTTCC
GATCATGACCTCATTCTTCTTTTCTATTTCGAACAACTTCCCTCCACGGTACCCTTCTCTTTCGAACTGATGTGGTTGTCCCTATCGAAATCTAGAACTCAAAGTTGTGA
AGCAAAAGAATGCAACATTCTATTGACTTGA
Protein sequence
MSSDSSIPFPTYLAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFAS
DHDLILLFYFEQLPSTVPFSFELMWLSLSKSRTQSCEAKECNILLT
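
The sequences in this record can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the database: it assumes the genome location uses 1-based inclusive coordinates and that the CDS is translated with the standard nuclear genetic code. The helper names `span_length` and `translate` are hypothetical, introduced only for illustration. It compares the length of the genomic span with the length of the CDS, and the translated CDS with the listed protein sequence.

```python
import re

# Values copied from this record.
LOCATION = "scaffold29:292982..293452"
CDS = (
    "ATGAGCTCAGATTCATCCATACCGTTTCCAACCTACCTTGCTTGGTGCATTAGTGGAAAGTCAAATGAATGCCTTATGCAATCAAATATGAAAGGGGTTCTGACTTTACA"
    "GGCCGAAAATCAGAGAAGAGGTCTTGAAGATTTCGCGGATCCCGTTGCTGATTGTGATTTGGTTGACCTCCCACTCTATGGCGCTTCTTTCACTTGGTCTAACATGAGGA"
    "AAGGGAAGGAAATGGAAATCCGATGCAGACTGGATCTTTTCCTAATCGACTCGGATTGGGAAGACCATCTGCAGGACACTATTCAAAAAGTGGGGACCACATTCGCTTCC"
    "GATCATGACCTCATTCTTCTTTTCTATTTCGAACAACTTCCCTCCACGGTACCCTTCTCTTTCGAACTGATGTGGTTGTCCCTATCGAAATCTAGAACTCAAAGTTGTGA"
    "AGCAAAAGAATGCAACATTCTATTGACTTGA"
)
PROTEIN = (
    "MSSDSSIPFPTYLAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFAS"
    "DHDLILLFYFEQLPSTVPFSFELMWLSLSKSRTQSCEAKECNILLT"
)

# Standard genetic code (assumed), built from the conventional TCAG ordering:
# codons are enumerated by first, second, then third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def span_length(location):
    """Length of a 'seqid:start..end' span, assuming 1-based inclusive coordinates."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return end - start + 1


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    print("span length equals CDS length:", span_length(LOCATION) == len(CDS))
    print("translated CDS equals protein:", translate(CDS) == PROTEIN)
```

Under these assumptions, both comparisons should agree for this record: the genomic span and the CDS are each 471 bases long, which is consistent with the CDS and mRNA sequences being identical (no introns or untranslated regions in the annotated model).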