CuGenDBv2

Moc09g15690 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g15690
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase domain-containing protein
Genome location: chr9:13072572..13074923
RNA-Seq Expression: Moc09g15690
Synteny: Moc09g15690
Gene Ontology terms: GO:0005739 - mitochondrion (cellular component)
GO:0016020 - membrane (cellular component)
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
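
The genome location in this record is written as a `chromosome:start..end` string. As a minimal sketch, the Python below splits such a string and computes the genomic span. It assumes the coordinates are 1-based and inclusive, which the page itself does not state; the function names `parse_location` and `span_length` are illustrative and not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr9:13072572..13074923")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 2352 bp, which is longer than the 801-nt CDS listed in the sequences for this gene.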


Homology
GenBank top hits (e value, % identity, alignment)
KAE8663210.1 Detected protein of unknown function [Hibiscus syriacus] | e value: 7.3e-24 | % identity: 73.53
Query:  LPPQPLQTLSEN-GVAVATPTLSFLSLRVPIYMTSGRYKVNLFRRGRQISTYLSSPGQQRSSGLRAWEHSSNKEE-SGLNSGPACPSAARSGNGARRGRR
        LP QPLQTLSEN  VAVATPTLSFLSLRV IYMTSGRY+V+LF RG        SPGQ RSS LRAWE    +   SGLNSGPA PSAAR GNGARRGRR
Subjt:  LPPQPLQTLSEN-GVAVATPTLSFLSLRVPIYMTSGRYKVNLFRRGRQISTYLSSPGQQRSSGLRAWEHSSNKEE-SGLNSGPACPSAARSGNGARRGRR

Query:  YA
        YA
Subjt:  YA

KAF3961445.1 hypothetical protein CMV_013935 [Castanea mollissima] | e value: 1.6e-10 | % identity: 39.05
Query:  LAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDH
        + WC  G  N     S  +G   L        +E+F++ V D +LVDLPL G SFTWSN   G +  +  R+D  L+  DWEDH  D  Q++     SDH
Subjt:  LAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDH

Query:  DLILL
          ILL
Subjt:  DLILL

KAF4349952.1 hypothetical protein G4B88_002374 [Cannabis sativa] | e value: 2.9e-36 | % identity: 86.81
Query:  LQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDHDLILLFYFEQLPST
        L AEN+R G+EDFAD VADCD VDLPLYGASFTWSNMRKGKEMEIRCRLDLFLI SD ED LQDTI+K GTTFASDHDLILLF+FEQLPST
Subjt:  LQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDHDLILLFYFEQLPST

KAG2720218.1 hypothetical protein I3760_02G027200 [Carya illinoinensis] | e value: 2.3e-09 | % identity: 38.32
Query:  LAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDH
        L WCI G  N     S   G + L      + +++F+D + D DLVDLPL G  +TWSN R         RLD FL+   WE H    +QK      SDH
Subjt:  LAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDH

Query:  DLILLFY
          I+L Y
Subjt:  DLILLFY

TYI32627.1 hypothetical protein ES332_A04G077300v1 [Gossypium tomentosum] | e value: 5.2e-14 | % identity: 53.92
Query:  PIYMTSGRYKVNLFRRGRQISTYLSSPGQQRSSGLRAWEHSSNKEE-SGLNSGPACPSAARSGNGARRGRRYAPQLCCDLHLIPTAGIDLMRPAHYYDEK
        P+Y+++  Y  N      +I TYLSSPGQ RSSGLRAWE    +   SGLNSG A PSAARS NGARRGRRYAP LCCD            RP  Y  E 
Subjt:  PIYMTSGRYKVNLFRRGRQISTYLSSPGQQRSSGLRAWEHSSNKEE-SGLNSGPACPSAARSGNGARRGRRYAPQLCCDLHLIPTAGIDLMRPAHYYDEK

Query:  SP
         P
Subjt:  SP

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EIH5 Uncharacterized protein | e value: 2.9e-10 | % identity: 38
Query:  LAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDH
        ++WCI G  N     S       L+AE   R +  F+D +A+  L+DLPL    FTWSN    ++   + R+D FL+ SDWEDH    +QK    F S+H
Subjt:  LAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDH

A0A2N9HTW1 Reverse transcriptase domain-containing protein | e value: 2.9e-10 | % identity: 38.1
Query:  WCISGKSNECLMQSN--MKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDH
        WCI G  N     S   M+G LT       R + +F+D +++ +L+DLPL+   FTWSN    ++   + R+D FL+ +DWED     +QK    F SDH
Subjt:  WCISGKSNECLMQSN--MKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDH

Query:  DLILL
         LI L
Subjt:  DLILL

A0A5D2QVI1 Uncharacterized protein | e value: 2.5e-14 | % identity: 53.92
Query:  PIYMTSGRYKVNLFRRGRQISTYLSSPGQQRSSGLRAWEHSSNKEE-SGLNSGPACPSAARSGNGARRGRRYAPQLCCDLHLIPTAGIDLMRPAHYYDEK
        P+Y+++  Y  N      +I TYLSSPGQ RSSGLRAWE    +   SGLNSG A PSAARS NGARRGRRYAP LCCD            RP  Y  E 
Subjt:  PIYMTSGRYKVNLFRRGRQISTYLSSPGQQRSSGLRAWEHSSNKEE-SGLNSGPACPSAARSGNGARRGRRYAPQLCCDLHLIPTAGIDLMRPAHYYDEK

Query:  SP
         P
Subjt:  SP

A0A6A2X3H8 Uncharacterized protein | e value: 3.5e-24 | % identity: 73.53
Query:  LPPQPLQTLSEN-GVAVATPTLSFLSLRVPIYMTSGRYKVNLFRRGRQISTYLSSPGQQRSSGLRAWEHSSNKEE-SGLNSGPACPSAARSGNGARRGRR
        LP QPLQTLSEN  VAVATPTLSFLSLRV IYMTSGRY+V+LF RG        SPGQ RSS LRAWE    +   SGLNSGPA PSAAR GNGARRGRR
Subjt:  LPPQPLQTLSEN-GVAVATPTLSFLSLRVPIYMTSGRYKVNLFRRGRQISTYLSSPGQQRSSGLRAWEHSSNKEE-SGLNSGPACPSAARSGNGARRGRR

Query:  YA
        YA
Subjt:  YA

A0A7J6DV31 Uncharacterized protein | e value: 1.4e-36 | % identity: 86.81
Query:  LQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDHDLILLFYFEQLPST
        L AEN+R G+EDFAD VADCD VDLPLYGASFTWSNMRKGKEMEIRCRLDLFLI SD ED LQDTI+K GTTFASDHDLILLF+FEQLPST
Subjt:  LQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDHDLILLFYFEQLPST

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G40390.1 DNAse I-like superfamily protein | e value: 6.8e-04 | % identity: 30.38
Query:  WCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDW
        W + G  N+  + S  +    + +    +GLED    + D DLVDLP  G  +TWSN +  ++  I  +LD  +++  W
Subjt:  WCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDW

AT1G43760.1 DNAse I-like superfamily protein | e value: 1.5e-06 | % identity: 33.71
Query:  LQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDHDLILLFYFEQLP
        LQ     RGLE+F + + D DLVD+P  G  +TWSN +   +  I  +LD  + + DW       I     +  SDH   ++   E LP
Subjt:  LQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFASDHDLILLFYFEQLP
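
The % identity values listed for the hits above can be related to the middle line of each alignment, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below assumes that convention (the usual BLAST-style midline) and that identity is reported as identical columns divided by alignment length; `percent_identity` is a hypothetical helper name, not something the database provides.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent of alignment columns whose midline shows a residue letter.

    ASSUMPTION: letters in the midline mark identical residues, while
    '+' and spaces mark non-identical columns.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identities = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identities / aligned_length, 2)


# First ten columns of the KAF4349952.1 alignment, used as a small example.
query = "LQAENQRRGL"
midline = "L AEN+R G+"
print(percent_identity(midline, len(query)))
```

Applied to the full 91-column midline shown for KAF4349952.1, the same count gives 79 identical columns, and 79/91 rounds to the listed 86.81. Note that trailing or leading spaces in a midline are easy to lose when copying text, which is why the alignment length is passed in separately.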


Sequences
CDS sequence
ATGAGCTCAGATTCATCCATACCGTTTCCAACCTACCTTGCTTGGTGCATTAGTGGAAAGTCAAATGAATGCCTTATGCAATCAAATATGAAAGGGGTTCTGACTTTACA
GGCCGAAAATCAGAGAAGAGGTCTTGAAGATTTCGCGGATCCCGTTGCTGATTGTGATTTGGTTGACCTCCCACTCTATGGCGCTTCTTTCACTTGGTCTAACATGAGGA
AAGGGAAGGAAATGGAAATCCGATGCAGACTGGATCTTTTCCTAATCGACTCGGATTGGGAAGACCATCTGCAGGACACTATTCAAAAAGTGGGGACCACATTCGCTTCC
GATCATGACCTCATTCTTCTTTTCTATTTCGAACAACTTCCCTCCACGGCTTCGCTTCTACGACTGCCTCCCCAGCCGTTGCAAACACTGAGCGAGAACGGTGTAGCAGT
GGCCACACCAACCCTTTCGTTCTTATCGCTCCGGGTTCCGATCTATATGACCTCCGGGCGGTACAAAGTCAACCTCTTCCGTAGAGGCAGGCAGATCTCTACGTACTTGT
CCAGTCCAGGACAACAGAGATCTTCCGGTCTGCGTGCGTGGGAGCACAGCTCAAACAAAGAAGAGTCTGGTCTGAATTCAGGCCCGGCCTGCCCGTCTGCTGCTAGGTCC
GGGAATGGCGCCAGGCGAGGGCGCCGCTATGCCCCGCAGCTCTGCTGCGACCTTCATTTGATTCCGACGGCCGGCATAGATCTCATGAGACCCGCCCACTATTACGACGA
AAAAAGCCCTGCCCTTTTCCTTCGCTACTAG
mRNA sequence
ATGAGCTCAGATTCATCCATACCGTTTCCAACCTACCTTGCTTGGTGCATTAGTGGAAAGTCAAATGAATGCCTTATGCAATCAAATATGAAAGGGGTTCTGACTTTACA
GGCCGAAAATCAGAGAAGAGGTCTTGAAGATTTCGCGGATCCCGTTGCTGATTGTGATTTGGTTGACCTCCCACTCTATGGCGCTTCTTTCACTTGGTCTAACATGAGGA
AAGGGAAGGAAATGGAAATCCGATGCAGACTGGATCTTTTCCTAATCGACTCGGATTGGGAAGACCATCTGCAGGACACTATTCAAAAAGTGGGGACCACATTCGCTTCC
GATCATGACCTCATTCTTCTTTTCTATTTCGAACAACTTCCCTCCACGGCTTCGCTTCTACGACTGCCTCCCCAGCCGTTGCAAACACTGAGCGAGAACGGTGTAGCAGT
GGCCACACCAACCCTTTCGTTCTTATCGCTCCGGGTTCCGATCTATATGACCTCCGGGCGGTACAAAGTCAACCTCTTCCGTAGAGGCAGGCAGATCTCTACGTACTTGT
CCAGTCCAGGACAACAGAGATCTTCCGGTCTGCGTGCGTGGGAGCACAGCTCAAACAAAGAAGAGTCTGGTCTGAATTCAGGCCCGGCCTGCCCGTCTGCTGCTAGGTCC
GGGAATGGCGCCAGGCGAGGGCGCCGCTATGCCCCGCAGCTCTGCTGCGACCTTCATTTGATTCCGACGGCCGGCATAGATCTCATGAGACCCGCCCACTATTACGACGA
AAAAAGCCCTGCCCTTTTCCTTCGCTACTAG
Protein sequence
MSSDSSIPFPTYLAWCISGKSNECLMQSNMKGVLTLQAENQRRGLEDFADPVADCDLVDLPLYGASFTWSNMRKGKEMEIRCRLDLFLIDSDWEDHLQDTIQKVGTTFAS
DHDLILLFYFEQLPSTASLLRLPPQPLQTLSENGVAVATPTLSFLSLRVPIYMTSGRYKVNLFRRGRQISTYLSSPGQQRSSGLRAWEHSSNKEESGLNSGPACPSAARS
GNGARRGRRYAPQLCCDLHLIPTAGIDLMRPAHYYDEKSPALFLRY
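
The relationship between the CDS and protein sequences in this record can be checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (the page does not name a translation table) and stops at the first stop codon; `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (ASSUMED table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nt of the CDS listed for this gene.
print(translate("ATGAGCTCAGATTCATCCATACCGTTTCCAACCTACCTT"))
```

The translated fragment can be compared against the start of the protein sequence listed for this gene; pasting the full 801-nt CDS into `translate` allows the same comparison over the whole sequence.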