; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS013010 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS013010
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold344:128827..129345
RNA-Seq ExpressionMS013010
SyntenyMS013010
Gene Ontology termsGO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]4.0e-5557.8Show/hide
Query:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ
        LN T I LIPK + PR++ EFRPISLCNV Y++++KA+ NR+K +LN IISPNQSAFIP   + DNVI+GYEC+H +R   G +    AL LD+SKAYD+
Subjt:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ

Query:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLLS
        VEW+FLEQ M  +GFS  W++LI+ CI++  FS  +NG+ VG +KP R LRQG P SPYLF+LCA+  S+LL+
Subjt:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLLS

XP_024043083.1 uncharacterized protein LOC112099827 [Citrus clementina]1.8e-5558.14Show/hide
Query:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ
        LN T I LIPK K PR++ EFRPISLCNV Y++++KA+ NR+KS+LN IISPNQSAFIP   ++DNVI+GY+C+H +R   G +    AL LD+SKAYD+
Subjt:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ

Query:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLL
        VEW FLEQ M  +GFS  W++LI+ CI++  FS  +NG+ VG +KP + LRQG P SPYLF+LCA+  S+LL
Subjt:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLL

XP_042956310.1 uncharacterized protein LOC122292152 [Carya illinoinensis]3.4e-5458.72Show/hide
Query:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ
        +NET ITLIPK K P+R+P+FRPISLCNV YK+++K L NR+K VL +IISPNQSAF+PG  + DN+++ YE +H++  +  GK+ + AL LDMSKAYD+
Subjt:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ

Query:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLL
        VEWSFL  +M R+GF   W++L+ + ISSVSFS  +NG      KPSR LRQGDP SPYLF+LCA+ LS +L
Subjt:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLL

XP_042958247.1 uncharacterized protein LOC122293873 [Carya illinoinensis]3.4e-5458.72Show/hide
Query:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ
        +NET ITLIPK K P+R+P+FRPISLCNV YK+++K L NR+K VL +IISPNQSAF+PG  + DN+++ YE +H++  +  GK+ + AL LDMSKAYD+
Subjt:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ

Query:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLL
        VEWSFL  +M R+GF   W++L+ + ISSVSFS  +NG      KPSR LRQGDP SPYLF+LCA+ LS +L
Subjt:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLL

XP_042965942.1 uncharacterized protein LOC122299620 [Carya illinoinensis]1.2e-5457.56Show/hide
Query:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ
        LN + I LIPK ++P ++ +FRPISLCNV YKL+SKA+ NR+K++L  +IS +QSAF+PG  + DNV++ YE +H LR + GGK  + ++ LDMSKAYD+
Subjt:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ

Query:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLL
        VEW FLE+IM++MGF   W+NLI+ C++SV FS  +NG   GC+KP+R LRQGDP SPYLFLLC +GL S+L
Subjt:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLL

TrEMBL top hitse value%identityAlignment
A0A2N9E7R2 Reverse transcriptase domain-containing protein3.3e-5560.12Show/hide
Query:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ
        +N T ITLIPK K+P R+ EFRPISLCNV YKL+SK + NR+K +L  IIS +QSAF+PG  + DNV++ +E +H +     G+    AL LDMSKAYD+
Subjt:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ

Query:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLLS
        VEW++LE IM +MGF P WV++I+QCIS+VS+S  VNG   G +KPSR LRQGDP SPYLFLLCA+GL SL+S
Subjt:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLLS

A0A2N9EDY7 Reverse transcriptase domain-containing protein4.3e-5561.27Show/hide
Query:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ
        +N T ITLIPK K+P RI EFRPISLCNV+YKLISK + NR+K +L  IIS  QSAF+PG  + DNV++ +E +H +     GK    A+ LDMSKAYD+
Subjt:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ

Query:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLLS
        VEWSFLE+IM +MGF P WV LI+ CIS+VS+S  VNG   G +KPSR +RQGDP SPYLFLLCA+GL  L+S
Subjt:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLLS

A0A2N9HPU0 Uncharacterized protein1.9e-5561.27Show/hide
Query:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ
        +N T ITLIPK K+P RI EFRPISLCNV+YKLISK + NR+K +L  IIS  QSAF+PG  + DNV++ +E +H +     GK    A+ LDMSKAYD+
Subjt:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ

Query:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLLS
        VEWSFLE+IM +MGF P WV+LI+ CIS+VS+S  VNG   G +KPSR +RQGDP SPYLFLLCA+GL  L+S
Subjt:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLLS

A0A5B7BN08 Reverse transcriptase domain-containing protein5.1e-5660.47Show/hide
Query:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ
        +N T I LIPK   PR+I EFRPISLCNV YK+ISK L NR+K++L  II+ +QSAF+PG  + DN+++ +E IH L+ K  GK   +AL LDMSKAYD+
Subjt:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ

Query:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLL
        VEWSFLE +MLRMGF   WV+LI+ C+S+VSFS  +NG   GC+KP+R LRQGDP SPYLF+LCA+  S+LL
Subjt:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLL

A0A803QGA2 Uncharacterized protein1.5e-5557.56Show/hide
Query:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ
        +N+++ITLIPK + P  I EFRPISLCNV YK+ISK L+NR+K++L  +IS NQSAF+PG  + D+V++ YEC+H+L+ +  G+ ++ A+ LDMSKAYD+
Subjt:  LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQ

Query:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLL
        VEW F+E++ML+MGF   WVN++L+C+SSV + F++N S  G V P+R LRQGDP SPYLFL+CA+GLSSL+
Subjt:  VEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.8e-1327.38Show/hide
Query:  ETIITLIPK-AKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQV
        E  I LIPK  +   +   FRPISL N+  K+++K L NR++  + ++I  +Q  FIPG     N+      I  + R      N   +++D  KA+D++
Subjt:  ETIITLIPK-AKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQV

Query:  EWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLS
        +  F+ + + ++G   +++ +I       + +  +NG ++         RQG P SP LF +  + L+
Subjt:  EWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLS

P08548 LINE-1 reverse transcriptase homolog2.1e-1430.36Show/hide
Query:  ETIITLIPK-AKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQV
        E  ITLIPK  K P R   +RPISL N+  K+++K L NR++  + +II  +Q  FIPG     N+      I  + +      +   L++D  KA+D +
Subjt:  ETIITLIPK-AKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQV

Query:  EWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLS
        +  F+ + + ++G    ++ LI    S  + +  +NG ++         RQG P SP LF +  + L+
Subjt:  EWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLS

P11369 LINE-1 retrotransposable element ORF2 protein3.9e-2135.12Show/hide
Query:  ETIITLIPK-AKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQV
        E  ITLIPK  K P +I  FRPISL N+  K+++K L NR++  +  II P+Q  FIPG     N+      IH + +      N   ++LD  KA+D++
Subjt:  ETIITLIPK-AKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQV

Query:  EWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLS
        +  F+ +++ R G    ++N+I    S    + KVNG ++  +      RQG P SPYLF +  + L+
Subjt:  EWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLS

P14381 Transposon TX1 uncharacterized 149 kDa protein1.1e-2034.52Show/hide
Query:  IITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQVEWS
        +++L+PK    R I  +RP+SL +  YK+++KA+  R+KSVL E+I P+QS  +PG  + DNV L  + +H  RR      + A L+LD  KA+D+V+  
Subjt:  IITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQVEWS

Query:  FLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLL
        +L   +    F P +V  +    +S     K+N S    +   R +RQG P S  L+ L  +    LL
Subjt:  FLEQIMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLL

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM2.9e-0828.39Show/hide
Query:  IPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQVEWSFLEQ
        IPK    +R  +FRPIS+ +V  + ++  L  R+ S +N    P Q  F+P     DN  +  + +  LR       +    NLD+SKA+D +  + +  
Subjt:  IPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQVEWSFLEQ

Query:  IMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLL
         +   G    +V+ +         S   +G       P+R ++QGDP SP LF L
Subjt:  IMLRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLL

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases4.1e-1836.75Show/hide
Query:  LVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQVEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVN
        +V R+K ++  +I P Q++FIPG    DN++   E +HS+RRK G K  W  L LD+ KAYD++ W +LE  ++  GF  +W    L  I+  +F  +  
Subjt:  LVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQVEWSFLEQIMLRMGFSPLWVNLILQCISSVSFSFKVN

Query:  GSQVGCVKPSRRLRQGD
          +VG    S+R R  D
Subjt:  GSQVGCVKPSRRLRQGD

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.1e-0763.16Show/hide
Query:  FKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSL
        F +NG+  G V PSR LRQGDP SPYLF+LC + LS L
Subjt:  FKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CTCAACGAAACGATAATTACTCTTATCCCAAAAGCCAAGCACCCGAGACGTATTCCAGAATTTCGGCCGATTTCTCTTTGTAATGTCAGCTATAAATTAATTTCCAAGGC
TTTGGTGAACAGAATGAAGTCTGTTTTGAATGAGATCATCTCGCCCAATCAGAGTGCCTTTATTCCGGGTCATTGTGTGGTGGATAATGTCATTCTGGGCTATGAATGTA
TTCATTCTTTGCGTCGAAAATCAGGGGGTAAGACGAATTGGGCTGCTCTAAATCTCGATATGAGTAAGGCCTATGATCAGGTTGAATGGTCCTTTCTGGAACAAATCATG
TTAAGGATGGGGTTTTCTCCACTGTGGGTGAACTTGATTCTCCAGTGTATATCTTCCGTTTCCTTTTCCTTTAAGGTTAATGGCTCTCAGGTGGGTTGTGTGAAACCGAG
TCGCAGACTCCGTCAAGGTGATCCTTTCTCCCCTTACTTATTTTTGTTATGTGCCCAGGGGCTTTCCAGTTTGTTGTCG
mRNA sequenceShow/hide mRNA sequence
CTCAACGAAACGATAATTACTCTTATCCCAAAAGCCAAGCACCCGAGACGTATTCCAGAATTTCGGCCGATTTCTCTTTGTAATGTCAGCTATAAATTAATTTCCAAGGC
TTTGGTGAACAGAATGAAGTCTGTTTTGAATGAGATCATCTCGCCCAATCAGAGTGCCTTTATTCCGGGTCATTGTGTGGTGGATAATGTCATTCTGGGCTATGAATGTA
TTCATTCTTTGCGTCGAAAATCAGGGGGTAAGACGAATTGGGCTGCTCTAAATCTCGATATGAGTAAGGCCTATGATCAGGTTGAATGGTCCTTTCTGGAACAAATCATG
TTAAGGATGGGGTTTTCTCCACTGTGGGTGAACTTGATTCTCCAGTGTATATCTTCCGTTTCCTTTTCCTTTAAGGTTAATGGCTCTCAGGTGGGTTGTGTGAAACCGAG
TCGCAGACTCCGTCAAGGTGATCCTTTCTCCCCTTACTTATTTTTGTTATGTGCCCAGGGGCTTTCCAGTTTGTTGTCG
Protein sequenceShow/hide protein sequence
LNETIITLIPKAKHPRRIPEFRPISLCNVSYKLISKALVNRMKSVLNEIISPNQSAFIPGHCVVDNVILGYECIHSLRRKSGGKTNWAALNLDMSKAYDQVEWSFLEQIM
LRMGFSPLWVNLILQCISSVSFSFKVNGSQVGCVKPSRRLRQGDPFSPYLFLLCAQGLSSLLS