; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS004699 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS004699
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionRetrovirus-related Pol polyprotein LINE-1
Genome locationscaffold741:156040..156657
RNA-Seq ExpressionMS004699
SyntenyMS004699
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
QHO25231.1 Putative ribonuclease H protein [Arachis hypogaea]1.8e-4644.66Show/hide
Query:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC
        ISHL FADD++LFAEAN+DQ  V+   L  FC +S Q ++  KT ++FSKNV H  + ++ +V  F  T ++GKYLG+PI+H ++     + IINK+   
Subjt:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC

Query:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK
        L++W A+SLSLAG +TLV+ VL ++P ++M     P   C  +D++CRN LWG+T Q  +IHLISWK++ +PK  G LG+    + N A +    WGLI 
Subjt:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK

Query:  KRSDLW
        ++  LW
Subjt:  KRSDLW

RYR74850.1 hypothetical protein Ahy_A02g009560 [Arachis hypogaea]3.6e-4745.63Show/hide
Query:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC
        +SHL FADDL+LFAEAN+DQ  V+K  L+ FCE+S Q V+  KT I+FS NV    +++I D  GF  T N+GKYLG+P+ H ++ +S   +II+K++  
Subjt:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC

Query:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK
        L+SW A+SLSLAG  TLV+ VL ++P+++M     P   C  +D+ CRN LWG T+Q  +IH +SW+ + + K  G LG+   +  N A +  + WGLI+
Subjt:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK

Query:  KRSDLW
        K+  LW
Subjt:  KRSDLW

XP_016164673.1 uncharacterized protein LOC107607211 [Arachis ipaensis]1.9e-4844.66Show/hide
Query:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC
        ISHL FADD++LFAEAN+DQ  ++   L  FC++S QKV+  KT ++FS+NV H  + +I +V  F  T ++ KYLG+PI+H ++     + IINK+ + 
Subjt:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC

Query:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK
        L+SW A+SLSLAG  TLV+ VL ++P ++MH    P   C  +D++CRN +WG T Q  ++HL++WK I +PK  G LG+    + N A +    WGLI 
Subjt:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK

Query:  KRSDLW
        ++ DLW
Subjt:  KRSDLW

XP_025664883.1 uncharacterized protein LOC112763420 [Arachis hypogaea]1.9e-4844.66Show/hide
Query:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC
        ISHL FADD++LFAEAN+DQ  ++   L  FC++S QKV+  KT ++FS+NV H  + +I +V  F  T ++ KYLG+PI+H ++     + IINK+ + 
Subjt:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC

Query:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK
        L+SW A+SLSLAG  TLV+ VL ++P ++MH    P   C  +D++CRN +WG T Q  ++HL++WK I +PK  G LG+    + N A +    WGLI 
Subjt:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK

Query:  KRSDLW
        ++ DLW
Subjt:  KRSDLW

XP_025679132.1 uncharacterized protein LOC112779089 [Arachis hypogaea]8.1e-4743.69Show/hide
Query:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC
        +SHL FADDL+LFAEA++ Q +++  VL +FC++S +KVNN KT I+FS NV H  +++I +   F  T ++GKYLG+PI+H ++     K +INK++  
Subjt:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC

Query:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK
        L++W A++LSLAG  TLV+ +L +LP++++     P   C  +D+ C N LWG T Q  ++H++SW  I KPKV G LGL   K+ N + +  + WGLI 
Subjt:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK

Query:  KRSDLW
         +  LW
Subjt:  KRSDLW

TrEMBL top hitse value%identityAlignment
A0A151RCT4 Putative ribonuclease H protein At1g65750 family (Fragment)1.5e-4647.57Show/hide
Query:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC
        +SHL FADDL+LF+EA+LDQ+EV+K  L  FC++S QKV+  KT I+FSKNV    +++I    GF+ T N+GKYLG+PI H R+       IINKV+  
Subjt:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC

Query:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK
        LSSW A +LS AG +TL + VL  LP ++M     P  +C  +D+ CR+ LWGH  ++ RIH ++W  I KPK EG LGL   ++ N AL+    W L  
Subjt:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK

Query:  KRSDLW
         +S LW
Subjt:  KRSDLW

A0A151SZM7 Putative ribonuclease H protein At1g65750 family (Fragment)3.7e-4543.69Show/hide
Query:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC
        +SHL FADDL+LFAEA+L+Q+EV++  L+ FC +S QKV+  KT ++FSKNV    ++ I    GF+ T N+GKYLGIP  H R+  S+ ++II K++  
Subjt:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC

Query:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK
        LS W A +LS AG +TL + VL+ALP+++M   + P L+C ++D++CRN LWG    + + H + W ++  PK  G LGL + ++ NT+ +    W LI 
Subjt:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK

Query:  KRSDLW
        +   LW
Subjt:  KRSDLW

A0A444Y771 Reverse transcriptase domain-containing protein1.4e-4441.75Show/hide
Query:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC
        +SHL FADD++LF +A+++Q+EVV  +L  FC+ S QKVN  K+ +YFS N+    ++++ D  G R+T+N+GKYLG+P++HGR      + I+++++  
Subjt:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC

Query:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK
        LSSW A +LSLAG +TL Q  L ++P++ M   + P+ +C  +D++CRN LW   S   ++HL+SW+ +  PK +G LGL   +  N A L  +AW LI 
Subjt:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK

Query:  KRSDLW
         +  LW
Subjt:  KRSDLW

A0A445EHD6 Uncharacterized protein1.8e-4745.63Show/hide
Query:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC
        +SHL FADDL+LFAEAN+DQ  V+K  L+ FCE+S Q V+  KT I+FS NV    +++I D  GF  T N+GKYLG+P+ H ++ +S   +II+K++  
Subjt:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC

Query:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK
        L+SW A+SLSLAG  TLV+ VL ++P+++M     P   C  +D+ CRN LWG T+Q  +IH +SW+ + + K  G LG+   +  N A +  + WGLI+
Subjt:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK

Query:  KRSDLW
        K+  LW
Subjt:  KRSDLW

A0A6P4B3X6 uncharacterized protein LOC1074617878.2e-4544.66Show/hide
Query:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC
        ISHL FADD++LFAEA +DQ  ++   L  FC++S Q V+  KT I+FSKNV H  + +I +V  F  T ++GKYLG+P++H ++     + IINK+   
Subjt:  ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLC

Query:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK
        L+S  A+SLSLAG  TLV+ VL ++P ++M     P   C  +D++CRN +WG T Q  +IHL++WK I +PK  G LG+      N A +    WGLI 
Subjt:  LSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIK

Query:  KRSDLW
        K+ DLW
Subjt:  KRSDLW

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.3e-2336.43Show/hide
Query:  IPIIHGRLPASALKEIINKVSLCLSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGD
        +P++  R+      EI+ +VS  +S W   +LS AG +TL + VL ++P HSM     P  +  +LDQL R  LWG T+++ + HL+ W  +  PK EG 
Subjt:  IPIIHGRLPASALKEIINKVSLCLSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGD

Query:  LGLHKFKEFNTALLGMIAWGLIKKRSDLW
        LG+   K  N AL+  + W L+++++ LW
Subjt:  LGLHKFKEFNTALLGMIAWGLIKKRSDLW

Arabidopsis top hitse value%identityAlignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.9e-1529.82Show/hide
Query:  KYLGIPIIHGRLPASALKEIINKVSLCLSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPK
        +YLG+P++  ++  S    ++ K+ + +  W+A  LS AG + L+  V+ +L    M  FR P    K++D +C + LW      ++   ++W  +  PK
Subjt:  KYLGIPIIHGRLPASALKEIINKVSLCLSSWSAASLSLAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPK

Query:  VEGDLGLHKFKEFN
         EG LG+   KE N
Subjt:  VEGDLGLHKFKEFN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATCTCTCACTTAATGTTTGCAGATGATCTATTACTCTTTGCGGAAGCTAATTTAGACCAAATTGAGGTTGTTAAGACAGTTCTAAGTAAATTTTGTGAGGCTTCAGTCCA
AAAAGTAAACAACTTCAAGACAATTATCTATTTCTCAAAGAATGTCCCTCACGAGGAGCAACAACAGATTGTTGATGTTGGTGGATTTAGAGTTACTTCCAATGTAGGGA
AATACTTAGGGATTCCAATTATACATGGTCGGCTTCCTGCCTCGGCTCTCAAAGAGATAATAAACAAAGTTAGTCTTTGTTTGAGTAGCTGGTCAGCTGCTTCTTTATCA
TTAGCAGGGCACATGACCCTTGTCCAACCTGTGCTTCAAGCTCTTCCAACTCACTCCATGCACATTTTTCGGTTCCCAGTTTTGGTGTGTAAAAAACTTGACCAACTGTG
TAGAAACTTATTGTGGGGGCATACCTCTCAGAGGTCTAGAATCCATTTAATCAGTTGGAAATCAATTACTAAGCCTAAAGTTGAGGGGGATCTTGGTCTTCATAAATTTA
AAGAGTTTAATACGGCTCTCTTAGGGATGATTGCCTGGGGTTTGATTAAAAAGCGAAGCGACCTTTGG
mRNA sequenceShow/hide mRNA sequence
ATCTCTCACTTAATGTTTGCAGATGATCTATTACTCTTTGCGGAAGCTAATTTAGACCAAATTGAGGTTGTTAAGACAGTTCTAAGTAAATTTTGTGAGGCTTCAGTCCA
AAAAGTAAACAACTTCAAGACAATTATCTATTTCTCAAAGAATGTCCCTCACGAGGAGCAACAACAGATTGTTGATGTTGGTGGATTTAGAGTTACTTCCAATGTAGGGA
AATACTTAGGGATTCCAATTATACATGGTCGGCTTCCTGCCTCGGCTCTCAAAGAGATAATAAACAAAGTTAGTCTTTGTTTGAGTAGCTGGTCAGCTGCTTCTTTATCA
TTAGCAGGGCACATGACCCTTGTCCAACCTGTGCTTCAAGCTCTTCCAACTCACTCCATGCACATTTTTCGGTTCCCAGTTTTGGTGTGTAAAAAACTTGACCAACTGTG
TAGAAACTTATTGTGGGGGCATACCTCTCAGAGGTCTAGAATCCATTTAATCAGTTGGAAATCAATTACTAAGCCTAAAGTTGAGGGGGATCTTGGTCTTCATAAATTTA
AAGAGTTTAATACGGCTCTCTTAGGGATGATTGCCTGGGGTTTGATTAAAAAGCGAAGCGACCTTTGG
Protein sequenceShow/hide protein sequence
ISHLMFADDLLLFAEANLDQIEVVKTVLSKFCEASVQKVNNFKTIIYFSKNVPHEEQQQIVDVGGFRVTSNVGKYLGIPIIHGRLPASALKEIINKVSLCLSSWSAASLS
LAGHMTLVQPVLQALPTHSMHIFRFPVLVCKKLDQLCRNLLWGHTSQRSRIHLISWKSITKPKVEGDLGLHKFKEFNTALLGMIAWGLIKKRSDLW