CuGenDBv2

MS005999 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS005999
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold254:3019402..3020034
RNA-Seq Expression: MS005999
Synteny: MS005999
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR005135 - Endonuclease/exonuclease/phosphatase
                  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
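
The genome location field follows a `seqid:start..end` pattern. As a minimal sketch, the Python below parses it and computes the span length. The helper name `parse_location` is hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the page does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("scaffold254:3019402..3020034")
# Assumption: coordinates are 1-based and inclusive at both ends.
span = end - start + 1
print(seqid, span)  # scaffold254 633
```

Under that assumption the span is 633 bp, a multiple of three, which would correspond to 211 codons if the coding sequence covers the whole interval.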


Homology
GenBank top hits (e value, % identity, alignment)
PKI35403.1 hypothetical protein CRG98_044205, partial [Punica granatum] (e value: 4.5e-53, % identity: 47.62)
Query:  LRDFILLHKPDIMVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFFTTVYGSPQRSSKRELWP
        +++ I  H+P+I+VIVEP+ISG  AD+VCR F  +S  RVEA    GGIWV+W+ + V +     ++QA+H R  Q   +  FT VY SP   ++R+LW 
Subjt:  LRDFILLHKPDIMVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFFTTVYGSPQRSSKRELWP

Query:  FLKSMVPTNQSPWLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVLP
         L ++      PW+++GDFN I  A EK G AP +P+ A  F   +D+C L+DL SSGP+FTW GP+  G++RVFERLDRA+ N  W+  F D SVRVLP
Subjt:  FLKSMVPTNQSPWLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVLP

Query:  MVLSDHHPII
         + SDHHP++
Subjt:  MVLSDHHPII

RYR24104.1 hypothetical protein Ahy_B02g057597 isoform B [Arachis hypogaea] (e value: 1.3e-39, % identity: 40.28)
Query:  LRDFILLHKPDIMVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFF-TTVYGSPQRSSKRELW
        LR+    +KPDI++++E K+SG  A +V R+    + +  EA    GGIW+ W  + +++  +  +QQ IH + +      +F T VY SPQ  ++  LW
Subjt:  LRDFILLHKPDIMVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFF-TTVYGSPQRSSKRELW

Query:  PFLKSMVPTNQSPWLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVL
          L ++     SPW++ GDFN I  A EK G + +       F   I+   LIDLG SG +FTWKGPL  G +RVF+RLDRAL+NS W+L   +  V+VL
Subjt:  PFLKSMVPTNQSPWLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVL

Query:  PMVLSDHHPII
        P   SDHHP++
Subjt:  PMVLSDHHPII

XP_015934914.2 uncharacterized protein LOC107461000 [Arachis duranensis] (e value: 3.4e-40, % identity: 39.34)
Query:  LRDFILLHKPDIMVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGF-FTTVYGSPQRSSKRELW
        L++F+  + PDI++++E K+SG  A  + +     +++  EA    GGIW+ WK + +S+  +  N+Q IH R  +     +  T VY SPQ +++R +W
Subjt:  LRDFILLHKPDIMVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGF-FTTVYGSPQRSSKRELW

Query:  PFLKSMVPTNQSPWLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVL
          ++ +      PWLLIGDFN I    EK G  P++      F   ID C L+DLG  G +FTW+GP   G+DRVF+RLDRAL+N  W+  F +  V+VL
Subjt:  PFLKSMVPTNQSPWLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVL

Query:  PMVLSDHHPII
        P   SDHHP++
Subjt:  PMVLSDHHPII

XP_022137804.1 uncharacterized protein LOC111009151 [Momordica charantia] (e value: 2.1e-114, % identity: 100)
Query:  MVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFFTTVYGSPQRSSKRELWPFLKSMVPTNQSP
        MVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFFTTVYGSPQRSSKRELWPFLKSMVPTNQSP
Subjt:  MVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFFTTVYGSPQRSSKRELWPFLKSMVPTNQSP

Query:  WLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVLPMVLSDHHPII
        WLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVLPMVLSDHHPII
Subjt:  WLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVLPMVLSDHHPII

XP_031402735.1 uncharacterized protein LOC116212324 [Punica granatum] (e value: 2.0e-53, % identity: 47.87)
Query:  VLRDFILLHKPDIMVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFFTTVYGSPQRSSKRELW
        V+++ I  H+P+I+VIVEP+ISG  AD+VCR F  +S  RVEA    GGIWV+W+ + V +     ++QA+H R  Q   +  FT VY SP   ++R+LW
Subjt:  VLRDFILLHKPDIMVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFFTTVYGSPQRSSKRELW

Query:  PFLKSMVPTNQSPWLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVL
          L ++      PW+++GDFN I  A EK G AP +P+ A  F   +D+C L+DL SSGP+FTW GP+  G++RVFERLDRA+ N  W+  F D SVRVL
Subjt:  PFLKSMVPTNQSPWLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVL

Query:  PMVLSDHHPII
        P + SDHHP++
Subjt:  PMVLSDHHPII

TrEMBL top hits (e value, % identity, alignment)
A0A2I0HUV7 Reverse transcriptase domain-containing protein (Fragment) (e value: 2.2e-53, % identity: 47.62)
Query:  LRDFILLHKPDIMVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFFTTVYGSPQRSSKRELWP
        +++ I  H+P+I+VIVEP+ISG  AD+VCR F  +S  RVEA    GGIWV+W+ + V +     ++QA+H R  Q   +  FT VY SP   ++R+LW 
Subjt:  LRDFILLHKPDIMVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFFTTVYGSPQRSSKRELWP

Query:  FLKSMVPTNQSPWLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVLP
         L ++      PW+++GDFN I  A EK G AP +P+ A  F   +D+C L+DL SSGP+FTW GP+  G++RVFERLDRA+ N  W+  F D SVRVLP
Subjt:  FLKSMVPTNQSPWLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVLP

Query:  MVLSDHHPII
         + SDHHP++
Subjt:  MVLSDHHPII

A0A445AC75 Uncharacterized protein (e value: 6.2e-40, % identity: 40.28)
Query:  LRDFILLHKPDIMVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFF-TTVYGSPQRSSKRELW
        LR+    +KPDI++++E K+SG  A +V R+    + +  EA    GGIW+ W  + +++  +  +QQ IH + +      +F T VY SPQ  ++  LW
Subjt:  LRDFILLHKPDIMVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFF-TTVYGSPQRSSKRELW

Query:  PFLKSMVPTNQSPWLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVL
          L ++     SPW++ GDFN I  A EK G + +       F   I+   LIDLG SG +FTWKGPL  G +RVF+RLDRAL+NS W+L   +  V+VL
Subjt:  PFLKSMVPTNQSPWLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVL

Query:  PMVLSDHHPII
        P   SDHHP++
Subjt:  PMVLSDHHPII

A0A6J1C8B2 uncharacterized protein LOC111009151 (e value: 1.0e-114, % identity: 100)
Query:  MVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFFTTVYGSPQRSSKRELWPFLKSMVPTNQSP
        MVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFFTTVYGSPQRSSKRELWPFLKSMVPTNQSP
Subjt:  MVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFFTTVYGSPQRSSKRELWPFLKSMVPTNQSP

Query:  WLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVLPMVLSDHHPII
        WLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVLPMVLSDHHPII
Subjt:  WLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVLPMVLSDHHPII

A0A6P4BTE3 uncharacterized protein LOC107461000 (e value: 1.6e-40, % identity: 39.34)
Query:  LRDFILLHKPDIMVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGF-FTTVYGSPQRSSKRELW
        L++F+  + PDI++++E K+SG  A  + +     +++  EA    GGIW+ WK + +S+  +  N+Q IH R  +     +  T VY SPQ +++R +W
Subjt:  LRDFILLHKPDIMVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGF-FTTVYGSPQRSSKRELW

Query:  PFLKSMVPTNQSPWLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVL
          ++ +      PWLLIGDFN I    EK G  P++      F   ID C L+DLG  G +FTW+GP   G+DRVF+RLDRAL+N  W+  F +  V+VL
Subjt:  PFLKSMVPTNQSPWLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVL

Query:  PMVLSDHHPII
        P   SDHHP++
Subjt:  PMVLSDHHPII

A0A6P8E5K3 uncharacterized protein LOC116212324 (e value: 9.9e-54, % identity: 47.87)
Query:  VLRDFILLHKPDIMVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFFTTVYGSPQRSSKRELW
        V+++ I  H+P+I+VIVEP+ISG  AD+VCR F  +S  RVEA    GGIWV+W+ + V +     ++QA+H R  Q   +  FT VY SP   ++R+LW
Subjt:  VLRDFILLHKPDIMVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFFTTVYGSPQRSSKRELW

Query:  PFLKSMVPTNQSPWLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVL
          L ++      PW+++GDFN I  A EK G AP +P+ A  F   +D+C L+DL SSGP+FTW GP+  G++RVFERLDRA+ N  W+  F D SVRVL
Subjt:  PFLKSMVPTNQSPWLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVL

Query:  PMVLSDHHPII
        P + SDHHP++
Subjt:  PMVLSDHHPII

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G40390.1 DNAse I-like superfamily protein (e value: 2.1e-08, % identity: 31.09)
Query:  SSKRELW---PFLKSMVPTNQSPWLLIGDFNVIASADEKDGRAP--LSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHW
        + +R LW     L +  P   SPWL++GDFN IAS  E     P  +S          +    L+DL   G  +TW        + +  +LDRA+ N  W
Subjt:  SSKRELW---PFLKSMVPTNQSPWLLIGDFNVIASADEKDGRAP--LSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHW

Query:  QLAFLDTSVRVLPMVLSDH
           F   S    P   SDH
Subjt:  QLAFLDTSVRVLPMVLSDH
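
In the alignment blocks above, the middle line between Query and Subjt marks identical residues with the residue letter and conservative substitutions with `+`. The following is a rough Python sketch of how identities and positives could be counted for a single block. The function name `midline_stats` is hypothetical. The % identity values listed with each hit cover the whole alignment and come from the search tool itself, so a per-block figure will generally differ from them.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block.

    A letter in the midline marks an identical residue; '+' marks a
    conservative substitution. Positives include identities.
    """
    # Trailing spaces in the midline are often stripped, so pad it back.
    midline = midline.ljust(len(query))[: len(query)]
    identities = sum(1 for m in midline if m.isalpha())
    positives = sum(1 for m in midline if m.isalpha() or m == "+")
    return identities, positives, len(query)


# Final block of the first GenBank hit (prefix columns removed).
query = "MVLSDHHPII"
midline = " + SDHHP++"
identities, positives, length = midline_stats(query, midline)
print(identities, positives, length)  # 5 8 10
print(f"{100 * identities / length:.1f}% identity in this block")  # 50.0%
```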


Sequences
CDS sequence
GTTCTTCGTGACTTTATTCTTCTGCATAAGCCTGATATTATGGTTATTGTGGAGCCTAAGATAAGTGGTCCTATTGCTGATTCGGTTTGTCGTAGTTTCCCCCATTTTTC
GTGGGTTCGAGTTGAAGCTGACCATTTAAAAGGTGGTATTTGGGTTTTTTGGAAGCTTGATCGAGTTTCTCTAATGGAGGTTGCTTGGAATCAGCAGGCTATTCATTTCC
GTTTTGATCAGGAGGCCTTATCGGGTTTCTTTACTACTGTCTATGGTAGCCCCCAACGCAGCTCTAAACGTGAGCTTTGGCCGTTTCTTAAATCCATGGTGCCAACCAAT
CAGAGTCCTTGGCTTTTGATTGGAGACTTTAATGTCATTGCTTCGGCTGATGAAAAGGATGGCAGAGCTCCCTTGAGCCCAATGGACGCTACTCCTTTTTTGACTACTAT
TGATCACTGTCAGCTTATCGATTTAGGAAGTTCTGGGCCTAAGTTTACTTGGAAGGGTCCCTTGATTGCTGGGTTTGATCGTGTCTTTGAACGGCTTGATCGAGCGTTGG
CAAATTCGCATTGGCAGTTGGCTTTTCTTGATACGTCAGTTCGTGTGCTGCCGATGGTGCTCTCAGACCACCACCCTATTATT
mRNA sequence
GTTCTTCGTGACTTTATTCTTCTGCATAAGCCTGATATTATGGTTATTGTGGAGCCTAAGATAAGTGGTCCTATTGCTGATTCGGTTTGTCGTAGTTTCCCCCATTTTTC
GTGGGTTCGAGTTGAAGCTGACCATTTAAAAGGTGGTATTTGGGTTTTTTGGAAGCTTGATCGAGTTTCTCTAATGGAGGTTGCTTGGAATCAGCAGGCTATTCATTTCC
GTTTTGATCAGGAGGCCTTATCGGGTTTCTTTACTACTGTCTATGGTAGCCCCCAACGCAGCTCTAAACGTGAGCTTTGGCCGTTTCTTAAATCCATGGTGCCAACCAAT
CAGAGTCCTTGGCTTTTGATTGGAGACTTTAATGTCATTGCTTCGGCTGATGAAAAGGATGGCAGAGCTCCCTTGAGCCCAATGGACGCTACTCCTTTTTTGACTACTAT
TGATCACTGTCAGCTTATCGATTTAGGAAGTTCTGGGCCTAAGTTTACTTGGAAGGGTCCCTTGATTGCTGGGTTTGATCGTGTCTTTGAACGGCTTGATCGAGCGTTGG
CAAATTCGCATTGGCAGTTGGCTTTTCTTGATACGTCAGTTCGTGTGCTGCCGATGGTGCTCTCAGACCACCACCCTATTATT
Protein sequence
VLRDFILLHKPDIMVIVEPKISGPIADSVCRSFPHFSWVRVEADHLKGGIWVFWKLDRVSLMEVAWNQQAIHFRFDQEALSGFFTTVYGSPQRSSKRELWPFLKSMVPTN
QSPWLLIGDFNVIASADEKDGRAPLSPMDATPFLTTIDHCQLIDLGSSGPKFTWKGPLIAGFDRVFERLDRALANSHWQLAFLDTSVRVLPMVLSDHHPII
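
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (the page does not name a translation table) and uses only the first 60 nucleotides of the CDS listed above; `translate` is an illustrative helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 60 nucleotides of the CDS sequence shown above.
cds_prefix = "GTTCTTCGTGACTTTATTCTTCTGCATAAGCCTGATATTATGGTTATTGTGGAGCCTAAG"
print(translate(cds_prefix))  # VLRDFILLHKPDIMVIVEPK
```

The output matches the first 20 residues of the protein sequence listed above. The same function can be applied to the full CDS after joining its lines.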