; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS027261 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS027261
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold1474:213771..223360
RNA-Seq ExpressionMS027261
SyntenyMS027261
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026071.1 uncharacterized protein E6C27_scaffold581G00620 [Cucumis melo var. makuwa]1.7e-0651.85Show/hide
Query:  MGYKDLTDRLAGILDFVWPIECCVDLCRPVTSLEVKETLFSMSSGKAPGPDGFS
        +GY++L+  +  I+ F W  ECC  L  P++  EV+  LFSM SGKAPGPDGFS
Subjt:  MGYKDLTDRLAGILDFVWPIECCVDLCRPVTSLEVKETLFSMSSGKAPGPDGFS

KAA0060725.1 uncharacterized protein E6C27_scaffold72G00120 [Cucumis melo var. makuwa]7.2e-2640.4Show/hide
Query:  RITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAWFERNL---FGWLGLRLIFFGGIVFGPFVF
        RITS +R+W+ARVLSF G LQ +RSVL S QVY A++F+LPA V ++V++IL S+LW+G +  + G KVAW E  L    G L ++      I  GP + 
Subjt:  RITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAWFERNL---FGWLGLRLIFFGGIVFGPFVF

Query:  HLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSWRWPRVSVELWELVSEVSSVPVSVGRVDVPIWIPASSGLFSVSSAWDVLRPARSFVPWFS
              VGE+ F         ++     ++ F+  DG W+WPRVS+EL +L   V +V   +   D  +WIP   G FS++S W+ +RP    V W S
Subjt:  HLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSWRWPRVSVELWELVSEVSSVPVSVGRVDVPIWIPASSGLFSVSSAWDVLRPARSFVPWFS

TYK28099.1 uncharacterized protein E5676_scaffold1467G00020 [Cucumis melo var. makuwa]1.2e-2539.19Show/hide
Query:  RITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAW------FERNLFG------W--------L
        RITS +R+W+ARVLSFAG+LQ +RSVL+S QVY A+VF+LPA V ++V++IL S+LW+G +  + G+KVAW      FE   FG      W        L
Subjt:  RITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAW------FERNLFG------W--------L

Query:  GLRLIFFGG---------IVFGPFVFHLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSWRWPRVSVELWELVSEVSSVPVSVGRVDVPIWIPAS
         L L   G          I+ G  ++ + S  VG  + L+ +L G  S             DG W WPRVS+EL +L   V  V   +   D  +W+   
Subjt:  GLRLIFFGG---------IVFGPFVFHLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSWRWPRVSVELWELVSEVSSVPVSVGRVDVPIWIPAS

Query:  SGLFSVSSAWDVLRPARSFVPW
         G FS+SSAW+ +RP    V W
Subjt:  SGLFSVSSAWDVLRPARSFVPW

TYK28099.1 uncharacterized protein E5676_scaffold1467G00020 [Cucumis melo var. makuwa]1.7e-0651.85Show/hide
Query:  MGYKDLTDRLAGILDFVWPIECCVDLCRPVTSLEVKETLFSMSSGKAPGPDGFS
        +GY++L+  +  I+ F W  ECC  L  P++  EV+  LFSM SGKAPGPDGFS
Subjt:  MGYKDLTDRLAGILDFVWPIECCVDLCRPVTSLEVKETLFSMSSGKAPGPDGFS

TYK28099.1 uncharacterized protein E5676_scaffold1467G00020 [Cucumis melo var. makuwa]1.2e-2538.94Show/hide
Query:  RITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAW------FERNLFG------W--------L
        RITS +R+W+ARVLSFAG+LQ +RSVL+S QVY A+VF+LPA V ++V++IL S+LW+G +  + G+KVAW      FE   FG      W        L
Subjt:  RITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAW------FERNLFG------W--------L

Query:  GLRLIFFGG---------IVFGPFVFHLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSWRWPRVSVELWELVSEVSSVPVSVGRVDVPIWIPAS
         L L   G          I+ G  ++ + S  VG  + L+ +L G  S             DG W WPRVS+EL +L   V  V   +   D  +W+   
Subjt:  GLRLIFFGG---------IVFGPFVFHLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSWRWPRVSVELWELVSEVSSVPVSVGRVDVPIWIPAS

Query:  SGLFSVSSAWDVLRPARSFVPWFSCF
         G FS+SSAW+ +RP    V W   F
Subjt:  SGLFSVSSAWDVLRPARSFVPWFSCF

XP_016902060.1 PREDICTED: uncharacterized protein LOC103496880 [Cucumis melo]7.2e-2640.4Show/hide
Query:  RITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAWFERNL---FGWLGLRLIFFGGIVFGPFVF
        RITS +R+W+ARVLSF G LQ +RSVL S QVY A++F+LPA V ++V++IL S+LW+G +  + G KVAW E  L    G L ++      I  GP + 
Subjt:  RITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAWFERNL---FGWLGLRLIFFGGIVFGPFVF

Query:  HLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSWRWPRVSVELWELVSEVSSVPVSVGRVDVPIWIPASSGLFSVSSAWDVLRPARSFVPWFS
              VGE+ F         ++     ++ F+  DG W+WPRVS+EL +L   V +V   +   D  +WIP   G FS++S W+ +RP    V W S
Subjt:  HLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSWRWPRVSVELWELVSEVSSVPVSVGRVDVPIWIPASSGLFSVSSAWDVLRPARSFVPWFS

XP_022158861.1 uncharacterized protein LOC111025324 [Momordica charantia]3.8e-2739.83Show/hide
Query:  VLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAWFERNLFGWLGLRLI----------------------FFGGI
        +LSFAG LQ I SVLQSFQVY A+VF+LPA VVH+VE++L SFLWKG +   SG KVA       G   L L+                        GG+
Subjt:  VLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAWFERNLFGWLGLRLI----------------------FFGGI

Query:  VFGPFVF---------------HLGSLGVGEKFFLFGMLFGLW---------------SNLLFDM-------VAYFLFPDGSWRWPRVSVELWELVSEVS
            F++               + G  G    F   G L  +W                 +++D+       V  FL PDGSWRWPRVSV+L EL+ EV 
Subjt:  VFGPFVF---------------HLGSLGVGEKFFLFGMLFGLW---------------SNLLFDM-------VAYFLFPDGSWRWPRVSVELWELVSEVS

Query:  SVPVSVGRVDVPIWIPASSGLFSVSSAWDVLRPARSFVPWF
        SV   VG+ D  +W PA SGLFSVSS W +LRP R  V +F
Subjt:  SVPVSVGRVDVPIWIPASSGLFSVSSAWDVLRPARSFVPWF

TrEMBL top hitse value%identityAlignment
A0A1S4E258 uncharacterized protein LOC1034968803.5e-2640.4Show/hide
Query:  RITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAWFERNL---FGWLGLRLIFFGGIVFGPFVF
        RITS +R+W+ARVLSF G LQ +RSVL S QVY A++F+LPA V ++V++IL S+LW+G +  + G KVAW E  L    G L ++      I  GP + 
Subjt:  RITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAWFERNL---FGWLGLRLIFFGGIVFGPFVF

Query:  HLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSWRWPRVSVELWELVSEVSSVPVSVGRVDVPIWIPASSGLFSVSSAWDVLRPARSFVPWFS
              VGE+ F         ++     ++ F+  DG W+WPRVS+EL +L   V +V   +   D  +WIP   G FS++S W+ +RP    V W S
Subjt:  HLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSWRWPRVSVELWELVSEVSSVPVSVGRVDVPIWIPASSGLFSVSSAWDVLRPARSFVPWFS

A0A5A7SPE5 Reverse transcriptase domain-containing protein5.9e-2638.94Show/hide
Query:  RITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAW------FERNLFG------W--------L
        RITS +R+W+ARVLSFAG+LQ +RSVL+S QVY A+VF+LPA V ++V++IL S+LW+G +  + G+KVAW      FE   FG      W        L
Subjt:  RITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAW------FERNLFG------W--------L

Query:  GLRLIFFGG---------IVFGPFVFHLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSWRWPRVSVELWELVSEVSSVPVSVGRVDVPIWIPAS
         L L   G          I+ G  ++ + S  VG  + L+ +L G  S             DG W WPRVS+EL +L   V  V   +   D  +W+   
Subjt:  GLRLIFFGG---------IVFGPFVFHLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSWRWPRVSVELWELVSEVSSVPVSVGRVDVPIWIPAS

Query:  SGLFSVSSAWDVLRPARSFVPWFSCF
         G FS+SSAW+ +RP    V W   F
Subjt:  SGLFSVSSAWDVLRPARSFVPWFSCF

A0A5A7SPE5 Reverse transcriptase domain-containing protein8.0e-0751.85Show/hide
Query:  MGYKDLTDRLAGILDFVWPIECCVDLCRPVTSLEVKETLFSMSSGKAPGPDGFS
        +GY++L+  +  I+ F W  ECC  L  P++  EV+  LFSM SGKAPGPDGFS
Subjt:  MGYKDLTDRLAGILDFVWPIECCVDLCRPVTSLEVKETLFSMSSGKAPGPDGFS

A0A5A7SPE5 Reverse transcriptase domain-containing protein5.9e-2639.19Show/hide
Query:  RITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAW------FERNLFG------W--------L
        RITS +R+W+ARVLSFAG+LQ +RSVL+S QVY A+VF+LPA V ++V++IL S+LW+G +  + G+KVAW      FE   FG      W        L
Subjt:  RITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAW------FERNLFG------W--------L

Query:  GLRLIFFGG---------IVFGPFVFHLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSWRWPRVSVELWELVSEVSSVPVSVGRVDVPIWIPAS
         L L   G          I+ G  ++ + S  VG  + L+ +L G  S             DG W WPRVS+EL +L   V  V   +   D  +W+   
Subjt:  GLRLIFFGG---------IVFGPFVFHLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSWRWPRVSVELWELVSEVSSVPVSVGRVDVPIWIPAS

Query:  SGLFSVSSAWDVLRPARSFVPW
         G FS+SSAW+ +RP    V W
Subjt:  SGLFSVSSAWDVLRPARSFVPW

A0A5A7V4C1 Reverse transcriptase domain-containing protein3.5e-2640.4Show/hide
Query:  RITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAWFERNL---FGWLGLRLIFFGGIVFGPFVF
        RITS +R+W+ARVLSF G LQ +RSVL S QVY A++F+LPA V ++V++IL S+LW+G +  + G KVAW E  L    G L ++      I  GP + 
Subjt:  RITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAWFERNL---FGWLGLRLIFFGGIVFGPFVF

Query:  HLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSWRWPRVSVELWELVSEVSSVPVSVGRVDVPIWIPASSGLFSVSSAWDVLRPARSFVPWFS
              VGE+ F         ++     ++ F+  DG W+WPRVS+EL +L   V +V   +   D  +WIP   G FS++S W+ +RP    V W S
Subjt:  HLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSWRWPRVSVELWELVSEVSSVPVSVGRVDVPIWIPASSGLFSVSSAWDVLRPARSFVPWFS

A0A5D3DXE4 Reverse transcriptase domain-containing protein8.0e-0751.85Show/hide
Query:  MGYKDLTDRLAGILDFVWPIECCVDLCRPVTSLEVKETLFSMSSGKAPGPDGFS
        +GY++L+  +  I+ F W  ECC  L  P++  EV+  LFSM SGKAPGPDGFS
Subjt:  MGYKDLTDRLAGILDFVWPIECCVDLCRPVTSLEVKETLFSMSSGKAPGPDGFS

A0A6J1E271 uncharacterized protein LOC1110253241.8e-2739.83Show/hide
Query:  VLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAWFERNLFGWLGLRLI----------------------FFGGI
        +LSFAG LQ I SVLQSFQVY A+VF+LPA VVH+VE++L SFLWKG +   SG KVA       G   L L+                        GG+
Subjt:  VLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAWFERNLFGWLGLRLI----------------------FFGGI

Query:  VFGPFVF---------------HLGSLGVGEKFFLFGMLFGLW---------------SNLLFDM-------VAYFLFPDGSWRWPRVSVELWELVSEVS
            F++               + G  G    F   G L  +W                 +++D+       V  FL PDGSWRWPRVSV+L EL+ EV 
Subjt:  VFGPFVF---------------HLGSLGVGEKFFLFGMLFGLW---------------SNLLFDM-------VAYFLFPDGSWRWPRVSVELWELVSEVS

Query:  SVPVSVGRVDVPIWIPASSGLFSVSSAWDVLRPARSFVPWF
        SV   VG+ D  +W PA SGLFSVSS W +LRP R  V +F
Subjt:  SVPVSVGRVDVPIWIPASSGLFSVSSAWDVLRPARSFVPWF

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657508.0e-0424.52Show/hide
Query:  RITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAWFE---RNLFGWLGLRLIFFGGIVFGPFVF
        R++S +  W  + LSFAG+L   ++VL S  V+  +  +LP  +++ ++Q+  +FLW     ++    V W +       G LG+R       +    + 
Subjt:  RITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAWFE---RNLFGWLGLRLIFFGGIVFGPFVF

Query:  HLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSW--RWPRVSVELWELVS
         +G   + EK  L+ ++     ++     + +L P GSW   W  +++ L ++VS
Subjt:  HLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSW--RWPRVSVELWELVS

Arabidopsis top hitse value%identityAlignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.8e-0940.91Show/hide
Query:  VRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAW
        +  W+AR LSFAG+LQ I SV+ S   +  + F LP+  + +++ I  SFLW G +      KVAW
Subjt:  VRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFLWKGHKSRQSGVKVAW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTATAAAGACCTGACTGATCGACTCGCTGGGATTCTTGATTTTGTTTGGCCGATTGAGTGTTGTGTGGACTTATGTCGTCCTGTGACCTCTTTAGAGGTTAAGGA
GACTTTATTTTCTATGAGTAGTGGGAAGGCTCCTGGCCCTGATGGGTTTTCTCGGATTACCTCTTGTGTTCGGAATTGGTCGGCTAGGGTGCTTTCTTTTGCTGGCCAAT
TGCAGTTTATTCGATCGGTTCTGCAGAGTTTTCAGGTCTATTGTGCCAATGTTTTTATCCTTCCGGCTCCTGTTGTTCATGATGTTGAGCAGATTTTGTGTTCTTTCTTG
TGGAAGGGGCACAAGAGCAGGCAATCGGGGGTTAAGGTGGCTTGGTTTGAGCGGAATCTCTTTGGGTGGCTTGGGTTGAGGCTTATATTCTTCGGGGGTATTGTATTTGG
ACCGTTCGTGTTTCACCTCGGTTCTCTTGGTGTTGGTGAGAAATTCTTTCTGTTCGGAATGCTTTTCGGCCTCTGGTCCAATTTGCTATTCGATATGGTTGCTTATTTTC
TGTTTCCCGATGGTTCGTGGCGCTGGCCTAGGGTGTCGGTTGAGCTTTGGGAACTGGTCTCTGAGGTTTCATCTGTGCCGGTTTCAGTGGGGAGGGTTGATGTGCCCATT
TGGATTCCGGCATCGTCGGGTCTCTTTTCGGTGTCTAGTGCGTGGGATGTGCTGCGGCCGGCTCGGTCTTTTGTTCCTTGGTTTTCTTGCTTTGATTTGGTGGGAACATT
CCAAAGCATTCTTTTATCGCTTGGTTGGCGACTATTTGTGCTTGCCGACCTAAAACGTGATGAGTTGAGAAAGAAGATCAAGCTCCACCGACCTGGTGGCCCGACAACTG
TAACCGTCGATCGGAACGGCTTCAACGACTGTCCCACTATCTTAAAGGCCACACATTGGCAGCTCACTCCGATTAATTCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTATAAAGACCTGACTGATCGACTCGCTGGGATTCTTGATTTTGTTTGGCCGATTGAGTGTTGTGTGGACTTATGTCGTCCTGTGACCTCTTTAGAGGTTAAGGA
GACTTTATTTTCTATGAGTAGTGGGAAGGCTCCTGGCCCTGATGGGTTTTCTCGGATTACCTCTTGTGTTCGGAATTGGTCGGCTAGGGTGCTTTCTTTTGCTGGCCAAT
TGCAGTTTATTCGATCGGTTCTGCAGAGTTTTCAGGTCTATTGTGCCAATGTTTTTATCCTTCCGGCTCCTGTTGTTCATGATGTTGAGCAGATTTTGTGTTCTTTCTTG
TGGAAGGGGCACAAGAGCAGGCAATCGGGGGTTAAGGTGGCTTGGTTTGAGCGGAATCTCTTTGGGTGGCTTGGGTTGAGGCTTATATTCTTCGGGGGTATTGTATTTGG
ACCGTTCGTGTTTCACCTCGGTTCTCTTGGTGTTGGTGAGAAATTCTTTCTGTTCGGAATGCTTTTCGGCCTCTGGTCCAATTTGCTATTCGATATGGTTGCTTATTTTC
TGTTTCCCGATGGTTCGTGGCGCTGGCCTAGGGTGTCGGTTGAGCTTTGGGAACTGGTCTCTGAGGTTTCATCTGTGCCGGTTTCAGTGGGGAGGGTTGATGTGCCCATT
TGGATTCCGGCATCGTCGGGTCTCTTTTCGGTGTCTAGTGCGTGGGATGTGCTGCGGCCGGCTCGGTCTTTTGTTCCTTGGTTTTCTTGCTTTGATTTGGTGGGAACATT
CCAAAGCATTCTTTTATCGCTTGGTTGGCGACTATTTGTGCTTGCCGACCTAAAACGTGATGAGTTGAGAAAGAAGATCAAGCTCCACCGACCTGGTGGCCCGACAACTG
TAACCGTCGATCGGAACGGCTTCAACGACTGTCCCACTATCTTAAAGGCCACACATTGGCAGCTCACTCCGATTAATTCGTGA
Protein sequenceShow/hide protein sequence
MGYKDLTDRLAGILDFVWPIECCVDLCRPVTSLEVKETLFSMSSGKAPGPDGFSRITSCVRNWSARVLSFAGQLQFIRSVLQSFQVYCANVFILPAPVVHDVEQILCSFL
WKGHKSRQSGVKVAWFERNLFGWLGLRLIFFGGIVFGPFVFHLGSLGVGEKFFLFGMLFGLWSNLLFDMVAYFLFPDGSWRWPRVSVELWELVSEVSSVPVSVGRVDVPI
WIPASSGLFSVSSAWDVLRPARSFVPWFSCFDLVGTFQSILLSLGWRLFVLADLKRDELRKKIKLHRPGGPTTVTVDRNGFNDCPTILKATHWQLTPINS