CuGenDBv2

MS027468 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS027468
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold476:148356..149888
RNA-Seq Expression: MS027468
Synteny: MS027468
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
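
The genome location field in this record uses a `scaffold:start..end` string. As a minimal sketch, it could be split into its parts as follows. The names `Location` and `parse_location` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into its three components."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("scaffold476:148356..149888")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the genomic span covers 1533 positions.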


Homology
GenBank top hits | e value | % identity | Alignment
KAF5443558.1 hypothetical protein F2P56_036105, partial [Juglans regia] | 1.3e-19 | 43.22
Query:  CFTVDSLGLSGGLCLLWKDSVDITIRSFSYHHIDCSVM-LNNISWRFIGFYGHSASHKRHLTWELLRRIHNMDDSPWVIGGDFNSILRSYEASQSSSYDS
        CFTVD +G SGGL LLWK  + + ++SFS HHID  +   +   WRF G YG+     R+LTW LLRR+++  D PW++GGDFN +L   E         
Subjt:  CFTVDSLGLSGGLCLLWKDSVDITIRSFSYHHIDCSVM-LNNISWRFIGFYGHSASHKRHLTWELLRRIHNMDDSPWVIGGDFNSILRSYEASQSSSYDS

Query:  THISAFRNLVDHCGQTNL
          + AFRN++  C   +L
Subjt:  THISAFRNLVDHCGQTNL

PPR88589.1 hypothetical protein GOBAR_AA32102 [Gossypium barbadense] | 2.3e-19 | 41.67
Query:  KGCFTVDSLGLSGGLCLLWKDSVDITIRSFSYHHIDCSVMLNNIS-WRFIGFYGHSASHKRHLTWELLRRIHNMDDSPWVIGGDFNSILRSYEASQSSSY
        +GC  VDS G SGGL LLW+D +D++++++S  HID  V L N    RFIGFYG +AS  R   W++LRR+H+  +  W++GGDFN+IL   E     S 
Subjt:  KGCFTVDSLGLSGGLCLLWKDSVDITIRSFSYHHIDCSVMLNNIS-WRFIGFYGHSASHKRHLTWELLRRIHNMDDSPWVIGGDFNSILRSYEASQSSSY

Query:  DSTHISAFRNLVDHCGQTNL
            +  F N+++    T++
Subjt:  DSTHISAFRNLVDHCGQTNL

XP_020412490.1 uncharacterized protein LOC18793550 [Prunus persica] | 1.2e-20 | 28.17
Query:  CFTVDSLGLSGGLCLLWKDSVDITIRSFSYHHIDCSV--MLNNISWRFIGFYGHSASHKRHLTWELLRRIHNMDDSPWVIGGDFNSILRSYEASQSSSYD
        CF VD++GLSGGLCL WK  +++ IRS S HHID  V  + +++ WR  GFYG+ A+   HL+W LLR + +    PWV  GDFN +L + E        
Subjt:  CFTVDSLGLSGGLCLLWKDSVDITIRSFSYHHIDCSV--MLNNISWRFIGFYGHSASHKRHLTWELLRRIHNMDDSPWVIGGDFNSILRSYEASQSSSYD

Query:  STHISAFRNLVDHCGQTNLLGAITALLLHNFGSVLIVFYAMILSTIFFQVLMSLISIGQNLTTELLQCIYLLLLIIVARGIGNHSGLRSIGHGTHSISAT
           + AFR+ +  C   ++           F      +++                                           + G++      HS +  
Subjt:  STHISAFRNLVDHCGQTNLLGAITALLLHNFGSVLIVFYAMILSTIFFQVLMSLISIGQNLTTELLQCIYLLLLIIVARGIGNHSGLRSIGHGTHSISAT

Query:  SSALRSSG-------RSNVRELFRQIRTQKAAIANAYNQPHPLN-FSIIHMLEDDLAGLLELEEIYWKQRSREEWLKWGDCKTR
           L+  G       RS       +I+  ++ +   + QP+  N  +  H+L   L  LL  EE +WKQRS+  WLK GD  TR
Subjt:  SSALRSSG-------RSNVRELFRQIRTQKAAIANAYNQPHPLN-FSIIHMLEDDLAGLLELEEIYWKQRSREEWLKWGDCKTR

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] | 6.3e-46 | 51.3
Query:  SWRFIGFYGHSASHKRHLTWELLRRIHNMDDSPWVIGGDFNSILRSYEASQSSSYDSTHISAFRNLVDHCGQTNL--LGAITALLLHNF-GSVLIVFYAM
        S RF GFYGH A+HKRHLTWELLRRI N+D SPW+IGGD N+IL +YEAS +SSYD++ I AFRN++D C  T++   G I     + F G  L      
Subjt:  SWRFIGFYGHSASHKRHLTWELLRRIHNMDDSPWVIGGDFNSILRSYEASQSSSYDSTHISAFRNLVDHCGQTNL--LGAITALLLHNF-GSVLIVFYAM

Query:  ILSTIFFQVLMSLISIGQNLTTELLQCIYLLLLIIVARGIGNHSGLRSIGHGTHSISATSSALRSSGRSNVRELFRQIRTQKAAIANAYNQPHPLNFSII
         L    F  +    S                         G+ S        + SI A+SSALR  GRSNV +LF+QI+ QKAAI +AYNQP PL+F+II
Subjt:  ILSTIFFQVLMSLISIGQNLTTELLQCIYLLLLIIVARGIGNHSGLRSIGHGTHSISATSSALRSSGRSNVRELFRQIRTQKAAIANAYNQPHPLNFSII

Query:  HMLEDDLAGLLELEEIYWKQRSREEWLKWG
        H LE+DLAGLLELEEI+WKQRSRE+WLKWG
Subjt:  HMLEDDLAGLLELEEIYWKQRSREEWLKWG

XP_042950313.1 uncharacterized protein LOC122282426 [Carya illinoinensis] | 6.6e-19 | 37.78
Query:  LQQVMFSTTSED--KINEVAKGCFTVDSLGLSGGLCLLWKDSVDITIRSFSYHHIDCSVMLNNIS-WRFIGFYGHSASHKRHLTWELLRRIHNMDDSPWV
        LQ+   ST   D  K       CF+VDS G SGGL LLW   + + +RSFS +HID  + ++++  WRF G YGH  + +R  TW L+R + +++  PW+
Subjt:  LQQVMFSTTSED--KINEVAKGCFTVDSLGLSGGLCLLWKDSVDITIRSFSYHHIDCSVMLNNIS-WRFIGFYGHSASHKRHLTWELLRRIHNMDDSPWV

Query:  IGGDFNSILRSYEASQSSSYDSTHISAFRNLVDHC
        +GGD N +L  +E         + I AFR ++  C
Subjt:  IGGDFNSILRSYEASQSSSYDSTHISAFRNLVDHC

TrEMBL top hits | e value | % identity | Alignment
A0A2N9EFD5 Reverse transcriptase domain-containing protein | 5.3e-22 | 30.46
Query:  CFTVDSLGLSGGLCLLWKDSVDITIRSFSYHHIDCSVMLN-NISWRFIGFYGHSASHKRHLTWELLRRIHNMDDSPWVIGGDFNSILRSYEASQSSSYDS
        CF VD  G  GGL LLW   V++ IRSFS+HHID  V  +    W+  GFYG+    +RHL+W LLR++H++   PW++ GDFN I+   E         
Subjt:  CFTVDSLGLSGGLCLLWKDSVDITIRSFSYHHIDCSVMLN-NISWRFIGFYGHSASHKRHLTWELLRRIHNMDDSPWVIGGDFNSILRSYEASQSSSYDS

Query:  THISAFRNLVDHCGQTNLLGAITALLLHNFGSVLIVFYAMILSTIFFQ--VLMSLISIGQNLTTELLQ-----CIYLLLLIIVARGIGNHSGLRSIG---
        T ++AFR +++ CG            L + G V    +  +    F Q  ++ SL  IG  +  E  Q     C   +           HS +R  G   
Subjt:  THISAFRNLVDHCGQTNLLGAITALLLHNFGSVLIVFYAMILSTIFFQ--VLMSLISIGQNLTTELLQ-----CIYLLLLIIVARGIGNHSGLRSIG---

Query:  ----------HGT------HSISATSSALRSSGRSNVRELFRQIRTQKAAIANAYNQPHPLNFSI-IHMLEDDLAGLLELEEIYWKQRSREEWLKWGDCK
                  HGT        I      L     S V    + + +++A  ++  N P     ++ +++L  +L  L+E EEI+W+QRSR  WL+ GD  
Subjt:  ----------HGT------HSISATSSALRSSGRSNVRELFRQIRTQKAAIANAYNQPHPLNFSI-IHMLEDDLAGLLELEEIYWKQRSREEWLKWGDCK

Query:  TR
        T+
Subjt:  TR

A0A2N9FYH3 CCHC-type domain-containing protein | 9.9e-21 | 31.5
Query:  LKKQKLQQVMFSTTSEDKINEV--------AKGCFTVDSLGLSGGLCLLWKDSVDITIRSFSYHHIDCSV-MLNNISWRFIGFYGHSASHKRHLTWELLR
        L K ++ Q++F   +   I ++         KGCF VD     GGL LLW DSVDI I+S+S HHIDC V      SWRF GFYG   +  RH +WELLR
Subjt:  LKKQKLQQVMFSTTSEDKINEV--------AKGCFTVDSLGLSGGLCLLWKDSVDITIRSFSYHHIDCSV-MLNNISWRFIGFYGHSASHKRHLTWELLR

Query:  RIHNMDDSPWVIGGDFNSILRSYEASQSSSYDSTHISAFRNLVDHCGQTNLLGAITALLLHNFGSVLIVFYAMILSTIFFQVLMSLISIGQNLTTELLQC
        R+  M +  W++ GDFN I  S E      Y+ T  +  RN  +   +    G  T   +  F    I  +    S     +L++  ++         + 
Subjt:  RIHNMDDSPWVIGGDFNSILRSYEASQSSSYDSTHISAFRNLVDHCGQTNLLGAITALLLHNFGSVLIVFYAMILSTIFFQVLMSLISIGQNLTTELLQC

Query:  IYLLLLIIVARGIGNHSGLRSIG-------------HGTH------SISATSSALRSSGRSNVRELFRQIRTQKAAIANAYNQP-HPLNFSIIHMLEDDL
         +             H+ LR  G              GTH       I     AL S  +S  R L + I  ++A +   Y    + +N      L  DL
Subjt:  IYLLLLIIVARGIGNHSGLRSIG-------------HGTH------SISATSSALRSSGRSNVRELFRQIRTQKAAIANAYNQP-HPLNFSIIHMLEDDL

Query:  AGLLELEEIYWKQRSREEWLKWGDCKT
          LL  EEIYW+QRSR  WL+ GD  T
Subjt:  AGLLELEEIYWKQRSREEWLKWGDCKT

A0A2N9I509 Uncharacterized protein | 1.5e-21 | 27.42
Query:  INADPNVASNVNIPSKEFNLPLCSDPPQSSITTMDASPIFTGNLHRWKQKAR---------AIHNSSSPFGP----SLNI-----PEAQLQPTNVGPSKR
        + A P    +V IPS +      + P    ++T     + +GN   WK++AR         ++  + +P GP    SLN      PE  +    +   K 
Subjt:  INADPNVASNVNIPSKEFNLPLCSDPPQSSITTMDASPIFTGNLHRWKQKAR---------AIHNSSSPFGP----SLNI-----PEAQLQPTNVGPSKR

Query:  KASPHQKASKLKKQKLQQVMFSTTSEDKINEVAKGCFTVDSLGLSGGLCLLWKDSVDITIRSFSYHHIDCSVMLNNIS-WRFIGFYGHSASHKRHLTWEL
                ++L  + L+ +        ++    KGCF VD  GL GGL LLW DSV +TI+S+S HHID  V  +N S WR   FYGH     R  TW L
Subjt:  KASPHQKASKLKKQKLQQVMFSTTSEDKINEVAKGCFTVDSLGLSGGLCLLWKDSVDITIRSFSYHHIDCSVMLNNIS-WRFIGFYGHSASHKRHLTWEL

Query:  LRRIHNMDDSPWVIGGDFNSILRSYEASQSSSYDSTHISAFRNLVDHCGQTNL--LGAITALLLHNFGSVLI-----------VFYAMILSTIF------
        LR++ ++ D PW++ GDFN I+   E     + +   ++ FR+ + HC   +L   G          GS L+            + A+   T+       
Subjt:  LRRIHNMDDSPWVIGGDFNSILRSYEASQSSSYDSTHISAFRNLVDHCGQTNL--LGAITALLLHNFGSVLI-----------VFYAMILSTIF------

Query:  ----------FQVLMSLISIGQNLTTELLQCIYLLLL-----IIVARGIGNHSGLRSIGHGTHSISATSSALRSSGRSNVRELFRQIRTQKAAIANAYNQ
                   Q L+S  S       +L +  +  L       ++     +     ++   +  I     AL     S+VR   R I  +KA + +   Q
Subjt:  ----------FQVLMSLISIGQNLTTELLQCIYLLLL-----IIVARGIGNHSGLRSIGHGTHSISATSSALRSSGRSNVRELFRQIRTQKAAIANAYNQ

Query:  P-HPLNFSIIHMLEDDLAGLLELEEIYWKQRSREEWLKWGDCKTR
        P    +   I++L  +L GLLE EEI W+Q+SR  WL+ GD  T+
Subjt:  P-HPLNFSIIHMLEDDLAGLLELEEIYWKQRSREEWLKWGDCKTR

A0A6J1DX30 uncharacterized protein LOC111024874 | 3.1e-46 | 51.3
Query:  SWRFIGFYGHSASHKRHLTWELLRRIHNMDDSPWVIGGDFNSILRSYEASQSSSYDSTHISAFRNLVDHCGQTNL--LGAITALLLHNF-GSVLIVFYAM
        S RF GFYGH A+HKRHLTWELLRRI N+D SPW+IGGD N+IL +YEAS +SSYD++ I AFRN++D C  T++   G I     + F G  L      
Subjt:  SWRFIGFYGHSASHKRHLTWELLRRIHNMDDSPWVIGGDFNSILRSYEASQSSSYDSTHISAFRNLVDHCGQTNL--LGAITALLLHNF-GSVLIVFYAM

Query:  ILSTIFFQVLMSLISIGQNLTTELLQCIYLLLLIIVARGIGNHSGLRSIGHGTHSISATSSALRSSGRSNVRELFRQIRTQKAAIANAYNQPHPLNFSII
         L    F  +    S                         G+ S        + SI A+SSALR  GRSNV +LF+QI+ QKAAI +AYNQP PL+F+II
Subjt:  ILSTIFFQVLMSLISIGQNLTTELLQCIYLLLLIIVARGIGNHSGLRSIGHGTHSISATSSALRSSGRSNVRELFRQIRTQKAAIANAYNQPHPLNFSII

Query:  HMLEDDLAGLLELEEIYWKQRSREEWLKWG
        H LE+DLAGLLELEEI+WKQRSRE+WLKWG
Subjt:  HMLEDDLAGLLELEEIYWKQRSREEWLKWG

M5XQU7 Uncharacterized protein | 3.1e-22 | 31.21
Query:  CFTVDSLGLSGGLCLLWKDSVDITIRSFSYHHIDCSV--MLNNISWRFIGFYGHSASHKRHLTWELLRRIHNMDDSPWVIGGDFNSILRSYEASQSSSYD
        CF VD++GLSGGLCL WK  +++ IRS S HHID  V  + +++ WR  GFYG+ A+   HL+W LLR + +    PWV  GDFN +L + E        
Subjt:  CFTVDSLGLSGGLCLLWKDSVDITIRSFSYHHIDCSV--MLNNISWRFIGFYGHSASHKRHLTWELLRRIHNMDDSPWVIGGDFNSILRSYEASQSSSYD

Query:  STHISAFRNLVDHCGQTNLLGAITALLLHNFGSVLIVFYAMILSTIFFQVLMSLISIGQNLTTELLQCIYLLLLIIVARGIGNHSGLRSIGHGTHSISAT
           + AFR+ +  C   ++           F      +++                I + L   L  C +  L     +   +H    S  H    + A+
Subjt:  STHISAFRNLVDHCGQTNLLGAITALLLHNFGSVLIVFYAMILSTIFFQVLMSLISIGQNLTTELLQCIYLLLLIIVARGIGNHSGLRSIGHGTHSISAT

Query:  SSALRSSGRSNVR--ELFRQIRTQKAAIANAYNQPHP----LNFSIIHMLEDDLAGLLELEEIYWKQRSREEWLKWGDCKTR
         +      RS  R   ++ Q    ++ IANA+N           S  H+L   L  LL  EE +WKQRS+  WLK GD  TR
Subjt:  SSALRSSGRSNVR--ELFRQIRTQKAAIANAYNQPHP----LNFSIIHMLEDDLAGLLELEEIYWKQRSREEWLKWGDCKTR

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAATTCTGCAGCATTTATGGTGGCTTTGAAATTGCAAATTATTTCTTCAATAGTTACGGAATCTCCAGATCGAATTATAAATGCAGATCCTAATGTGGCCAGTAATGT
CAACATTCCTTCTAAGGAATTCAATTTACCATTATGTTCTGACCCTCCACAATCATCAATTACCACAATGGATGCTAGCCCCATTTTTACTGGTAATTTACATCGTTGGA
AGCAAAAGGCAAGGGCTATTCACAACAGTAGTTCACCCTTCGGCCCAAGTCTTAATATTCCAGAAGCGCAACTTCAGCCTACAAATGTTGGACCATCTAAGAGGAAGGCC
TCCCCACATCAAAAAGCCTCTAAACTAAAGAAGCAGAAGTTACAACAGGTCATGTTTTCAACCACGTCAGAGGACAAGATTAATGAAGTGGCGAAGGGTTGTTTTACAGT
GGATAGTTTGGGCCTCAGCGGAGGTCTTTGTTTACTTTGGAAGGATTCTGTTGATATTACAATTAGATCCTTCTCCTATCACCATATTGACTGTTCAGTTATGTTGAATA
ATATCTCATGGCGCTTCATAGGTTTCTATGGACATTCAGCAAGCCATAAACGCCACCTCACTTGGGAGCTTCTTCGTAGAATTCATAATATGGATGATTCTCCCTGGGTG
ATTGGAGGGGATTTCAATTCTATCTTACGGAGCTATGAAGCCTCTCAATCTAGTTCATATGATTCCACACATATTTCCGCCTTCAGGAATTTAGTGGACCACTGTGGCCA
GACCAATTTACTTGGTGCAATAACCGCTTTGCTACTTCACAACTTTGGAAGCGTCTTGATCGTTTTTTATGCAATGATTCTTTCTACTATCTTTTTCCAAGTGCTCATGT
CACTCATCTCAATTGGTCAAAATCTGACCACTGAGCTATTGCAATGTATCTATTTACTCCTCCTTATAATTGTGGCTCGAGGAATAGGAAACCATTCCGGTTTGAGGAGT
ATTGGTCACGGAACCCATAGTATTAGTGCAACTTCTTCTGCTTTGCGTTCTTCGGGTCGGTCAAATGTCAGGGAACTTTTTAGACAAATTCGAACCCAAAAAGCAGCCAT
TGCAAATGCTTATAATCAGCCTCATCCGCTGAATTTTTCTATTATTCATATGTTGGAGGATGATCTAGCTGGTCTTCTTGAATTGGAGGAGATATACTGGAAGCAAAGAT
CTCGCGAAGAGTGGCTCAAATGGGGTGATTGTAAAACACGAAGTGGTTCCACAAAAAAGCTTCCATAA
mRNA sequence
ATGAATTCTGCAGCATTTATGGTGGCTTTGAAATTGCAAATTATTTCTTCAATAGTTACGGAATCTCCAGATCGAATTATAAATGCAGATCCTAATGTGGCCAGTAATGT
CAACATTCCTTCTAAGGAATTCAATTTACCATTATGTTCTGACCCTCCACAATCATCAATTACCACAATGGATGCTAGCCCCATTTTTACTGGTAATTTACATCGTTGGA
AGCAAAAGGCAAGGGCTATTCACAACAGTAGTTCACCCTTCGGCCCAAGTCTTAATATTCCAGAAGCGCAACTTCAGCCTACAAATGTTGGACCATCTAAGAGGAAGGCC
TCCCCACATCAAAAAGCCTCTAAACTAAAGAAGCAGAAGTTACAACAGGTCATGTTTTCAACCACGTCAGAGGACAAGATTAATGAAGTGGCGAAGGGTTGTTTTACAGT
GGATAGTTTGGGCCTCAGCGGAGGTCTTTGTTTACTTTGGAAGGATTCTGTTGATATTACAATTAGATCCTTCTCCTATCACCATATTGACTGTTCAGTTATGTTGAATA
ATATCTCATGGCGCTTCATAGGTTTCTATGGACATTCAGCAAGCCATAAACGCCACCTCACTTGGGAGCTTCTTCGTAGAATTCATAATATGGATGATTCTCCCTGGGTG
ATTGGAGGGGATTTCAATTCTATCTTACGGAGCTATGAAGCCTCTCAATCTAGTTCATATGATTCCACACATATTTCCGCCTTCAGGAATTTAGTGGACCACTGTGGCCA
GACCAATTTACTTGGTGCAATAACCGCTTTGCTACTTCACAACTTTGGAAGCGTCTTGATCGTTTTTTATGCAATGATTCTTTCTACTATCTTTTTCCAAGTGCTCATGT
CACTCATCTCAATTGGTCAAAATCTGACCACTGAGCTATTGCAATGTATCTATTTACTCCTCCTTATAATTGTGGCTCGAGGAATAGGAAACCATTCCGGTTTGAGGAGT
ATTGGTCACGGAACCCATAGTATTAGTGCAACTTCTTCTGCTTTGCGTTCTTCGGGTCGGTCAAATGTCAGGGAACTTTTTAGACAAATTCGAACCCAAAAAGCAGCCAT
TGCAAATGCTTATAATCAGCCTCATCCGCTGAATTTTTCTATTATTCATATGTTGGAGGATGATCTAGCTGGTCTTCTTGAATTGGAGGAGATATACTGGAAGCAAAGAT
CTCGCGAAGAGTGGCTCAAATGGGGTGATTGTAAAACACGAAGTGGTTCCACAAAAAAGCTTCCATAA
Protein sequence
MNSAAFMVALKLQIISSIVTESPDRIINADPNVASNVNIPSKEFNLPLCSDPPQSSITTMDASPIFTGNLHRWKQKARAIHNSSSPFGPSLNIPEAQLQPTNVGPSKRKA
SPHQKASKLKKQKLQQVMFSTTSEDKINEVAKGCFTVDSLGLSGGLCLLWKDSVDITIRSFSYHHIDCSVMLNNISWRFIGFYGHSASHKRHLTWELLRRIHNMDDSPWV
IGGDFNSILRSYEASQSSSYDSTHISAFRNLVDHCGQTNLLGAITALLLHNFGSVLIVFYAMILSTIFFQVLMSLISIGQNLTTELLQCIYLLLLIIVARGIGNHSGLRS
IGHGTHSISATSSALRSSGRSNVRELFRQIRTQKAAIANAYNQPHPLNFSIIHMLEDDLAGLLELEEIYWKQRSREEWLKWGDCKTRSGSTKKLP
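
The protein sequence listed here would normally be the conceptual translation of the CDS sequence. The following is a minimal sketch of how that relationship could be checked, assuming the standard genetic code (the record does not say which translation table was used). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed in this record.
print(translate("ATGAATTCTGCAGCATTTATGGTGGCT"))  # MNSAAFMVA
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the listed protein sequence gives a quick consistency check between the two fields.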