; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS008092 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS008092
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionzf-RVT domain-containing protein
Genome locationscaffold703:185096..185649
RNA-Seq ExpressionMS008092
SyntenyMS008092
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044556.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]5.7e-2032.93Show/hide
Query:  IWRMEPSGLFSTSSLLCDMMN---GPKNTEAPLLYKSIWKDLYPKKVKFVLWE-VSLKANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSS
        +W++  + +F T+S+  D+      P N   P LYK++WK  +PKK KF +W  +    NT + LQ+R+P  ++SP+WC +C  S E   H+F+ C YS 
Subjt:  IWRMEPSGLFSTSSLLCDMMN---GPKNTEAPLLYKSIWKDLYPKKVKFVLWE-VSLKANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSS

Query:  RVWSKILSLFGWSTAFVSNTSHPMAYTLTGHPFDHEKNL-WLHFVRALFWSIWKESNHRTFQDK
        ++WSK  +L  W++    N    +A  +       +K L   + +  L W IW E N+R F+ +
Subjt:  RVWSKILSLFGWSTAFVSNTSHPMAYTLTGHPFDHEKNL-WLHFVRALFWSIWKESNHRTFQDK

VVA32248.1 PREDICTED: ribonuclease H [Prunus dulcis]2.0e-2032.45Show/hide
Query:  DRFIWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSL-KANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSS
        D+  W+++ SGLFS  S  C      ++ E    Y  IWK   P+KVK +LW+V+  + NT + LQR  PFM +SPHWC LCK   E+  H+F+ C Y+ 
Subjt:  DRFIWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSL-KANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSS

Query:  RVWSKILS--LFGWSTAFVSNTSHPMAYTLTGHPFDHEKNLWLHFVRALFWSIWKESNHRTFQD--KAQSIHFFESLGFLVVTWCKLS
        +VW  +L   +  W T               G      K LW   ++A+ W++W E N R F D    ++   ++ + F    W  ++
Subjt:  RVWSKILS--LFGWSTAFVSNTSHPMAYTLTGHPFDHEKNLWLHFVRALFWSIWKESNHRTFQD--KAQSIHFFESLGFLVVTWCKLS

VVA39726.1 Hypothetical predicted protein, partial [Prunus dulcis]4.4e-2034.55Show/hide
Query:  DRFIWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSL-KANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSS
        D+ IW+++PSGLF+  SL   + N  +    P  Y  IWK   P KVK  +W+  L K NT + LQRR P++ ISPHWC LC  + +S  H+ + C +S 
Subjt:  DRFIWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSL-KANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSS

Query:  RVWSKILSLFGWSTAFVSNTSHPMAYTLTGHPFDHEKN---LWLHFVRALFWSIWKESNHRTFQD
        ++W  +L     +T +V        +++        K    LW   ++A+ W++W E N R F+D
Subjt:  RVWSKILSLFGWSTAFVSNTSHPMAYTLTGHPFDHEKN---LWLHFVRALFWSIWKESNHRTFQD

XP_030479135.1 uncharacterized protein LOC115696374 [Cannabis sativa]2.5e-2335.96Show/hide
Query:  IWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSL-KANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSSRVW
        IW+ + +G+FS+ S      N  +N   P   KS+WK +   +VK  +W V+  K N H+ LQRR PF+ ISP WCV CK + E   H+F+ C +SS++W
Subjt:  IWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSL-KANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSSRVW

Query:  SKILSLFGWSTAFVSNTSHPMAYTLTGHPFDHEKNLWLHFVRALFWSIWKESNHRTFQDKAQS-IHFFESLGFLVVTW
        S +L  FG   A   +    +   + G     +  LW   V A  W+IW E N R F+D   S I  ++ + F   TW
Subjt:  SKILSLFGWSTAFVSNTSHPMAYTLTGHPFDHEKNLWLHFVRALFWSIWKESNHRTFQDKAQS-IHFFESLGFLVVTW

XP_030505044.1 uncharacterized protein LOC115720016 [Cannabis sativa]3.6e-2237.2Show/hide
Query:  IWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSL-KANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSSRVW
        IW+ + +G+FS+ S      N  +N   P   KS+WK     +VK  +W V+  K N H+ LQRR PF+ ISP WCV CK S E   H+F+ C +SS++W
Subjt:  IWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSL-KANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSSRVW

Query:  SKILSLFGWSTAFVSNTSHPMAYTLTGHPFDHEKNLWLHFVRALFWSIWKESNHRTFQDKAQSI
        S +L  FG   A   +    +   + G     +  LW   V A  W+IW E N R F+D   S+
Subjt:  SKILSLFGWSTAFVSNTSHPMAYTLTGHPFDHEKNLWLHFVRALFWSIWKESNHRTFQDKAQSI

TrEMBL top hitse value%identityAlignment
A0A5E4FY07 PREDICTED: ribonuclease H9.5e-2132.45Show/hide
Query:  DRFIWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSL-KANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSS
        D+  W+++ SGLFS  S  C      ++ E    Y  IWK   P+KVK +LW+V+  + NT + LQR  PFM +SPHWC LCK   E+  H+F+ C Y+ 
Subjt:  DRFIWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSL-KANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSS

Query:  RVWSKILS--LFGWSTAFVSNTSHPMAYTLTGHPFDHEKNLWLHFVRALFWSIWKESNHRTFQD--KAQSIHFFESLGFLVVTWCKLS
        +VW  +L   +  W T               G      K LW   ++A+ W++W E N R F D    ++   ++ + F    W  ++
Subjt:  RVWSKILS--LFGWSTAFVSNTSHPMAYTLTGHPFDHEKNLWLHFVRALFWSIWKESNHRTFQD--KAQSIHFFESLGFLVVTWCKLS

A0A5E4GJ11 Reverse transcriptase domain-containing protein (Fragment)2.1e-2034.55Show/hide
Query:  DRFIWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSL-KANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSS
        D+ IW+++PSGLF+  SL   + N  +    P  Y  IWK   P KVK  +W+  L K NT + LQRR P++ ISPHWC LC  + +S  H+ + C +S 
Subjt:  DRFIWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSL-KANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSS

Query:  RVWSKILSLFGWSTAFVSNTSHPMAYTLTGHPFDHEKN---LWLHFVRALFWSIWKESNHRTFQD
        ++W  +L     +T +V        +++        K    LW   ++A+ W++W E N R F+D
Subjt:  RVWSKILSLFGWSTAFVSNTSHPMAYTLTGHPFDHEKN---LWLHFVRALFWSIWKESNHRTFQD

A0A803P465 Uncharacterized protein9.9e-2638.55Show/hide
Query:  DRFIWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSL-KANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSS
        D  IW+ +PSG+FS  S    M++ P         KS+WK   P KVK   W ++L K N H+ +Q+R PF+ ISP WCV CK S ES GH+F+ C +  
Subjt:  DRFIWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSL-KANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSS

Query:  RVWSKILSLFGWSTAFVSNTSHPMAYTLTGHPFDHEKNLWLHFVRALFWSIWKESNHRTFQDKAQS
        R+W  +L  FG S     + SH +A  L G     +  LW   + A  W++W E N R F+   +S
Subjt:  RVWSKILSLFGWSTAFVSNTSHPMAYTLTGHPFDHEKNLWLHFVRALFWSIWKESNHRTFQDKAQS

A0A803PZR8 Uncharacterized protein1.7e-2237.2Show/hide
Query:  IWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSL-KANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSSRVW
        IW+ + +G+FS+ S      N  +N   P   KS+WK     +VK  +W V+  K N H+ LQRR PF+ ISP WCV CK S E   H+F+ C +SS++W
Subjt:  IWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSL-KANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSSRVW

Query:  SKILSLFGWSTAFVSNTSHPMAYTLTGHPFDHEKNLWLHFVRALFWSIWKESNHRTFQDKAQSI
        S +L  FG   A   +    +   + G     +  LW   V A  W+IW E N R F+D   S+
Subjt:  SKILSLFGWSTAFVSNTSHPMAYTLTGHPFDHEKNLWLHFVRALFWSIWKESNHRTFQDKAQSI

A0A803QGT5 Uncharacterized protein1.2e-2335.96Show/hide
Query:  IWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSL-KANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSSRVW
        IW+ + +G+FS+ S      N  +N   P   KS+WK +   +VK  +W V+  K N H+ LQRR PF+ ISP WCV CK + E   H+F+ C +SS++W
Subjt:  IWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSL-KANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSSRVW

Query:  SKILSLFGWSTAFVSNTSHPMAYTLTGHPFDHEKNLWLHFVRALFWSIWKESNHRTFQDKAQS-IHFFESLGFLVVTW
        S +L  FG   A   +    +   + G     +  LW   V A  W+IW E N R F+D   S I  ++ + F   TW
Subjt:  SKILSLFGWSTAFVSNTSHPMAYTLTGHPFDHEKNLWLHFVRALFWSIWKESNHRTFQDKAQS-IHFFESLGFLVVTW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GACAGATTCATTTGGAGAATGGAACCATCTGGCCTCTTCTCAACCAGCTCCCTTTTATGTGACATGATGAATGGCCCCAAGAATACAGAAGCCCCACTGCTTTATAAATC
AATATGGAAAGATCTTTATCCAAAGAAGGTTAAATTTGTTTTATGGGAGGTGAGTCTCAAAGCTAACACACATGAGAATCTCCAAAGGAGGATGCCGTTCATGAGCATCT
CCCCTCATTGGTGCGTCCTTTGCAAGCATAGCAATGAATCCCAAGGCCACATATTTGTCTCTTGCAACTATTCCTCAAGGGTGTGGAGTAAAATTCTTTCACTTTTTGGA
TGGTCCACTGCCTTTGTCTCGAATACAAGCCACCCCATGGCATACACTCTTACAGGCCATCCTTTTGACCATGAGAAAAATCTCTGGTTGCATTTTGTTCGTGCACTATT
TTGGTCTATATGGAAGGAAAGTAATCATAGAACTTTCCAAGACAAGGCCCAATCGATTCACTTTTTTGAGTCTTTAGGTTTCTTGGTCGTTACATGGTGTAAACTTTCC
mRNA sequenceShow/hide mRNA sequence
GACAGATTCATTTGGAGAATGGAACCATCTGGCCTCTTCTCAACCAGCTCCCTTTTATGTGACATGATGAATGGCCCCAAGAATACAGAAGCCCCACTGCTTTATAAATC
AATATGGAAAGATCTTTATCCAAAGAAGGTTAAATTTGTTTTATGGGAGGTGAGTCTCAAAGCTAACACACATGAGAATCTCCAAAGGAGGATGCCGTTCATGAGCATCT
CCCCTCATTGGTGCGTCCTTTGCAAGCATAGCAATGAATCCCAAGGCCACATATTTGTCTCTTGCAACTATTCCTCAAGGGTGTGGAGTAAAATTCTTTCACTTTTTGGA
TGGTCCACTGCCTTTGTCTCGAATACAAGCCACCCCATGGCATACACTCTTACAGGCCATCCTTTTGACCATGAGAAAAATCTCTGGTTGCATTTTGTTCGTGCACTATT
TTGGTCTATATGGAAGGAAAGTAATCATAGAACTTTCCAAGACAAGGCCCAATCGATTCACTTTTTTGAGTCTTTAGGTTTCTTGGTCGTTACATGGTGTAAACTTTCC
Protein sequenceShow/hide protein sequence
DRFIWRMEPSGLFSTSSLLCDMMNGPKNTEAPLLYKSIWKDLYPKKVKFVLWEVSLKANTHENLQRRMPFMSISPHWCVLCKHSNESQGHIFVSCNYSSRVWSKILSLFG
WSTAFVSNTSHPMAYTLTGHPFDHEKNLWLHFVRALFWSIWKESNHRTFQDKAQSIHFFESLGFLVVTWCKLS