CuGenDBv2

MC01g0046 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC01g0046
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: zf-RVT domain-containing protein
Genome location: MC01:3634420..3634935
RNA-Seq Expression: MC01g0046
Synteny: MC01g0046
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0009987 - cellular process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0016787 - hydrolase activity (molecular function)
InterPro domains:
IPR026960 - Reverse transcriptase zinc-binding domain
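
The genome location in this record has the form CHR:START..END. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention), the span covered by the gene can be computed as shown here. The helper names parse_location and span_length are hypothetical and are not part of CuGenDBv2.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'CHR:START..END' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("MC01:3634420..3634935")
print(chrom, start, end, span_length(start, end))
```

If the coordinates were instead 0-based and half-open, the length would be end - start, so the convention should be confirmed against the database documentation before relying on the result.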


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035739.1 hypothetical protein E6C27_scaffold403G00100 [Cucumis melo var. makuwa] (e value: 2.07e-17; identity: 34.68%)
Query:  GSFSVKSYSWFLDSALKLPKKLHLSLWKLDSPK-ISVLSWIVLFGNLNTTNILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFCLFGL
        G FS KS S  L +A  + K+L  ++ +  SP+ I++L WI++F  + ++ ILQKK P ++  PS+C LC   S+   H+ L+C  ++  WER+F LF L
Subjt:  GSFSVKSYSWFLDSALKLPKKLHLSLWKLDSPK-ISVLSWIVLFGNLNTTNILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFCLFGL

Query:  SWVLSNSATKNLKQLLFGPALPPKARLIWSNAVKPLLADGL--RETKSFLNR-KTWFEAFDLAKYKKSLWCSI
         W   +S + ++ QLL G  LP   R+IW    K LL +    R  + F ++ +   E    A    + WCS+
Subjt:  SWVLSNSATKNLKQLLFGPALPPKARLIWSNAVKPLLADGL--RETKSFLNR-KTWFEAFDLAKYKKSLWCSI

KAA0062564.1 GPI-anchor transamidase isoform X1 [Cucumis melo var. makuwa] (e value: 8.87e-14; identity: 35.25%)
Query:  ILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFCLFGLSWVLSNSATKNLKQLLFGPALPPKARLIWSNAVKPLLADGLRETKSFLNRK
        ++Q+++  S L PS C+LC    E  +  L  C ++   WE L  LFG+ W    S   N+KQ+L G +L    RLIW N  K LL+D   E    + R 
Subjt:  ILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFCLFGLSWVLSNSATKNLKQLLFGPALPPKARLIWSNAVKPLLADGLRETKSFLNRK

Query:  T---WFEAFDLAKYKKSLWCSI
            W E  D+AK   + WC +
Subjt:  T---WFEAFDLAKYKKSLWCSI

TYK21876.1 hypothetical protein E5676_scaffold494G00090 [Cucumis melo var. makuwa] (e value: 8.18e-19; identity: 35.26%)
Query:  GSFSVKSYSWFLDSALKLPKKLHLSLWKLDSPK-ISVLSWIVLFGNLNTTNILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFCLFGL
        G FS KS S  L +A  + K+L  ++ +  SP+ I++L WI++F  +N++ ILQKK P ++  PS+C LC   S+   H+ L+C  ++  WER+F LF L
Subjt:  GSFSVKSYSWFLDSALKLPKKLHLSLWKLDSPK-ISVLSWIVLFGNLNTTNILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFCLFGL

Query:  SWVLSNSATKNLKQLLFGPALPPKARLIWSNAVKPLLADGL--RETKSFLNR-KTWFEAFDLAKYKKSLWCSI
         W   +S + ++ QLL G  LP   R+IW    K LL +    R  + F ++ +   E    A    + WCS+
Subjt:  SWVLSNSATKNLKQLLFGPALPPKARLIWSNAVKPLLADGL--RETKSFLNR-KTWFEAFDLAKYKKSLWCSI

XP_022153214.1 uncharacterized protein LOC111020765 [Momordica charantia] (e value: 1.75e-27; identity: 40.37%)
Query:  SALKLPKKLHLSLWKLDSPK-ISVLSWIVLFGNLNTTNILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFCLFGLSWVLSNSATKNLK
        S   +PK+   +LWK  SP+ ++V  WI+  G LNT +I+QKK P   L PS C LC+ + E   H+   C FA+  W  LF  F + W     A  N+ 
Subjt:  SALKLPKKLHLSLWKLDSPK-ISVLSWIVLFGNLNTTNILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFCLFGLSWVLSNSATKNLK

Query:  QLLFGPA-LPPKARLIWSNAVKPLLADGLRETKSFL---NRKTWFEAFDLAKYKKSLWCSI
        QLL GP  L    R +W N VK LL++   E  S L    R+ + E+F  AK+K SLWCS+
Subjt:  QLLFGPA-LPPKARLIWSNAVKPLLADGLRETKSFL---NRKTWFEAFDLAKYKKSLWCSI

XP_038903695.1 uncharacterized protein LOC120090219 [Benincasa hispida] (e value: 2.43e-25; identity: 37.28%)
Query:  FSVKSYSWFLDSALKLPKKLHLSLWKLDSPK-ISVLSWIVLFGNLNTTNILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFCLFGLSW
        ++VKS    L     L K +  ++WK  SP+ +++L WI+LFG LN   +LQKK P   L P+VC  C  +SE  +H+   C +++  W +L C F L  
Subjt:  FSVKSYSWFLDSALKLPKKLHLSLWKLDSPK-ISVLSWIVLFGNLNTTNILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFCLFGLSW

Query:  VLSNSATKNLKQLLFGPALPPKARLIWSNAVKPLLADGL--RETKSFLNRKTWFE-AFDLAKYKKSLWC
         L N    N+ QLL  P      RL+W NAVK LLAD    R  + F N+ T  +   + A+ + S WC
Subjt:  VLSNSATKNLKQLLFGPALPPKARLIWSNAVKPLLADGL--RETKSFLNRKTWFE-AFDLAKYKKSLWC

TrEMBL top hits (e value, % identity, alignment)
A0A438FM03 Putative ribonuclease H protein (e value: 3.81e-13; identity: 31.82%)
Query:  GSFSVKSYSWFL----DSALKLPKKLHLSLWKLDSP-KISVLSWIVLFGNLNTTNILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFC
        GSFSVKS+ + L    +  + LP K    LW    P K+  L+W+V  G +NT + LQ + P   L P  C LC  N E   H+ L C      W RLF 
Subjt:  GSFSVKSYSWFL----DSALKLPKKLHLSLWKLDSP-KISVLSWIVLFGNLNTTNILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFC

Query:  LFGLSWVLSNSATKNLKQLLFGPALPPKARLIWSNAVKPLLADGLRETKSFL---NRKTWFEAFDLAKYKKSLWCS
        L G+ WV   S    L     G     + +++W  A   L+    +E  + +     +T    +DL ++  SLW S
Subjt:  LFGLSWVLSNSATKNLKQLLFGPALPPKARLIWSNAVKPLLADGLRETKSFL---NRKTWFEAFDLAKYKKSLWCS

A0A438JRY4 Putative ribonuclease H protein (e value: 1.96e-13; identity: 31.82%)
Query:  GSFSVKSYSWFL----DSALKLPKKLHLSLWKLDSP-KISVLSWIVLFGNLNTTNILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFC
        GSFSVKS+ + L    +  + LP K    LW    P K+ VL+W+V  G +NT + LQ + P   L P  C LC  N E   H+ L C      W +LF 
Subjt:  GSFSVKSYSWFL----DSALKLPKKLHLSLWKLDSP-KISVLSWIVLFGNLNTTNILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFC

Query:  LFGLSWVLSNSATKNLKQLLFGPALPPKARLIWSNAVKPLLADGLRETKSFL---NRKTWFEAFDLAKYKKSLWCS
        L G+ WV   S    L     G     + +++W  A   L+    +E  + +     +T    +DL ++  SLW S
Subjt:  LFGLSWVLSNSATKNLKQLLFGPALPPKARLIWSNAVKPLLADGLRETKSFL---NRKTWFEAFDLAKYKKSLWCS

A0A438KG54 Protein RETICULATA, chloroplastic (e value: 1.16e-13; identity: 32.95%)
Query:  GSFSVKSYSWFLDSALK----LPKKLHLSLWKLDSP-KISVLSWIVLFGNLNTTNILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFC
        GSFSVKS+ + L   L     LP K    LW    P K+  L+W+V  G +NT + LQ + P   L P  C LC  N E   H+ L C      W RLF 
Subjt:  GSFSVKSYSWFLDSALK----LPKKLHLSLWKLDSP-KISVLSWIVLFGNLNTTNILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFC

Query:  LFGLSWVLSNSATKNLKQLLFGPALPPKARLIWSNAVKPLLADGLRETKSFL---NRKTWFEAFDLAKYKKSLWCS
        L G+SWV   S    L     G     + +++W  A   L+    +E  + +     +T    +DL ++  SLW S
Subjt:  LFGLSWVLSNSATKNLKQLLFGPALPPKARLIWSNAVKPLLADGLRETKSFL---NRKTWFEAFDLAKYKKSLWCS

A0A5A7V5N8 GPI-anchor transamidase isoform X1 (e value: 4.29e-14; identity: 35.25%)
Query:  ILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFCLFGLSWVLSNSATKNLKQLLFGPALPPKARLIWSNAVKPLLADGLRETKSFLNRK
        ++Q+++  S L PS C+LC    E  +  L  C ++   WE L  LFG+ W    S   N+KQ+L G +L    RLIW N  K LL+D   E    + R 
Subjt:  ILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFCLFGLSWVLSNSATKNLKQLLFGPALPPKARLIWSNAVKPLLADGLRETKSFLNRK

Query:  T---WFEAFDLAKYKKSLWCSI
            W E  D+AK   + WC +
Subjt:  T---WFEAFDLAKYKKSLWCSI

A0A6J1DIE2 uncharacterized protein LOC111020765 (e value: 8.49e-28; identity: 40.37%)
Query:  SALKLPKKLHLSLWKLDSPK-ISVLSWIVLFGNLNTTNILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFCLFGLSWVLSNSATKNLK
        S   +PK+   +LWK  SP+ ++V  WI+  G LNT +I+QKK P   L PS C LC+ + E   H+   C FA+  W  LF  F + W     A  N+ 
Subjt:  SALKLPKKLHLSLWKLDSPK-ISVLSWIVLFGNLNTTNILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFCLFGLSWVLSNSATKNLK

Query:  QLLFGPA-LPPKARLIWSNAVKPLLADGLRETKSFL---NRKTWFEAFDLAKYKKSLWCSI
        QLL GP  L    R +W N VK LL++   E  S L    R+ + E+F  AK+K SLWCS+
Subjt:  QLLFGPA-LPPKARLIWSNAVKPLLADGLRETKSFL---NRKTWFEAFDLAKYKKSLWCSI

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
GGATCTTTCTCCGTCAAGTCCTACTCATGGTTTTTGGATTCTGCACTTAAATTGCCAAAAAAGCTCCATTTGTCTCTTTGGAAATTAGACAGTCCAAAAATCAGTGTTCT
ATCTTGGATAGTTCTCTTTGGCAATCTAAATACTACAAATATTCTTCAAAAGAAGATGCCGCCTTCACTCCTACAGCCCTCAGTTTGTACTCTTTGTTCAGCAAACAGTG
AATGTCAAATACATGTGCTATTATTTTGTCAATTTGCAGCAAGTTTTTGGGAGAGACTCTTCTGTCTCTTCGGCCTCAGCTGGGTTCTCTCGAATTCAGCAACAAAAAAT
TTGAAACAGCTCCTTTTTGGTCCGGCTCTACCCCCAAAAGCTCGTTTGATTTGGTCTAATGCAGTTAAACCATTGCTTGCCGATGGTTTGAGAGAAACCAAAAGCTTTTT
GAATAGAAAAACATGGTTCGAAGCTTTTGATTTAGCTAAGTATAAGAAATCCCTTTGGTGCTCGATC
mRNA sequence
GGATCTTTCTCCGTCAAGTCCTACTCATGGTTTTTGGATTCTGCACTTAAATTGCCAAAAAAGCTCCATTTGTCTCTTTGGAAATTAGACAGTCCAAAAATCAGTGTTCT
ATCTTGGATAGTTCTCTTTGGCAATCTAAATACTACAAATATTCTTCAAAAGAAGATGCCGCCTTCACTCCTACAGCCCTCAGTTTGTACTCTTTGTTCAGCAAACAGTG
AATGTCAAATACATGTGCTATTATTTTGTCAATTTGCAGCAAGTTTTTGGGAGAGACTCTTCTGTCTCTTCGGCCTCAGCTGGGTTCTCTCGAATTCAGCAACAAAAAAT
TTGAAACAGCTCCTTTTTGGTCCGGCTCTACCCCCAAAAGCTCGTTTGATTTGGTCTAATGCAGTTAAACCATTGCTTGCCGATGGTTTGAGAGAAACCAAAAGCTTTTT
GAATAGAAAAACATGGTTCGAAGCTTTTGATTTAGCTAAGTATAAGAAATCCCTTTGGTGCTCGATC
Protein sequence
GSFSVKSYSWFLDSALKLPKKLHLSLWKLDSPKISVLSWIVLFGNLNTTNILQKKMPPSLLQPSVCTLCSANSECQIHVLLFCQFAASFWERLFCLFGLSWVLSNSATKN
LKQLLFGPALPPKARLIWSNAVKPLLADGLRETKSFLNRKTWFEAFDLAKYKKSLWCSI