; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS016016 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS016016
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionzf-RVT domain-containing protein
Genome locationscaffold114:322472..323170
RNA-Seq ExpressionMS016016
SyntenyMS016016
Gene Ontology termsGO:0009987 - cellular process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035739.1 hypothetical protein E6C27_scaffold403G00100 [Cucumis melo var. makuwa]2.7e-3035.96Show/hide
Query:  RGLKDEEFVGFSELLSCISSVTLSEESDARRWSLENSKLFSVSSLCKLGGADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPYTALQ
        R L+DEE   F  LL  +S+  +    D R WS+E+   FS  SL           K   S++  S SP+R+N+L WI +   V ++E+LQKKSP   + 
Subjt:  RGLKDEEFVGFSELLSCISSVTLSEESDARRWSLENSKLFSVSSLCKLGGADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPYTALQ

Query:  PSICFLCAN-RENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNIFQILHRVRLEPKANLLW-IMVLRRLFQSYFERNQRIFKDSSRSSVDCFNII
        PSIC LC    +NL H+F +CP +S  W   F +FNL W +D+  + ++ Q+L    L     ++W I+    L + + ERNQRIF D +R   +  +  
Subjt:  PSICFLCAN-RENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNIFQILHRVRLEPKANLLW-IMVLRRLFQSYFERNQRIFKDSSRSSVDCFNII

Query:  KFKASQCCALSDSFFPYSPNLICMNWEA
           A+  C+L   F  YS   IC+NW A
Subjt:  KFKASQCCALSDSFFPYSPNLICMNWEA

RVX22527.1 hypothetical protein CK203_012735 [Vitis vinifera]2.4e-2333.48Show/hide
Query:  RGLKDEEFVGFSELLSCISSVTLSEESDARRWSLENSKLFSVSS----LCKLGGADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPY
        R L D E      L+S +SSV  S  SD+R WSL +S LFSV S    L K+    L F       LWSS  P +V +L W+  +GKVNT + L+ + PY
Subjt:  RGLKDEEFVGFSELLSCISSVTLSEESDARRWSLENSKLFSVSS----LCKLGGADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPY

Query:  TALQPSICFLC-ANRENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNIFQILHRVRLEPKANLLW-IMVLRRLFQSYFERNQRIFKDSSRSSVDC
         AL P  C LC  N E+++H+F  CP     W   F +  L W+        +      +    +   LW I  L  ++  + ERN RIF+D  R+    
Subjt:  TALQPSICFLC-ANRENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNIFQILHRVRLEPKANLLW-IMVLRRLFQSYFERNQRIFKDSSRSSVDC

Query:  FNIIKFKASQCCALSDSFFPYSPNLICMNW
        +++I F +S   + +++F     N++ +NW
Subjt:  FNIIKFKASQCCALSDSFFPYSPNLICMNW

TYK21876.1 hypothetical protein E5676_scaffold494G00090 [Cucumis melo var. makuwa]8.3e-3236.09Show/hide
Query:  RGLKDEEFVGFSELLSCISSVTLSEESDARRWSLENSKLFSVSSLCKLGGADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPYTALQ
        R L+DEE   F  LL  +S+  +    D R WS+E+   FS  SL           K   S++  S SP+R+N+L WI +   VN++E+LQKKSP   + 
Subjt:  RGLKDEEFVGFSELLSCISSVTLSEESDARRWSLENSKLFSVSSLCKLGGADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPYTALQ

Query:  PSICFLCAN-RENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNIFQILHRVRLEPKANLLW-IMVLRRLFQSYFERNQRIFKDSSRSSVDCFNII
        PSIC LC    +NL H+F +CP +S  W   F +FNL W +D+  + ++ Q+L    L     ++W I+    L + + ERNQRIF D +R   +  +  
Subjt:  PSICFLCAN-RENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNIFQILHRVRLEPKANLLW-IMVLRRLFQSYFERNQRIFKDSSRSSVDCFNII

Query:  KFKASQCCALSDSFFPYSPNLICMNWEAFI
           A+  C+L   F  YS   IC+NW  F+
Subjt:  KFKASQCCALSDSFFPYSPNLICMNWEAFI

XP_022153214.1 uncharacterized protein LOC111020765 [Momordica charantia]1.5e-3342.86Show/hide
Query:  GADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPYTALQPSICFLCA-NRENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNI
        G++    K   ++LW + SP+RVN+  WI   GK+NTA+++QKKSP  AL PS C LC  + E+ +H+FF C +AS CW   F  FN+ W +D  A  N+
Subjt:  GADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPYTALQPSICFLCA-NRENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNI

Query:  FQILH-RVRLEPKANLLWIMVLRRLF-QSYFERNQRIFKDSSRSSVDCFNIIKFKASQCCALSDSFFPYSPNLICMNWEAFI
        +Q+LH    L      LW+ V++ L  + +FERN R+F++  R   + F   KFKAS  C+L DSF  +SP++I  NW AFI
Subjt:  FQILH-RVRLEPKANLLWIMVLRRLF-QSYFERNQRIFKDSSRSSVDCFNIIKFKASQCCALSDSFFPYSPNLICMNWEAFI

XP_038903695.1 uncharacterized protein LOC120090219 [Benincasa hispida]1.0e-3439.83Show/hide
Query:  RGLKDEEFVGFSELLSCISSVTLSEESDARRWSLENSKLFSVSSLCKLGGADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPYTALQ
        R LKDEE   F +LL  I S + S   D R WS+ N+  ++V SL           K   S++W + SP+RVN+L WI L G++N AEVLQKK P  +L 
Subjt:  RGLKDEEFVGFSELLSCISSVTLSEESDARRWSLENSKLFSVSSLCKLGGADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPYTALQ

Query:  PSICFLCANRENLN-HVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNIFQILHRVRLEPKANLLWIMVLRRLFQS-YFERNQRIFKDSSRSSVDCFNII
        P++C  C +   ++ H+FF CPY+S CW      FNL          N+FQ+L R        LLW   ++ L    +FERNQRIF + + S  D     
Subjt:  PSICFLCANRENLN-HVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNIFQILHRVRLEPKANLLWIMVLRRLFQS-YFERNQRIFKDSSRSSVDCFNII

Query:  KFKASQCCALSDSFFPYSPNLICMNWEAFIT
        + +AS  C LSD F  YS +   +NWEAFI+
Subjt:  KFKASQCCALSDSFFPYSPNLICMNWEAFIT

TrEMBL top hitse value%identityAlignment
A0A438KG54 Protein RETICULATA, chloroplastic3.4e-2332.89Show/hide
Query:  RGLKDEEFVGFSELLSCISSVTLS-EESDARRWSLENSKLFSVSSLCKLGGADLRFSK-AHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPYTA
        R L D E      L+S + SV LS   SD+R WSL +S  FSV S       DL   K      LWSS  P +V  L W+  HGKVNT + LQ + PY A
Subjt:  RGLKDEEFVGFSELLSCISSVTLS-EESDARRWSLENSKLFSVSSLCKLGGADLRFSK-AHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPYTA

Query:  LQPSICFLC-ANRENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNIFQILHRVRLEPKANLLW-IMVLRRLFQSYFERNQRIFKDSSRSSVDCFN
        L P  C LC  N E+++H+F  CP  +  W   F +  ++W+        +      +    +  +LW I  L  ++  + ERN RIF+D  R+    ++
Subjt:  LQPSICFLC-ANRENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNIFQILHRVRLEPKANLLW-IMVLRRLFQSYFERNQRIFKDSSRSSVDCFN

Query:  IIKFKASQCCALSDSFFPYSPNLICMNW
        +I+F +S   + +++F     +++ +NW
Subjt:  IIKFKASQCCALSDSFFPYSPNLICMNW

A0A438KMW4 zf-RVT domain-containing protein1.2e-2333.48Show/hide
Query:  RGLKDEEFVGFSELLSCISSVTLSEESDARRWSLENSKLFSVSS----LCKLGGADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPY
        R L D E      L+S +SSV  S  SD+R WSL +S LFSV S    L K+    L F       LWSS  P +V +L W+  +GKVNT + L+ + PY
Subjt:  RGLKDEEFVGFSELLSCISSVTLSEESDARRWSLENSKLFSVSS----LCKLGGADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPY

Query:  TALQPSICFLC-ANRENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNIFQILHRVRLEPKANLLW-IMVLRRLFQSYFERNQRIFKDSSRSSVDC
         AL P  C LC  N E+++H+F  CP     W   F +  L W+        +      +    +   LW I  L  ++  + ERN RIF+D  R+    
Subjt:  TALQPSICFLC-ANRENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNIFQILHRVRLEPKANLLW-IMVLRRLFQSYFERNQRIFKDSSRSSVDC

Query:  FNIIKFKASQCCALSDSFFPYSPNLICMNW
        +++I F +S   + +++F     N++ +NW
Subjt:  FNIIKFKASQCCALSDSFFPYSPNLICMNW

A0A5A7T2Y0 zf-RVT domain-containing protein1.3e-3035.96Show/hide
Query:  RGLKDEEFVGFSELLSCISSVTLSEESDARRWSLENSKLFSVSSLCKLGGADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPYTALQ
        R L+DEE   F  LL  +S+  +    D R WS+E+   FS  SL           K   S++  S SP+R+N+L WI +   V ++E+LQKKSP   + 
Subjt:  RGLKDEEFVGFSELLSCISSVTLSEESDARRWSLENSKLFSVSSLCKLGGADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPYTALQ

Query:  PSICFLCAN-RENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNIFQILHRVRLEPKANLLW-IMVLRRLFQSYFERNQRIFKDSSRSSVDCFNII
        PSIC LC    +NL H+F +CP +S  W   F +FNL W +D+  + ++ Q+L    L     ++W I+    L + + ERNQRIF D +R   +  +  
Subjt:  PSICFLCAN-RENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNIFQILHRVRLEPKANLLW-IMVLRRLFQSYFERNQRIFKDSSRSSVDCFNII

Query:  KFKASQCCALSDSFFPYSPNLICMNWEA
           A+  C+L   F  YS   IC+NW A
Subjt:  KFKASQCCALSDSFFPYSPNLICMNWEA

A0A5D3DE60 zf-RVT domain-containing protein4.0e-3236.09Show/hide
Query:  RGLKDEEFVGFSELLSCISSVTLSEESDARRWSLENSKLFSVSSLCKLGGADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPYTALQ
        R L+DEE   F  LL  +S+  +    D R WS+E+   FS  SL           K   S++  S SP+R+N+L WI +   VN++E+LQKKSP   + 
Subjt:  RGLKDEEFVGFSELLSCISSVTLSEESDARRWSLENSKLFSVSSLCKLGGADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPYTALQ

Query:  PSICFLCAN-RENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNIFQILHRVRLEPKANLLW-IMVLRRLFQSYFERNQRIFKDSSRSSVDCFNII
        PSIC LC    +NL H+F +CP +S  W   F +FNL W +D+  + ++ Q+L    L     ++W I+    L + + ERNQRIF D +R   +  +  
Subjt:  PSICFLCAN-RENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNIFQILHRVRLEPKANLLW-IMVLRRLFQSYFERNQRIFKDSSRSSVDCFNII

Query:  KFKASQCCALSDSFFPYSPNLICMNWEAFI
           A+  C+L   F  YS   IC+NW  F+
Subjt:  KFKASQCCALSDSFFPYSPNLICMNWEAFI

A0A6J1DIE2 uncharacterized protein LOC1110207657.3e-3442.86Show/hide
Query:  GADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPYTALQPSICFLCA-NRENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNI
        G++    K   ++LW + SP+RVN+  WI   GK+NTA+++QKKSP  AL PS C LC  + E+ +H+FF C +AS CW   F  FN+ W +D  A  N+
Subjt:  GADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPYTALQPSICFLCA-NRENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNI

Query:  FQILH-RVRLEPKANLLWIMVLRRLF-QSYFERNQRIFKDSSRSSVDCFNIIKFKASQCCALSDSFFPYSPNLICMNWEAFI
        +Q+LH    L      LW+ V++ L  + +FERN R+F++  R   + F   KFKAS  C+L DSF  +SP++I  NW AFI
Subjt:  FQILH-RVRLEPKANLLWIMVLRRLF-QSYFERNQRIFKDSSRSSVDCFNIIKFKASQCCALSDSFFPYSPNLICMNWEAFI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATAAGGGGGTTGAAAGATGAAGAGTTTGTAGGGTTTTCGGAGTTACTATCATGTATTTCTTCGGTTACTCTTTCTGAAGAAAGTGATGCTCGTCGATGGAGCTTG
GAAAATTCGAAACTTTTCTCAGTTTCCTCTCTTTGTAAGCTCGGAGGAGCTGATTTGCGATTTTCAAAAGCTCATTTATCTTCTCTTTGGAGTTCGGCCAGCCCA
AAACGTGTAAACTTGTTAACTTGGATTCCCCTTCACGGCAAGGTTAATACAGCTGAGGTTCTACAAAAGAAGTCCCCTTATACGGCTTTACAGCCATCCATTTGT
TTTTTATGTGCTAACAGGGAGAACTTGAATCATGTGTTTTTCTTCTGTCCATATGCTTCTACATGTTGGTTTTCCTTCTTCCAGATGTTCAATTTGGCTTGGATT
TGGGATGCTGTGGCTGCTAGAAACATTTTTCAGATTCTCCATAGAGTTAGATTGGAGCCTAAAGCTAATCTACTATGGATAATGGTTTTAAGGCGATTATTTCAG
AGTTATTTTGAAAGGAACCAAAGGATTTTCAAAGATTCCAGCCGTTCGTCAGTTGACTGCTTCAATATTATCAAATTTAAAGCTTCTCAATGCTGTGCTCTTTCC
GATTCTTTCTTCCCCTATTCTCCTAATTTGATTTGTATGAATTGGGAGGCTTTTATAACTCCATTG
mRNA sequenceShow/hide mRNA sequence
ATAAGGGGGTTGAAAGATGAAGAGTTTGTAGGGTTTTCGGAGTTACTATCATGTATTTCTTCGGTTACTCTTTCTGAAGAAAGTGATGCTCGTCGATGGAGCTTG
GAAAATTCGAAACTTTTCTCAGTTTCCTCTCTTTGTAAGCTCGGAGGAGCTGATTTGCGATTTTCAAAAGCTCATTTATCTTCTCTTTGGAGTTCGGCCAGCCCA
AAACGTGTAAACTTGTTAACTTGGATTCCCCTTCACGGCAAGGTTAATACAGCTGAGGTTCTACAAAAGAAGTCCCCTTATACGGCTTTACAGCCATCCATTTGT
TTTTTATGTGCTAACAGGGAGAACTTGAATCATGTGTTTTTCTTCTGTCCATATGCTTCTACATGTTGGTTTTCCTTCTTCCAGATGTTCAATTTGGCTTGGATT
TGGGATGCTGTGGCTGCTAGAAACATTTTTCAGATTCTCCATAGAGTTAGATTGGAGCCTAAAGCTAATCTACTATGGATAATGGTTTTAAGGCGATTATTTCAG
AGTTATTTTGAAAGGAACCAAAGGATTTTCAAAGATTCCAGCCGTTCGTCAGTTGACTGCTTCAATATTATCAAATTTAAAGCTTCTCAATGCTGTGCTCTTTCC
GATTCTTTCTTCCCCTATTCTCCTAATTTGATTTGTATGAATTGGGAGGCTTTTATAACTCCATTG
Protein sequenceShow/hide protein sequence
IRGLKDEEFVGFSELLSCISSVTLSEESDARRWSLENSKLFSVSSLCKLGGADLRFSKAHLSSLWSSASPKRVNLLTWIPLHGKVNTAEVLQKKSPYTALQPSIC
FLCANRENLNHVFFFCPYASTCWFSFFQMFNLAWIWDAVAARNIFQILHRVRLEPKANLLWIMVLRRLFQSYFERNQRIFKDSSRSSVDCFNIIKFKASQCCALS
DSFFPYSPNLICMNWEAFITPL