CuGenDBv2

Moc06g16550 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g16550
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr6:13022067..13022669
RNA-Seq Expression: Moc06g16550
Synteny: Moc06g16550
Gene Ontology terms: GO:0044237 - cellular metabolic process (biological process)
                     GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
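
The genome location in this record is a contig name followed by start and end coordinates. A minimal sketch of reading it in Python, assuming the coordinates are 1-based and inclusive (a common convention for the `start..end` notation, not something the record states); `parse_location` is a hypothetical helper name:

```python
import re

def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)

contig, start, end = parse_location("chr6:13022067..13022669")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the span is 603 bp, which is divisible by three (201 codons).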


Homology
GenBank top hits (hit, e value, % identity, alignment)
EXB38291.1 hypothetical protein L484_013924 [Morus notabilis] | e value 5.7e-29 | identity 54.9%
Query:  DLRLRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTVRD
        +L  R++E+P+F+GENPD W  R  RYF +N++T+ EKL+ AV+ L+GEALAW QWE+ ++PI++W   +L+LL+RFRP  EG+LC++F++++QETTVRD
Subjt:  DLRLRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTVRD

Query:  YR
        YR
Subjt:  YR

TXG60193.1 hypothetical protein EZV62_014766 [Acer yangbiense] | e value 1.3e-28 | identity 35.29%
Query:  MVATRMESRVEAVEQQISGVVSAIEDSRESWKKEMALFKDEMRLWMTAVSQRMGMEEK----TRDDKGKGLEKIDVGEKTPQSIATSSDGSNQLTNALAP
        M    M+SR++ +E+ +  V       RE  ++E+   KDE+          + M+E+         GKG E               + GS   +N    
Subjt:  MVATRMESRVEAVEQQISGVVSAIEDSRESWKKEMALFKDEMRLWMTAVSQRMGMEEK----TRDDKGKGLEKIDVGEKTPQSIATSSDGSNQLTNALAP

Query:  VFDLRLRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTV
          D R RKLE+P+F+G NPD W+ +  RYF + R  ++EKLEA+V+  +G+AL W+QWE +K P+  WEE +LL+L++FR T EG+L ++F+A++Q+ TV
Subjt:  VFDLRLRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTV

Query:  RDYR
        ++YR
Subjt:  RDYR

XP_022897442.1 uncharacterized protein LOC111411108 [Olea europaea var. sylvestris] | e value 2.9e-33 | identity 39.9%
Query:  MESRVEAVEQQISGVVSAIEDSRESWKKEMALFKDEMRLWMTAVSQRMGMEEKTRDDKGKGLEKIDVGEKTPQSIATSSDGSNQLTNALAPV---FDLRL
        +  RV  +EQ++      ++    S + E  L    M      + ++   +EK R +KG   EK          +  S   SNQ     A      + R+
Subjt:  MESRVEAVEQQISGVVSAIEDSRESWKKEMALFKDEMRLWMTAVSQRMGMEEKTRDDKGKGLEKIDVGEKTPQSIATSSDGSNQLTNALAPV---FDLRL

Query:  RKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTVRDYR
        R+LE+P+FEG +PD W+ RV RYF +NRL+++EKLEAA +C DGEALAW QWEER+ P++ WE+ +  LL+RFRP+ EG+LC +F++++Q TTVR+YR
Subjt:  RKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTVRDYR

XP_024017591.1 uncharacterized protein LOC112090471 [Morus notabilis] | e value 1.8e-27 | identity 39.15%
Query:  EQQISGVVSAIEDSRESWKKEMALFKDEMRLWMTAVSQRMGME--EKTRDDKGKGLEKIDVGEKTPQSIATSSDGSNQLTNALAPVFDLRLRKLEVPIFE
        E+ +  V S  ED  +  + E  L  + + + M  +S++ G E  E    D G   E+     +T    A   +G +          + R R++E+P+F+
Subjt:  EQQISGVVSAIEDSRESWKKEMALFKDEMRLWMTAVSQRMGME--EKTRDDKGKGLEKIDVGEKTPQSIATSSDGSNQLTNALAPVFDLRLRKLEVPIFE

Query:  GENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTVRDYR
        GENPD W+ R  RYF +NRLTD EKL+ AV+ L+GEALAW QWE+R+  ++ W E +  +L+RF  T EGTLC++F+++ QETTVR+YR
Subjt:  GENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTVRDYR

XP_038904464.1 uncharacterized protein LOC120090832 [Benincasa hispida] | e value 1.3e-28 | identity 36.77%
Query:  RMESRVEAVEQQISGVVSAIEDSRESWKKEMALFKDEMRLWM-----TAVSQRM-------------------GMEEKTRDDKGKGLEKIDVGEKTPQSI
        +ME+R+ AVE+Q++ +   +E+  +   +EM    + M +         + Q+M                   G  EKT  D+GK   + DVGE+     
Subjt:  RMESRVEAVEQQISGVVSAIEDSRESWKKEMALFKDEMRLWM-----TAVSQRM-------------------GMEEKTRDDKGKGLEKIDVGEKTPQSI

Query:  ATSSDGSNQLTNALAPVFDLRLRKLEVPIFE---GENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRP
                        +FD+RLRKLE+PIF+   GE+P  W HRV RYF +NRL++ +K+EAA+LCL+GEAL WHQWEE + P+ TW +F+  LL RF P
Subjt:  ATSSDGSNQLTNALAPVFDLRLRKLEVPIFE---GENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRP

Query:  TLEGTLCDRFMAVKQETTVRDYR
          E     +F+ +KQ+ +VR YR
Subjt:  TLEGTLCDRFMAVKQETTVRDYR

TrEMBL top hits (hit, e value, % identity, alignment)
A0A5C7HSW3 Chromo domain-containing protein | e value 6.1e-29 | identity 35.29%
Query:  MVATRMESRVEAVEQQISGVVSAIEDSRESWKKEMALFKDEMRLWMTAVSQRMGMEEK----TRDDKGKGLEKIDVGEKTPQSIATSSDGSNQLTNALAP
        M    M+SR++ +E+ +  V       RE  ++E+   KDE+          + M+E+         GKG E               + GS   +N    
Subjt:  MVATRMESRVEAVEQQISGVVSAIEDSRESWKKEMALFKDEMRLWMTAVSQRMGMEEK----TRDDKGKGLEKIDVGEKTPQSIATSSDGSNQLTNALAP

Query:  VFDLRLRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTV
          D R RKLE+P+F+G NPD W+ +  RYF + R  ++EKLEA+V+  +G+AL W+QWE +K P+  WEE +LL+L++FR T EG+L ++F+A++Q+ TV
Subjt:  VFDLRLRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTV

Query:  RDYR
        ++YR
Subjt:  RDYR

A0A5C7IJS7 Uncharacterized protein | e value 7.5e-27 | identity 36.36%
Query:  ESWKKEMALFKDEMRLWMTAVSQRMGMEEK----TRDDKGKGLEKIDVGEKTPQSIATSSDGSNQLTNALAPVFDLRLRKLEVPIFEGENPDAWLHRVAR
        E  ++E+   KDE+      +   + M+E+         GKG E               + GS   +N      D R RKLE+P+F+G NPD W+ +   
Subjt:  ESWKKEMALFKDEMRLWMTAVSQRMGMEEK----TRDDKGKGLEKIDVGEKTPQSIATSSDGSNQLTNALAPVFDLRLRKLEVPIFEGENPDAWLHRVAR

Query:  YFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTVRDYR
        YF + R  ++EKLEA+V+  +G+AL W+QWE +K P+  WEE +LL+L++FR T EG+L ++F+A++Q+ TV++YR
Subjt:  YFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTVRDYR

A0A7J6HNQ9 Retrotrans_gag domain-containing protein | e value 1.7e-26 | identity 36.82%
Query:  VATRMESRVEAVEQQISGVVSAIEDSRESWKKEMALFKDE-----------MRLWMT---AVSQRMGMEEKTRDDKGKGLEKIDVGEKTPQSIATSSDGS
        V TRMESRV+ VE  ++GV SA+            L K              RL ++    V +   + E +  + G G      GE++  +    S+G 
Subjt:  VATRMESRVEAVEQQISGVVSAIEDSRESWKKEMALFKDE-----------MRLWMT---AVSQRMGMEEKTRDDKGKGLEKIDVGEKTPQSIATSSDGS

Query:  N-------QLTNALAPVFDLRLRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLE
        N       + T    P  +   +K+E+P+F G+NPD W +R  RYF + RL+  E+LEAAVLCL+G AL W +WE +++ I +WEE + LLL+RF    E
Subjt:  N-------QLTNALAPVFDLRLRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLE

Query:  GTLCDRFMAVKQETTVRDYR
        GT+ DRF    Q TTV+DYR
Subjt:  GTLCDRFMAVKQETTVRDYR

W9QTX5 Mediator of RNA polymerase II transcription subunit 25 | e value 2.7e-29 | identity 54.9%
Query:  DLRLRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTVRD
        +L  R++E+P+F+GENPD W  R  RYF +N++T+ EKL+ AV+ L+GEALAW QWE+ ++PI++W   +L+LL+RFRP  EG+LC++F++++QETTVRD
Subjt:  DLRLRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTVRD

Query:  YR
        YR
Subjt:  YR

W9RBJ4 Retrotrans_gag domain-containing protein | e value 4.4e-27 | identity 55.1%
Query:  DLRLRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTV
        DL LR+LE+P+FEG+NP+ WL RV RYF +NRLT+++KL AA +C  G+ALAW QWE+ +NP+++W E +  LL RFR + EGT  D+F+A++Q+ TV
Subjt:  DLRLRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTV

SwissProt top hits (hit, e value, % identity, alignment)
No hits found
Arabidopsis top hits (hit, e value, % identity, alignment)
AT1G67020.1 unknown protein | e value 7.4e-11 | identity 36.84%
Query:  LRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRP
        +R++E+P+F+G     W  +V R+FR+ R  D +KL+   L L+G AL W   E      + W  F   LL RF P
Subjt:  LRKLEVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRP

AT3G44713.1 unknown protein | e value 2.3e-04 | identity 31.43%
Query:  PIFEGENPD--AWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRF
        P F G   +  +W+  +  +F     TDDEK+  A   ++GEA AW    ++    ++WE  R  L+ RF
Subjt:  PIFEGENPD--AWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRF
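
In the alignments above, the unlabeled middle line marks how each column compares: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. A rough sketch of tallying one block from that line follows; `identity_from_midline` is a hypothetical helper, and it assumes the midline has kept its full width (trailing spaces are easily lost when a page is copied, so the column count may need to be taken from the Query line instead):

```python
def identity_from_midline(midline):
    """Return (identical, positives, columns) for one BLAST-style midline.

    Letters are identical residues, '+' marks conservative substitutions,
    and spaces mark mismatches or gaps.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    return identical, positives, len(midline)

# Midline of the last block of the TXG60193.1 alignment ("++YR" under "RDYR").
identical, positives, columns = identity_from_midline("++YR")
print(f"{identical}/{columns} identical, {positives}/{columns} positive")
```

The reported % identity covers the whole alignment, so the counts from every block of a hit would have to be summed before dividing.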


Sequences
CDS sequence
ATGGTCGCTACGAGAATGGAGTCTCGAGTTGAAGCAGTGGAACAACAAATTTCCGGGGTGGTTTCGGCCATAGAAGATAGTAGGGAATCGTGGAAGAAGGAAATG
GCATTGTTTAAGGATGAGATGCGTCTGTGGATGACAGCGGTATCACAGAGGATGGGAATGGAAGAGAAGACTAGGGACGACAAAGGGAAAGGGCTGGAGAAAATC
GATGTGGGGGAGAAGACGCCTCAGAGTATTGCGACTTCAAGTGATGGGTCGAATCAACTCACGAACGCGCTAGCCCCGGTATTCGATCTCCGTTTACGCAAGTTG
GAGGTACCTATTTTTGAGGGGGAAAATCCCGATGCGTGGCTACACCGTGTGGCCCGATATTTTCGGATTAATCGATTGACGGATGACGAGAAATTAGAGGCGGCG
GTGCTCTGTTTGGACGGTGAGGCTTTGGCTTGGCATCAGTGGGAGGAGAGGAAGAATCCGATACAGACTTGGGAGGAGTTCCGGCTATTGTTATTGCAGCGTTTC
CGACCAACCTTGGAAGGGACTCTGTGCGACCGATTCATGGCAGTGAAGCAGGAGACGACGGTGAGGGATTACCGCTGA
mRNA sequence
ATGGTCGCTACGAGAATGGAGTCTCGAGTTGAAGCAGTGGAACAACAAATTTCCGGGGTGGTTTCGGCCATAGAAGATAGTAGGGAATCGTGGAAGAAGGAAATG
GCATTGTTTAAGGATGAGATGCGTCTGTGGATGACAGCGGTATCACAGAGGATGGGAATGGAAGAGAAGACTAGGGACGACAAAGGGAAAGGGCTGGAGAAAATC
GATGTGGGGGAGAAGACGCCTCAGAGTATTGCGACTTCAAGTGATGGGTCGAATCAACTCACGAACGCGCTAGCCCCGGTATTCGATCTCCGTTTACGCAAGTTG
GAGGTACCTATTTTTGAGGGGGAAAATCCCGATGCGTGGCTACACCGTGTGGCCCGATATTTTCGGATTAATCGATTGACGGATGACGAGAAATTAGAGGCGGCG
GTGCTCTGTTTGGACGGTGAGGCTTTGGCTTGGCATCAGTGGGAGGAGAGGAAGAATCCGATACAGACTTGGGAGGAGTTCCGGCTATTGTTATTGCAGCGTTTC
CGACCAACCTTGGAAGGGACTCTGTGCGACCGATTCATGGCAGTGAAGCAGGAGACGACGGTGAGGGATTACCGCTGA
Protein sequence
MVATRMESRVEAVEQQISGVVSAIEDSRESWKKEMALFKDEMRLWMTAVSQRMGMEEKTRDDKGKGLEKIDVGEKTPQSIATSSDGSNQLTNALAPVFDLRLRKL
EVPIFEGENPDAWLHRVARYFRINRLTDDEKLEAAVLCLDGEALAWHQWEERKNPIQTWEEFRLLLLQRFRPTLEGTLCDRFMAVKQETTVRDYR
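
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly; `translate` is a hypothetical helper, and only the first 105 and last 12 nucleotides of the CDS are used to keep the example short:

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First line of the CDS above (105 nt) and its final 12 nt.
cds_start = (
    "ATGGTCGCTACGAGAATGGAGTCTCGAGTTGAAGCAGTGGAACAACAAATTTCC"
    "GGGGTGGTTTCGGCCATAGAAGATAGTAGGGAATCGTGGAAGAAGGAAATG"
)
cds_end = "GATTACCGCTGA"

print(translate(cds_start))  # MVATRMESRVEAVEQQISGVVSAIEDSRESWKKEM
print(translate(cds_end))    # DYR
```

Both fragments agree with the start and end of the protein sequence listed above.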