CuGenDBv2

Moc04g23580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g23580
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC111019515
Genome location: chr4:17024320..17025414
RNA-Seq Expression: Moc04g23580
Synteny: Moc04g23580
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
                     GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: NA


Homology

GenBank top hits (e value, % identity, alignment)
PIN01433.1 hypothetical protein CDL12_26059 [Handroanthus impetiginosus] (e value: 6.0e-21; % identity: 36.32)
Query:  TARDKEWRPLIQLIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNIKNIR-DAVGNKMLVTPTVAQLDEALACVGKPSATWDLTTHGKV
        T R+++W+  I   +   L LVREFYA A   ++   +VRG+E+ FD+  IN  +NI  I  DA  N         +L   L   G   A W +T    V
Subjt:  TARDKEWRPLIQLIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNIKNIR-DAVGNKMLVTPTVAQLDEALACVGKPSATWDLTTHGKV

Query:  RLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLPEDQ
          K   +  AA  WL+ +  R+LPT H   VT DRALL+Y ++ G   + G++I+ SI + A+ +R  L+ P L+T LC R GV+  E +
Subjt:  RLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLPEDQ

QHN86150.1 DNA-directed RNA polymerase I subunit [Arachis hypogaea] (e value: 2.2e-18; % identity: 27.2)
Query:  SEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQLIQCEALELVREFYAAV------------HPQSHIAIVRGKE
        SE  E+E   + A+      ++  +K++ E+ F L   + P+   E  R + W  L   IQ   + +V+EFYA               P++++ +VRGK 
Subjt:  SEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQLIQCEALELVREFYAAV------------HPQSHIAIVRGKE

Query:  IRFDATQINYTFNIKNIRDAVGNKMLVTPTVAQLDEALACVGKPSATWDLTTHGK-VRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAML
        + F    +   FN+ N+ +             +LD+ L  +    A W + + GK V+L   D+   A GWL  ++  I+PT +   VT DRA+++++++
Subjt:  IRFDATQINYTFNIKNIRDAVGNKMLVTPTVAQLDEALACVGKPSATWDLTTHGK-VRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAML

Query:  KGIDVNYGELINTSIHECAHR--TRGKLYHPRLVTSLCLRQGVQLPED-QIKRDAPIVEEK
         G +V   E+I   +++ A +  T  +L  P L+  LC   G+ +  D  IK+D PI ++K
Subjt:  KGIDVNYGELINTSIHECAHR--TRGKLYHPRLVTSLCLRQGVQLPED-QIKRDAPIVEEK

TYH88163.1 hypothetical protein ES332_D01G168900v1 [Gossypium tomentosum] (e value: 1.5e-19; % identity: 30.37)
Query:  SLMKSRKIMTELGFDLTLGDV---PDDWRETARDKEWRPLIQLIQCEALELVREFYAAVHPQSHI-AIVRGKEIRFDATQINYTFNIKNIRDAVGNKMLV
        S+ K + +M E GFDL   D+   P   R+     +W            ELVREFYA++  Q     IVR K++   +  IN  FN+ ++ +     M+ 
Subjt:  SLMKSRKIMTELGFDLTLGDV---PDDWRETARDKEWRPLIQLIQCEALELVREFYAAVHPQSHI-AIVRGKEIRFDATQINYTFNIKNIRDAVGNKMLV

Query:  TPTVAQLDEALACVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYH
              L + L  V    + W +  +G    + E +   A  W Y V+   +P  H   ++ +R LL+YA+L    +N G++I   IH CA +  G +Y 
Subjt:  TPTVAQLDEALACVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYH

Query:  PRLVTSLCLRQGVQ
        P L+TSLCL+  V+
Subjt:  PRLVTSLCLRQGVQ

XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia] (e value: 1.4e-70; % identity: 92.11)
Query:  MNEPKTRAVKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQLIQCEALEL
        MNEPKTRA KAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNAANLATRTSLMK  KIMTELGFDLTLGDVPDDWR+TAR KEWRPLIQ IQCEALEL
Subjt:  MNEPKTRAVKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQLIQCEALEL

Query:  VREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTV
        VREFYAA HPQSHIAIVRGKEIRFDATQINYTFNIKNI+DAVGNKMLVTPT+
Subjt:  VREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTV

XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia] (e value: 3.3e-27; % identity: 32.05)
Query:  KAAEAKKKVVAPGPVDTI-ELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQLIQCEALELVREFYAAVHP
        KA  A+ + VA  P++   E +    E+  ++     L  R     +R I+ E GFD     VP+  R+   +  W  L   I   +  LV+EFY A++P
Subjt:  KAAEAKKKVVAPGPVDTI-ELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQLIQCEALELVREFYAAVHP

Query:  QSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTVAQLDEALACVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQ
               RG E+R                   GN++LV P+  Q++EA   + +P  TW ++T GK+ LKP D++  A  W+Y+VKNR++PT +D  + +
Subjt:  QSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTVAQLDEALACVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQ

Query:  DRALLVYAMLKGIDVNYGELINTSIHECAHRTRG
        +RA++VY ++KG++ N+GELI   I  C+ +  G
Subjt:  DRALLVYAMLKGIDVNYGELINTSIHECAHRTRG

TrEMBL top hits (e value, % identity, alignment)
A0A2G9G807 Uncharacterized protein (e value: 2.9e-21; % identity: 36.32)
Query:  TARDKEWRPLIQLIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNIKNIR-DAVGNKMLVTPTVAQLDEALACVGKPSATWDLTTHGKV
        T R+++W+  I   +   L LVREFYA A   ++   +VRG+E+ FD+  IN  +NI  I  DA  N         +L   L   G   A W +T    V
Subjt:  TARDKEWRPLIQLIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNIKNIR-DAVGNKMLVTPTVAQLDEALACVGKPSATWDLTTHGKV

Query:  RLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLPEDQ
          K   +  AA  WL+ +  R+LPT H   VT DRALL+Y ++ G   + G++I+ SI + A+ +R  L+ P L+T LC R GV+  E +
Subjt:  RLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLPEDQ

A0A2P5BCG4 Uncharacterized protein (Fragment) (e value: 3.0e-18; % identity: 29.86)
Query:  KWNAANLATR-TSLMKSRKIMTELGFDL----TLGDVPDDWRETARDKEWRPLIQLIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNI
        K+     ATR  + +++R +  E GF L    T+G +P    +      W+      +   + LVREFYA    P+ +   VRG ++ +    IN  F +
Subjt:  KWNAANLATR-TSLMKSRKIMTELGFDL----TLGDVPDDWRETARDKEWRPLIQLIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNI

Query:  KNIRDAVGNKMLVTPTVAQLDEALACVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSI
         +  D   ++ +   T   L   L  V    A W+++  G        ++ AA  W + +K+R+LPT H + V++DR LL+++ML G  +N G +I++ I
Subjt:  KNIRDAVGNKMLVTPTVAQLDEALACVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSI

Query:  HECAHRTRGKLYHPRLVTSLC
          CA R  G L+ P L+T LC
Subjt:  HECAHRTRGKLYHPRLVTSLC

A0A5D2MA47 Uncharacterized protein (e value: 7.2e-20; % identity: 30.37)
Query:  SLMKSRKIMTELGFDLTLGDV---PDDWRETARDKEWRPLIQLIQCEALELVREFYAAVHPQSHI-AIVRGKEIRFDATQINYTFNIKNIRDAVGNKMLV
        S+ K + +M E GFDL   D+   P   R+     +W            ELVREFYA++  Q     IVR K++   +  IN  FN+ ++ +     M+ 
Subjt:  SLMKSRKIMTELGFDLTLGDV---PDDWRETARDKEWRPLIQLIQCEALELVREFYAAVHPQSHI-AIVRGKEIRFDATQINYTFNIKNIRDAVGNKMLV

Query:  TPTVAQLDEALACVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYH
              L + L  V    + W +  +G    + E +   A  W Y V+   +P  H   ++ +R LL+YA+L    +N G++I   IH CA +  G +Y 
Subjt:  TPTVAQLDEALACVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYH

Query:  PRLVTSLCLRQGVQ
        P L+TSLCL+  V+
Subjt:  PRLVTSLCLRQGVQ

A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 (e value: 6.9e-71; % identity: 92.11)
Query:  MNEPKTRAVKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQLIQCEALEL
        MNEPKTRA KAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNAANLATRTSLMK  KIMTELGFDLTLGDVPDDWR+TAR KEWRPLIQ IQCEALEL
Subjt:  MNEPKTRAVKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQLIQCEALEL

Query:  VREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTV
        VREFYAA HPQSHIAIVRGKEIRFDATQINYTFNIKNI+DAVGNKMLVTPT+
Subjt:  VREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTV

A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 (e value: 1.6e-27; % identity: 32.05)
Query:  KAAEAKKKVVAPGPVDTI-ELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQLIQCEALELVREFYAAVHP
        KA  A+ + VA  P++   E +    E+  ++     L  R     +R I+ E GFD     VP+  R+   +  W  L   I   +  LV+EFY A++P
Subjt:  KAAEAKKKVVAPGPVDTI-ELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQLIQCEALELVREFYAAVHP

Query:  QSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTVAQLDEALACVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQ
               RG E+R                   GN++LV P+  Q++EA   + +P  TW ++T GK+ LKP D++  A  W+Y+VKNR++PT +D  + +
Subjt:  QSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTVAQLDEALACVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQ

Query:  DRALLVYAMLKGIDVNYGELINTSIHECAHRTRG
        +RA++VY ++KG++ N+GELI   I  C+ +  G
Subjt:  DRALLVYAMLKGIDVNYGELINTSIHECAHRTRG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences

CDS sequence
ATGAATGAACCTAAAACGAGAGCTGTGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAATTGAACTAGACTTGTCTGAG
GGAGAGGAGGTCGAGACGAAATGGAACGCGGCAAATTTAGCCACTCGAACTTCATTAATGAAATCCCGTAAGATTATGACAGAATTGGGATTCGATCTCACTCTA
GGAGATGTGCCTGATGATTGGAGGGAGACCGCTAGAGACAAAGAGTGGAGACCACTCATTCAGCTTATACAATGTGAGGCTTTGGAGTTAGTCAGAGAGTTCTAT
GCTGCTGTCCATCCCCAGTCACATATAGCCATAGTGCGCGGGAAGGAAATACGGTTTGATGCCACTCAGATCAACTACACCTTTAACATTAAGAATATTAGAGAT
GCTGTGGGCAATAAGATGTTAGTGACTCCGACTGTAGCACAGCTTGATGAGGCTCTAGCATGTGTTGGGAAGCCCTCTGCTACTTGGGATTTGACTACTCATGGC
AAGGTACGACTAAAACCCGAGGATGTTTCCCTAGCTGCTGCAGGATGGTTATACATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGTCACTCAG
GATAGGGCACTGCTGGTTTATGCCATGCTAAAGGGCATAGATGTGAATTATGGAGAATTGATCAATACTAGTATCCATGAGTGTGCCCACCGAACACGTGGTAAG
CTTTATCACCCACGTTTGGTCACTTCATTATGCTTGCGACAAGGTGTACAGCTCCCTGAGGATCAAATTAAGAGAGATGCCCCAATTGTGGAAGAGAAGAATATT
CGGCGTATTATCGCCCATGCGTTACAAAGAAGGGAAGGTACTGGGATGTCTCCTACATCGGAGATCCGTCGTCTTCGAGAGGAGAACCAACAGCTGCGAGATCAG
GTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCT
CCCAGTACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA

mRNA sequence
ATGAATGAACCTAAAACGAGAGCTGTGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAATTGAACTAGACTTGTCTGAG
GGAGAGGAGGTCGAGACGAAATGGAACGCGGCAAATTTAGCCACTCGAACTTCATTAATGAAATCCCGTAAGATTATGACAGAATTGGGATTCGATCTCACTCTA
GGAGATGTGCCTGATGATTGGAGGGAGACCGCTAGAGACAAAGAGTGGAGACCACTCATTCAGCTTATACAATGTGAGGCTTTGGAGTTAGTCAGAGAGTTCTAT
GCTGCTGTCCATCCCCAGTCACATATAGCCATAGTGCGCGGGAAGGAAATACGGTTTGATGCCACTCAGATCAACTACACCTTTAACATTAAGAATATTAGAGAT
GCTGTGGGCAATAAGATGTTAGTGACTCCGACTGTAGCACAGCTTGATGAGGCTCTAGCATGTGTTGGGAAGCCCTCTGCTACTTGGGATTTGACTACTCATGGC
AAGGTACGACTAAAACCCGAGGATGTTTCCCTAGCTGCTGCAGGATGGTTATACATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGTCACTCAG
GATAGGGCACTGCTGGTTTATGCCATGCTAAAGGGCATAGATGTGAATTATGGAGAATTGATCAATACTAGTATCCATGAGTGTGCCCACCGAACACGTGGTAAG
CTTTATCACCCACGTTTGGTCACTTCATTATGCTTGCGACAAGGTGTACAGCTCCCTGAGGATCAAATTAAGAGAGATGCCCCAATTGTGGAAGAGAAGAATATT
CGGCGTATTATCGCCCATGCGTTACAAAGAAGGGAAGGTACTGGGATGTCTCCTACATCGGAGATCCGTCGTCTTCGAGAGGAGAACCAACAGCTGCGAGATCAG
GTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCT
CCCAGTACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA

Protein sequence
MNEPKTRAVKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQLIQCEALELVREFY
AAVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKMLVTPTVAQLDEALACVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQ
DRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLPEDQIKRDAPIVEEKNIRRIIAHALQRREGTGMSPTSEIRRLREENQQLRDQ
VREVVQHIYNLRASLDFAVLPSWPPALAAILGHPSPSTDTDPSPQPPTS
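
Consistency check (illustrative sketch, not part of the database record). The listed genome location, CDS and protein sequence can be cross-checked against each other. The Python below is a minimal sketch under stated assumptions: the function and constant names are hypothetical, the standard genetic code (NCBI translation table 1) is assumed, and only the first CDS line is translated to keep the example short. The location chr4:17024320..17025414 spans 1095 positions, which equals 3 x (364 + 1) for the 364-residue protein listed above plus a stop codon. That would be consistent with a single-exon gene, but the record itself does not state this.

```python
# Values copied from the record above; names are hypothetical.
LOCATION = "chr4:17024320..17025414"
CDS_FIRST_LINE = (
    "ATGAATGAACCTAAAACGAGAGCTGTGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAG"
    "GTAGTGGCACCTGGGCCAGTTGATACAATTGAACTAGACTTGTCTGAG"
)
PROTEIN_PREFIX = "MNEPKTRAVKAKAAEAKKKVVAPGPVDTIELDLSE"
PROTEIN_LENGTH = 364  # residues counted from the protein listing above

# Standard genetic code (assumed), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def span_length(location: str) -> int:
    """Length of an inclusive 'chr:start..end' interval."""
    _, coords = location.split(":")
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


def translate(dna: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    length = span_length(LOCATION)
    print("genomic span:", length)
    print("matches protein + stop:", length == 3 * (PROTEIN_LENGTH + 1))
    print("first CDS line matches protein start:",
          translate(CDS_FIRST_LINE) == PROTEIN_PREFIX)
```

To check the whole record, the same translate function could be applied to the full CDS with line breaks removed and compared against the full protein sequence.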