; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g17810 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g17810
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111019515
Genome locationchr4:13082629..13083798
RNA-Seq ExpressionMoc04g17810
SyntenyMoc04g17810
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN01433.1 hypothetical protein CDL12_26059 [Handroanthus impetiginosus]1.7e-2136.84Show/hide
Query:  TARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNIENIR-DAVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKV
        T R+++W+  I   +   L LVREFYA A   ++   +VRG+E+ FD+  IN  +NI  I  DA  N        E+L   L   G   A W +T    V
Subjt:  TARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNIENIR-DAVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKV

Query:  RLKPEDVSLAAAGWLYIVKNRILPTEHDDHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLPEDQ
          K   +  AA  WL+ +  R+LPT H   VT DRALL+Y ++ G   + G++I+ SI + A+ +R  L+ P L+T LC R GV+  E +
Subjt:  RLKPEDVSLAAAGWLYIVKNRILPTEHDDHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLPEDQ

QHN86150.1 DNA-directed RNA polymerase I subunit [Arachis hypogaea]7.1e-2027.59Show/hide
Query:  SEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYAAV------------HPQSHIAIVRGKE
        SE  E+E   + A+      ++  +K++ E+ F L   + P+   E  R + W  L  PIQ   + +V+EFYA               P++++ +VRGK 
Subjt:  SEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYAAV------------HPQSHIAIVRGKE

Query:  IRFDATQINYTFNIENIRDAVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGK-VRLKPEDVSLAAAGWLYIVKNRILPTEHDDHVTQDRALLVYAML
        + F    +   FN+ N+ +            ++LD+ L  +    A W + + GK V+L   D+   A GWL  ++  I+PT +   VT DRA+++++++
Subjt:  IRFDATQINYTFNIENIRDAVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGK-VRLKPEDVSLAAAGWLYIVKNRILPTEHDDHVTQDRALLVYAML

Query:  KGIDVNYGELINTSIHECAHR--TRGKLYHPRLVTSLCLRQGVQLPED-QIKRDAPIVEEK
         G +V   E+I   +++ A +  T  +L  P L+  LC   G+ +  D  IK+D PI ++K
Subjt:  KGIDVNYGELINTSIHECAHR--TRGKLYHPRLVTSLCLRQGVQLPED-QIKRDAPIVEEK

TYH88163.1 hypothetical protein ES332_D01G168900v1 [Gossypium tomentosum]1.4e-2030.37Show/hide
Query:  SLMKSRKIMTELGFDLTLGDV---PDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHI-AIVRGKEIRFDATQINYTFNIENIRDAVGNKMLV
        S+ K + +M E GFDL   D+   P   R+     +W            ELVREFYA++  Q     IVR K++   +  IN  FN+ ++ +     M+ 
Subjt:  SLMKSRKIMTELGFDLTLGDV---PDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHI-AIVRGKEIRFDATQINYTFNIENIRDAVGNKMLV

Query:  TPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDDHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYH
            + L + L+ V    + W +  +G    + E +   A  W Y V+   +P  H   ++ +R LL+YA+L    +N G++I   IH CA +  G +Y 
Subjt:  TPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDDHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYH

Query:  PRLVTSLCLRQGVQ
        P L+TSLCL+  V+
Subjt:  PRLVTSLCLRQGVQ

XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia]7.8e-8392.13Show/hide
Query:  MFQYKRREKKSSKRRAVQAKKPTVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPD
        MFQYKRRE KSSKRRAVQ  KPTVP+NEPKTRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNAANLATRTSLMK  KIMTELGFDLTLGDVPD
Subjt:  MFQYKRREKKSSKRRAVQAKKPTVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPD

Query:  DWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLE
        DWR+TAR KEWRPLIQPIQCEALELVREFYAA HPQSHIAIVRGKEIRFDATQINYTFNI+NI+DAVGNKMLVTPTLE
Subjt:  DWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLE

XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]3.8e-2932.05Show/hide
Query:  VNEPKTRAAKA--------------KAAEAKKKVVAPGPVDTI-ELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKE
        V+ P+TR A A              KA  A+ + VA  P++   E +    E+  ++     L  R     +R I+ E GFD     VP+  R+   +  
Subjt:  VNEPKTRAAKA--------------KAAEAKKKVVAPGPVDTI-ELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKE

Query:  WRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVS
        W  L  PI   +  LV+EFY A++P       RG E+R                   GN++LV P+ EQ++EA   + +P  TW ++T GK+ LKP D++
Subjt:  WRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVS

Query:  LAAAGWLYIVKNRILPTEHDDHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRG
          A  W+Y+VKNR++PT +D  + ++RA++VY ++KG++ N+GELI   I  C+ +  G
Subjt:  LAAAGWLYIVKNRILPTEHDDHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRG

TrEMBL top hitse value%identityAlignment
A0A2G9G807 Uncharacterized protein8.2e-2236.84Show/hide
Query:  TARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNIENIR-DAVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKV
        T R+++W+  I   +   L LVREFYA A   ++   +VRG+E+ FD+  IN  +NI  I  DA  N        E+L   L   G   A W +T    V
Subjt:  TARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNIENIR-DAVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKV

Query:  RLKPEDVSLAAAGWLYIVKNRILPTEHDDHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLPEDQ
          K   +  AA  WL+ +  R+LPT H   VT DRALL+Y ++ G   + G++I+ SI + A+ +R  L+ P L+T LC R GV+  E +
Subjt:  RLKPEDVSLAAAGWLYIVKNRILPTEHDDHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLPEDQ

A0A2P5BCG4 Uncharacterized protein (Fragment)2.9e-1930.32Show/hide
Query:  KWNAANLATR-TSLMKSRKIMTELGFDL----TLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNI
        K+     ATR  + +++R +  E GF L    T+G +P    +      W+      +   + LVREFYA    P+ +   VRG ++ +    IN  F +
Subjt:  KWNAANLATR-TSLMKSRKIMTELGFDL----TLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNI

Query:  ENIRDAVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDDHVTQDRALLVYAMLKGIDVNYGELINTSI
         +  D   ++ +   T + L   LE V    A W+++  G        ++ AA  W + +K+R+LPT H   V++DR LL+++ML G  +N G +I++ I
Subjt:  ENIRDAVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDDHVTQDRALLVYAMLKGIDVNYGELINTSI

Query:  HECAHRTRGKLYHPRLVTSLC
          CA R  G L+ P L+T LC
Subjt:  HECAHRTRGKLYHPRLVTSLC

A0A5D2MA47 Uncharacterized protein7.0e-2130.37Show/hide
Query:  SLMKSRKIMTELGFDLTLGDV---PDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHI-AIVRGKEIRFDATQINYTFNIENIRDAVGNKMLV
        S+ K + +M E GFDL   D+   P   R+     +W            ELVREFYA++  Q     IVR K++   +  IN  FN+ ++ +     M+ 
Subjt:  SLMKSRKIMTELGFDLTLGDV---PDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHI-AIVRGKEIRFDATQINYTFNIENIRDAVGNKMLV

Query:  TPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDDHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYH
            + L + L+ V    + W +  +G    + E +   A  W Y V+   +P  H   ++ +R LL+YA+L    +N G++I   IH CA +  G +Y 
Subjt:  TPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDDHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYH

Query:  PRLVTSLCLRQGVQ
        P L+TSLCL+  V+
Subjt:  PRLVTSLCLRQGVQ

A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC1110195153.8e-8392.13Show/hide
Query:  MFQYKRREKKSSKRRAVQAKKPTVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPD
        MFQYKRRE KSSKRRAVQ  KPTVP+NEPKTRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNAANLATRTSLMK  KIMTELGFDLTLGDVPD
Subjt:  MFQYKRREKKSSKRRAVQAKKPTVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPD

Query:  DWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLE
        DWR+TAR KEWRPLIQPIQCEALELVREFYAA HPQSHIAIVRGKEIRFDATQINYTFNI+NI+DAVGNKMLVTPTLE
Subjt:  DWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLE

A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220071.8e-2932.05Show/hide
Query:  VNEPKTRAAKA--------------KAAEAKKKVVAPGPVDTI-ELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKE
        V+ P+TR A A              KA  A+ + VA  P++   E +    E+  ++     L  R     +R I+ E GFD     VP+  R+   +  
Subjt:  VNEPKTRAAKA--------------KAAEAKKKVVAPGPVDTI-ELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKE

Query:  WRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVS
        W  L  PI   +  LV+EFY A++P       RG E+R                   GN++LV P+ EQ++EA   + +P  TW ++T GK+ LKP D++
Subjt:  WRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVS

Query:  LAAAGWLYIVKNRILPTEHDDHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRG
          A  W+Y+VKNR++PT +D  + ++RA++VY ++KG++ N+GELI   I  C+ +  G
Subjt:  LAAAGWLYIVKNRILPTEHDDHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCAATACAAGAGGCGGGAGAAAAAGAGTTCAAAACGACGTGCAGTTCAGGCTAAGAAGCCAACAGTGCCCGTGAATGAACCTAAAACGAGAGCTGCGAAAGCTAA
AGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAATTGAACTGGACTTGTCTGAGGGAGAGGAGGTCGAGACGAAATGGAACGCAGCAAATTTAG
CCACTCGAACTTCATTAATGAAATCCCGTAAGATTATGACAGAATTGGGATTTGATCTCACCCTAGGAGATGTGCCTGATGATTGGAGGGAGACCGCCAGAGACAAAGAA
TGGAGACCACTCATTCAGCCCATACAATGTGAGGCTTTGGAGTTAGTCAGAGAATTCTATGCTGCAGTCCATCCCCAGTCACATATAGCCATAGTGCGCGGGAAGGAAAT
ACGGTTTGATGCCACTCAGATCAACTACACCTTCAACATTGAGAATATCAGAGATGCTGTGGGCAATAAGATGTTAGTGACTCCGACTCTGGAACAGCTTGATGAGGCTC
TAGAATGTGTTGGGAAGCCCTCTGCCACTTGGGATTTGACTACTCATGGCAAGGTACGACTAAAACCCGAGGATGTTTCCCTAGCTGCTGCAGGATGGTTATACATAGTC
AAAAACAGAATTCTGCCAACGGAGCATGATGACCATGTCACTCAGGATAGGGCATTGCTGGTTTATGCCATGCTAAAGGGCATAGATGTGAATTATGGAGAATTGATTAA
TACCAGTATCCATGAGTGTGCCCACCGGACACGTGGTAAGCTTTATCACCCACGTTTGGTCACTTCTTTATGCTTGCGACAAGGTGTACAGCTCCCTGAGGATCAAATTA
AGAGAGATGCCCCAATTGTGGAAGAGAAGAATATTCGGCGTATTATCGCCCATGCGTTACAAAGAAGGGAAGGTACTGGGATGTCTCCTACATCGGAGATCCGTCGTCTT
CGAGAGGAGAACCACCAGCTGCGAGATCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGTT
AGCTGCTATCCTTGGTCATCCATCTCTCAGTACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTCAATACAAGAGGCGGGAGAAAAAGAGTTCAAAACGACGTGCAGTTCAGGCTAAGAAGCCAACAGTGCCCGTGAATGAACCTAAAACGAGAGCTGCGAAAGCTAA
AGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAATTGAACTGGACTTGTCTGAGGGAGAGGAGGTCGAGACGAAATGGAACGCAGCAAATTTAG
CCACTCGAACTTCATTAATGAAATCCCGTAAGATTATGACAGAATTGGGATTTGATCTCACCCTAGGAGATGTGCCTGATGATTGGAGGGAGACCGCCAGAGACAAAGAA
TGGAGACCACTCATTCAGCCCATACAATGTGAGGCTTTGGAGTTAGTCAGAGAATTCTATGCTGCAGTCCATCCCCAGTCACATATAGCCATAGTGCGCGGGAAGGAAAT
ACGGTTTGATGCCACTCAGATCAACTACACCTTCAACATTGAGAATATCAGAGATGCTGTGGGCAATAAGATGTTAGTGACTCCGACTCTGGAACAGCTTGATGAGGCTC
TAGAATGTGTTGGGAAGCCCTCTGCCACTTGGGATTTGACTACTCATGGCAAGGTACGACTAAAACCCGAGGATGTTTCCCTAGCTGCTGCAGGATGGTTATACATAGTC
AAAAACAGAATTCTGCCAACGGAGCATGATGACCATGTCACTCAGGATAGGGCATTGCTGGTTTATGCCATGCTAAAGGGCATAGATGTGAATTATGGAGAATTGATTAA
TACCAGTATCCATGAGTGTGCCCACCGGACACGTGGTAAGCTTTATCACCCACGTTTGGTCACTTCTTTATGCTTGCGACAAGGTGTACAGCTCCCTGAGGATCAAATTA
AGAGAGATGCCCCAATTGTGGAAGAGAAGAATATTCGGCGTATTATCGCCCATGCGTTACAAAGAAGGGAAGGTACTGGGATGTCTCCTACATCGGAGATCCGTCGTCTT
CGAGAGGAGAACCACCAGCTGCGAGATCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGTT
AGCTGCTATCCTTGGTCATCCATCTCTCAGTACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
Protein sequenceShow/hide protein sequence
MFQYKRREKKSSKRRAVQAKKPTVPVNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAANLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKE
WRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDAVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIV
KNRILPTEHDDHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLPEDQIKRDAPIVEEKNIRRIIAHALQRREGTGMSPTSEIRRL
REENHQLRDQVREVVQHIYNLRASLDFAVLPSWPPALAAILGHPSLSTDTDPSPQPPTS