; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g23740 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g23740
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111019515
Genome locationchr5:16955275..16956369
RNA-Seq ExpressionMoc05g23740
SyntenyMoc05g23740
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN01433.1 hypothetical protein CDL12_26059 [Handroanthus impetiginosus]7.9e-2136.32Show/hide
Query:  TARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNIENIR-DTVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKV
        T R+++W+  I   +   L LVREFYA A   ++   +VRG+E+ FD+  IN  +NI  I  D   N        E+L   L   G   A W +T    V
Subjt:  TARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNIENIR-DTVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKV

Query:  RLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLPEDQ
          K   +  AA  WL+ +  R+LPT H   VT DRALL+Y ++ G   + G++I+ SI + A+ +R  L+ P L+T LC R GV+  E +
Subjt:  RLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLPEDQ

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]8.7e-2030.77Show/hide
Query:  KWNAENLATR-TSLMKSRKIMTELGFDL----TLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNI
        K+  E  ATR  + +++R +  E GF L    T+G +P    +      W+      +   + LVREFYA    P+ +   VRG ++ +    IN  F +
Subjt:  KWNAENLATR-TSLMKSRKIMTELGFDL----TLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNI

Query:  ENIRDTVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSI
         +  D   ++ +   T + L   LE V    A W+++  G        ++ AA  W + +K+R+LPT H + V++DR LL+++ML G  +N G +I++ I
Subjt:  ENIRDTVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSI

Query:  HECAHRTRGKLYHPRLVTSLC
          CA R  G L+ P L+T LC
Subjt:  HECAHRTRGKLYHPRLVTSLC

TYH88163.1 hypothetical protein ES332_D01G168900v1 [Gossypium tomentosum]1.0e-2030.37Show/hide
Query:  SLMKSRKIMTELGFDLTLGDV---PDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHI-AIVRGKEIRFDATQINYTFNIENIRDTVGNKMLV
        S+ K + +M E GFDL   D+   P   R+     +W            ELVREFYA++  Q     IVR K++   +  IN  FN+ ++ +     M+ 
Subjt:  SLMKSRKIMTELGFDLTLGDV---PDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHI-AIVRGKEIRFDATQINYTFNIENIRDTVGNKMLV

Query:  TPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYH
            + L + L+ V    + W +  +G    + E +   A  W Y V+   +P  H   ++ +R LL+YA+L    +N G++I   IH CA +  G +Y 
Subjt:  TPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYH

Query:  PRLVTSLCLRQGVQ
        P L+TSLCL+  V+
Subjt:  PRLVTSLCLRQGVQ

XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia]2.9e-7192.16Show/hide
Query:  MNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAENLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALEL
        MNEPKTRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNA NLATRTSLMK  KIMTELGFDLTLGDVPDDWR+TAR KEWRPLIQPIQCEALEL
Subjt:  MNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAENLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALEL

Query:  VREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDTVGNKMLVTPTLE
        VREFYAA HPQSHIAIVRGKEIRFDATQINYTFNI+NI+D VGNKMLVTPTLE
Subjt:  VREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDTVGNKMLVTPTLE

XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]1.8e-2831.66Show/hide
Query:  MNEPKTRAAKA--------------KAAEAKKKVVAPGPVDTI-ELDLSEGEEVETKWNAENLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKE
        ++ P+TR A A              KA  A+ + VA  P++   E +    E+  ++     L  R     +R I+ E GFD     VP+  R+   +  
Subjt:  MNEPKTRAAKA--------------KAAEAKKKVVAPGPVDTI-ELDLSEGEEVETKWNAENLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKE

Query:  WRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDTVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVS
        W  L  PI   +  LV+EFY A++P       RG E+R                   GN++LV P+ EQ++EA   + +P  TW ++T GK+ LKP D++
Subjt:  WRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDTVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVS

Query:  LAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRG
          A  W+Y+VKNR++PT +D  + ++RA++VY ++KG++ N+GELI   I  C+ +  G
Subjt:  LAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRG

TrEMBL top hitse value%identityAlignment
A0A2G9G807 Uncharacterized protein3.8e-2136.32Show/hide
Query:  TARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNIENIR-DTVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKV
        T R+++W+  I   +   L LVREFYA A   ++   +VRG+E+ FD+  IN  +NI  I  D   N        E+L   L   G   A W +T    V
Subjt:  TARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNIENIR-DTVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKV

Query:  RLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLPEDQ
          K   +  AA  WL+ +  R+LPT H   VT DRALL+Y ++ G   + G++I+ SI + A+ +R  L+ P L+T LC R GV+  E +
Subjt:  RLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLPEDQ

A0A2P5BCG4 Uncharacterized protein (Fragment)4.2e-2030.77Show/hide
Query:  KWNAENLATR-TSLMKSRKIMTELGFDL----TLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNI
        K+  E  ATR  + +++R +  E GF L    T+G +P    +      W+      +   + LVREFYA    P+ +   VRG ++ +    IN  F +
Subjt:  KWNAENLATR-TSLMKSRKIMTELGFDL----TLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNI

Query:  ENIRDTVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSI
         +  D   ++ +   T + L   LE V    A W+++  G        ++ AA  W + +K+R+LPT H + V++DR LL+++ML G  +N G +I++ I
Subjt:  ENIRDTVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSI

Query:  HECAHRTRGKLYHPRLVTSLC
          CA R  G L+ P L+T LC
Subjt:  HECAHRTRGKLYHPRLVTSLC

A0A5D2MA47 Uncharacterized protein5.0e-2130.37Show/hide
Query:  SLMKSRKIMTELGFDLTLGDV---PDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHI-AIVRGKEIRFDATQINYTFNIENIRDTVGNKMLV
        S+ K + +M E GFDL   D+   P   R+     +W            ELVREFYA++  Q     IVR K++   +  IN  FN+ ++ +     M+ 
Subjt:  SLMKSRKIMTELGFDLTLGDV---PDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHI-AIVRGKEIRFDATQINYTFNIENIRDTVGNKMLV

Query:  TPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYH
            + L + L+ V    + W +  +G    + E +   A  W Y V+   +P  H   ++ +R LL+YA+L    +N G++I   IH CA +  G +Y 
Subjt:  TPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRGKLYH

Query:  PRLVTSLCLRQGVQ
        P L+TSLCL+  V+
Subjt:  PRLVTSLCLRQGVQ

A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC1110195151.4e-7192.16Show/hide
Query:  MNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAENLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALEL
        MNEPKTRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNA NLATRTSLMK  KIMTELGFDLTLGDVPDDWR+TAR KEWRPLIQPIQCEALEL
Subjt:  MNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAENLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALEL

Query:  VREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDTVGNKMLVTPTLE
        VREFYAA HPQSHIAIVRGKEIRFDATQINYTFNI+NI+D VGNKMLVTPTLE
Subjt:  VREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDTVGNKMLVTPTLE

A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220078.5e-2931.66Show/hide
Query:  MNEPKTRAAKA--------------KAAEAKKKVVAPGPVDTI-ELDLSEGEEVETKWNAENLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKE
        ++ P+TR A A              KA  A+ + VA  P++   E +    E+  ++     L  R     +R I+ E GFD     VP+  R+   +  
Subjt:  MNEPKTRAAKA--------------KAAEAKKKVVAPGPVDTI-ELDLSEGEEVETKWNAENLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKE

Query:  WRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDTVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVS
        W  L  PI   +  LV+EFY A++P       RG E+R                   GN++LV P+ EQ++EA   + +P  TW ++T GK+ LKP D++
Subjt:  WRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIENIRDTVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVS

Query:  LAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRG
          A  W+Y+VKNR++PT +D  + ++RA++VY ++KG++ N+GELI   I  C+ +  G
Subjt:  LAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTRG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGAACCTAAAACGAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAATTGAACTCGACTTGTCTGAGGGAGA
GGAGGTCGAGACGAAATGGAACGCGGAAAATTTAGCCACTCGAACTTCATTAATGAAATCCCGTAAGATTATGACAGAATTGGGATTCGATCTCACTCTAGGAGATGTGC
CTGATGATTGGAGGGAGACCGCCAGAGACAAAGAATGGAGACCACTCATTCAGCCCATACAATGTGAGGCTTTGGAGTTAGTCAGAGAGTTCTATGCTGCTGTCCATCCC
CAATCACATATAGCCATAGTGCGCGGGAAGGAAATACGGTTTGATGCCACTCAAATCAACTACACCTTCAACATTGAGAATATCAGAGACACTGTGGGCAATAAGATGTT
AGTGACTCCGACTCTGGAACAGCTTGATGAGGCTCTAGAATGTGTTGGGAAGCCCTCTGCCACTTGGGATTTGACTACTCATGGCAAGGTACGACTAAAACCCGAGGATG
TTTCCCTAGCTGCTGCAGGATGGTTATACATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGTCACTCAGGATAGGGCATTGCTGGTTTATGCCATGCTA
AAGGGCATAGATGTGAATTATGGAGAATTGATCAATACCAGTATCCATGAGTGTGCCCACCGGACACGTGGTAAGCTTTATCACCCACGTTTGGTCACTTCTTTATGCTT
ACGACAAGGTGTACAGCTCCCTGAGGATCAAATTAAGAGAGATGCCCCAATTGTGGAAGAGAAGAATATTCGGCGTATTATCGCCCATGCGTTACAAAGAAGGGAAGGTA
CTGGGATGTCTCCTACATCGGAGATCCGTCGTCTTCGAGAGGAGAACCAACAGCTGCGAGATCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTG
GATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCTCCCAGTACTGACACAGATCCTAGTCCACAACCTCCGACTTCATAA
mRNA sequenceShow/hide mRNA sequence
ATGAATGAACCTAAAACGAGAGCTGCGAAAGCTAAAGCAGCTGAAGCTAAGAAGAAGGTAGTGGCACCTGGGCCAGTTGATACAATTGAACTCGACTTGTCTGAGGGAGA
GGAGGTCGAGACGAAATGGAACGCGGAAAATTTAGCCACTCGAACTTCATTAATGAAATCCCGTAAGATTATGACAGAATTGGGATTCGATCTCACTCTAGGAGATGTGC
CTGATGATTGGAGGGAGACCGCCAGAGACAAAGAATGGAGACCACTCATTCAGCCCATACAATGTGAGGCTTTGGAGTTAGTCAGAGAGTTCTATGCTGCTGTCCATCCC
CAATCACATATAGCCATAGTGCGCGGGAAGGAAATACGGTTTGATGCCACTCAAATCAACTACACCTTCAACATTGAGAATATCAGAGACACTGTGGGCAATAAGATGTT
AGTGACTCCGACTCTGGAACAGCTTGATGAGGCTCTAGAATGTGTTGGGAAGCCCTCTGCCACTTGGGATTTGACTACTCATGGCAAGGTACGACTAAAACCCGAGGATG
TTTCCCTAGCTGCTGCAGGATGGTTATACATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGTCACTCAGGATAGGGCATTGCTGGTTTATGCCATGCTA
AAGGGCATAGATGTGAATTATGGAGAATTGATCAATACCAGTATCCATGAGTGTGCCCACCGGACACGTGGTAAGCTTTATCACCCACGTTTGGTCACTTCTTTATGCTT
ACGACAAGGTGTACAGCTCCCTGAGGATCAAATTAAGAGAGATGCCCCAATTGTGGAAGAGAAGAATATTCGGCGTATTATCGCCCATGCGTTACAAAGAAGGGAAGGTA
CTGGGATGTCTCCTACATCGGAGATCCGTCGTCTTCGAGAGGAGAACCAACAGCTGCGAGATCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTAAGGGCTTCATTG
GATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCTCCCAGTACTGACACAGATCCTAGTCCACAACCTCCGACTTCATAA
Protein sequenceShow/hide protein sequence
MNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNAENLATRTSLMKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHP
QSHIAIVRGKEIRFDATQINYTFNIENIRDTVGNKMLVTPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVSLAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAML
KGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLPEDQIKRDAPIVEEKNIRRIIAHALQRREGTGMSPTSEIRRLREENQQLRDQVREVVQHIYNLRASL
DFAVLPSWPPALAAILGHPSPSTDTDPSPQPPTS