; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g20540 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g20540
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111022007
Genome locationchr4:14914788..14915720
RNA-Seq ExpressionMoc04g20540
SyntenyMoc04g20540
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN01433.1 hypothetical protein CDL12_26059 [Handroanthus impetiginosus]1.3e-1936.22Show/hide
Query:  TARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNIKNIR-DAVGNKILVTPTLEQLDEALECVGKPSATWDLTTHGKA
        T R+++W+  I   +   L LVREFYA A   ++   +VRG+E+ FD+  IN  +NI  I  DA  N        E+L   L   G   A W +T     
Subjt:  TARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNIKNIR-DAVGNKILVTPTLEQLDEALECVGKPSATWDLTTHGKA

Query:  RLKLEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLQGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQ
          K   +   A  WL+ +  R+LPT H   VT DRALL+Y ++ G   + G++I+ SI + A+ +R  L+ P L+T LC R GV+
Subjt:  RLKLEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLQGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQ

QHO49240.1 uncharacterized protein DS421_1g12310 [Arachis hypogaea]4.8e-1927.98Show/hide
Query:  RKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYAAV------------HPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNK
        +K++ E+ F L   + P + R   R + W  L  PIQ   + +V+EFYA               P++++ +VRGK + F    +   FN+  ++   G++
Subjt:  RKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYAAV------------HPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNK

Query:  ILVTPTL---EQLDEALECVGKPSATWDLTTHGK-ARLKLEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLQGIDVNYGELINTSIHECAHR
           T  +   ++LD+ L  + + SA W   + GK  +L+  D+   A GWL  ++  I+PT +   VT DRA+++++++ G ++   E+I+  +++ A +
Subjt:  ILVTPTL---EQLDEALECVGKPSATWDLTTHGK-ARLKLEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLQGIDVNYGELINTSIHECAHR

Query:  --TRGKLYHPRLVTSLCLRQGVQLLEDQIKRDAPIVEEKNIRR
          T  +L  P L+  LC   GV      I+ D PI E+K I +
Subjt:  --TRGKLYHPRLVTSLCLRQGVQLLEDQIKRDAPIVEEKNIRR

TYH88163.1 hypothetical protein ES332_D01G168900v1 [Gossypium tomentosum]1.3e-1929.86Show/hide
Query:  KSRKIMTELGFDLTLGDV---PDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHI-AIVRGKEIRFDATQINYTFNIKNIRDAVGNKILVTPT
        K + +M E GFDL   D+   P   R+     +W            ELVREFYA++  Q     IVR K++   +  IN  FN+ ++ +     ++    
Subjt:  KSRKIMTELGFDLTLGDV---PDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHI-AIVRGKEIRFDATQINYTFNIKNIRDAVGNKILVTPT

Query:  LEQLDEALECVGKPSATWDLTTHGKARLKLEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLQGIDVNYGELINTSIHECAHRTRGKLYHPRL
         + L + L+ V    + W +  +G    + E +   A  W Y V+   +P  H   ++ +R LL+YA+L    +N G++I   IH CA +  G +Y P L
Subjt:  LEQLDEALECVGKPSATWDLTTHGKARLKLEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLQGIDVNYGELINTSIHECAHRTRGKLYHPRL

Query:  VTSLCLRQGVQ
        +TSLCL+  V+
Subjt:  VTSLCLRQGVQ

XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia]6.7e-4592.93Show/hide
Query:  MKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKILVTPTLE
        MK  KIMTELGFDLTLGDVPDDWR+TAR KEWRPLIQPIQCEALELVREFYAA HPQSHIAIVRGKEIRFDATQINYTFNIKNI+DAVGNK+LVTPTLE
Subjt:  MKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKILVTPTLE

XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]1.2e-2529.84Show/hide
Query:  SRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKILVTPTLEQLD
        +R I+ E GFD     VP+  R+   +  W  L  PI   +  LV+EFY A++P       RG E+R                   GN+ILV P+ EQ++
Subjt:  SRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKILVTPTLEQLD

Query:  EALECVGKPSATWDLTTHGKARLKLEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLQGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLC
        EA   + +P  TW ++T GK  LK  D++  A  W+Y+VKNR++PT +D  + ++RA++VY +++G++ N+GELI   I  C+ +               
Subjt:  EALECVGKPSATWDLTTHGKARLKLEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLQGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLC

Query:  LRQGVQLLEDQIKRDAPIVEEKNIRRIIAHALQRREG---TGMSPTSEIRRLREENQQLRDQVREVVQHIYNLRASLDFA------VLPSWPPALAAILG
           GV+  +  +    P    +++R++  +++ R E    T   P +     RE+  +LR +   +   +   RA+  F         PS+P  LAA L 
Subjt:  LRQGVQLLEDQIKRDAPIVEEKNIRRIIAHALQRREG---TGMSPTSEIRRLREENQQLRDQVREVVQHIYNLRASLDFA------VLPSWPPALAAILG

Query:  HPSSS
         PSSS
Subjt:  HPSSS

TrEMBL top hitse value%identityAlignment
A0A2G9G807 Uncharacterized protein6.1e-2036.22Show/hide
Query:  TARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNIKNIR-DAVGNKILVTPTLEQLDEALECVGKPSATWDLTTHGKA
        T R+++W+  I   +   L LVREFYA A   ++   +VRG+E+ FD+  IN  +NI  I  DA  N        E+L   L   G   A W +T     
Subjt:  TARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNIKNIR-DAVGNKILVTPTLEQLDEALECVGKPSATWDLTTHGKA

Query:  RLKLEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLQGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQ
          K   +   A  WL+ +  R+LPT H   VT DRALL+Y ++ G   + G++I+ SI + A+ +R  L+ P L+T LC R GV+
Subjt:  RLKLEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLQGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQ

A0A2P5BCG4 Uncharacterized protein (Fragment)6.8e-1929.09Show/hide
Query:  MKSRKIMTELGFDL----TLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKILVT
        +++R +  E GF L    T+G +P    +      W+      +   + LVREFYA    P+ +   VRG ++ +    IN  F + +  D   ++ +  
Subjt:  MKSRKIMTELGFDL----TLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYA-AVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKILVT

Query:  PTLEQLDEALECVGKPSATWDLTTHGKARLKLEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLQGIDVNYGELINTSIHECAHRTRGKLYHP
         T + L   LE V    A W+++  G        ++  A  W + +K+R+LPT H + V++DR LL+++ML G  +N G +I++ I  CA R  G L+ P
Subjt:  PTLEQLDEALECVGKPSATWDLTTHGKARLKLEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLQGIDVNYGELINTSIHECAHRTRGKLYHP

Query:  RLVTSLCLRQGVQLLEDQIK
         L+T LC       L ++ K
Subjt:  RLVTSLCLRQGVQLLEDQIK

A0A5D2MA47 Uncharacterized protein6.1e-2029.86Show/hide
Query:  KSRKIMTELGFDLTLGDV---PDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHI-AIVRGKEIRFDATQINYTFNIKNIRDAVGNKILVTPT
        K + +M E GFDL   D+   P   R+     +W            ELVREFYA++  Q     IVR K++   +  IN  FN+ ++ +     ++    
Subjt:  KSRKIMTELGFDLTLGDV---PDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHI-AIVRGKEIRFDATQINYTFNIKNIRDAVGNKILVTPT

Query:  LEQLDEALECVGKPSATWDLTTHGKARLKLEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLQGIDVNYGELINTSIHECAHRTRGKLYHPRL
         + L + L+ V    + W +  +G    + E +   A  W Y V+   +P  H   ++ +R LL+YA+L    +N G++I   IH CA +  G +Y P L
Subjt:  LEQLDEALECVGKPSATWDLTTHGKARLKLEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLQGIDVNYGELINTSIHECAHRTRGKLYHPRL

Query:  VTSLCLRQGVQ
        +TSLCL+  V+
Subjt:  VTSLCLRQGVQ

A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC1110195153.2e-4592.93Show/hide
Query:  MKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKILVTPTLE
        MK  KIMTELGFDLTLGDVPDDWR+TAR KEWRPLIQPIQCEALELVREFYAA HPQSHIAIVRGKEIRFDATQINYTFNIKNI+DAVGNK+LVTPTLE
Subjt:  MKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKILVTPTLE

A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220075.7e-2629.84Show/hide
Query:  SRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKILVTPTLEQLD
        +R I+ E GFD     VP+  R+   +  W  L  PI   +  LV+EFY A++P       RG E+R                   GN+ILV P+ EQ++
Subjt:  SRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKILVTPTLEQLD

Query:  EALECVGKPSATWDLTTHGKARLKLEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLQGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLC
        EA   + +P  TW ++T GK  LK  D++  A  W+Y+VKNR++PT +D  + ++RA++VY +++G++ N+GELI   I  C+ +               
Subjt:  EALECVGKPSATWDLTTHGKARLKLEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLQGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLC

Query:  LRQGVQLLEDQIKRDAPIVEEKNIRRIIAHALQRREG---TGMSPTSEIRRLREENQQLRDQVREVVQHIYNLRASLDFA------VLPSWPPALAAILG
           GV+  +  +    P    +++R++  +++ R E    T   P +     RE+  +LR +   +   +   RA+  F         PS+P  LAA L 
Subjt:  LRQGVQLLEDQIKRDAPIVEEKNIRRIIAHALQRREG---TGMSPTSEIRRLREENQQLRDQVREVVQHIYNLRASLDFA------VLPSWPPALAAILG

Query:  HPSSS
         PSSS
Subjt:  HPSSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATCCCGTAAGATTATGACAGAATTGGGATTCGATCTCACTCTAGGAGATGTGCCTGATGATTGGAGGGAGACCGCTAGAGACAAAGAATGGAGACCACTCATTCA
GCCCATACAATGTGAGGCTTTGGAGTTAGTCAGAGAGTTCTATGCTGCTGTCCATCCCCAGTCACATATAGCCATAGTGCGCGGGAAGGAAATACGGTTTGATGCCACTC
AGATCAACTACACCTTCAACATTAAGAATATCAGAGATGCTGTGGGCAACAAGATTTTAGTGACTCCGACTCTGGAACAGCTTGATGAGGCTCTAGAATGTGTTGGGAAG
CCCTCTGCCACTTGGGATTTGACTACTCATGGCAAGGCACGACTAAAACTCGAGGATGTTTCCCTAACTGCTGCAGGATGGTTATACATAGTCAAAAACAGAATTCTGCC
AACGGAGCATGATGAGCATGTCACTCAGGATAGGGCACTGCTGGTTTATGCCATGCTACAGGGCATAGATGTGAATTATGGAGAATTGATCAATACCAGTATCCATGAGT
GTGCCCACCGAACACGTGGTAAGCTTTATCATCCACGATTGGTCACTTCTTTATGCTTGCGACAAGGTGTACAGCTCCTTGAGGATCAAATTAAGAGAGATGCCCCAATT
GTGGAAGAGAAGAATATTCGGCGTATTATCGCCCATGCATTACAAAGAAGGGAAGGTACTGGGATGTCTCCTACATCGGAGATCCGTCGTCTTCGAGAGGAGAACCAACA
GCTGCGAGATCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTGAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTC
ATCCATCTTCCAGTACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAATCCCGTAAGATTATGACAGAATTGGGATTCGATCTCACTCTAGGAGATGTGCCTGATGATTGGAGGGAGACCGCTAGAGACAAAGAATGGAGACCACTCATTCA
GCCCATACAATGTGAGGCTTTGGAGTTAGTCAGAGAGTTCTATGCTGCTGTCCATCCCCAGTCACATATAGCCATAGTGCGCGGGAAGGAAATACGGTTTGATGCCACTC
AGATCAACTACACCTTCAACATTAAGAATATCAGAGATGCTGTGGGCAACAAGATTTTAGTGACTCCGACTCTGGAACAGCTTGATGAGGCTCTAGAATGTGTTGGGAAG
CCCTCTGCCACTTGGGATTTGACTACTCATGGCAAGGCACGACTAAAACTCGAGGATGTTTCCCTAACTGCTGCAGGATGGTTATACATAGTCAAAAACAGAATTCTGCC
AACGGAGCATGATGAGCATGTCACTCAGGATAGGGCACTGCTGGTTTATGCCATGCTACAGGGCATAGATGTGAATTATGGAGAATTGATCAATACCAGTATCCATGAGT
GTGCCCACCGAACACGTGGTAAGCTTTATCATCCACGATTGGTCACTTCTTTATGCTTGCGACAAGGTGTACAGCTCCTTGAGGATCAAATTAAGAGAGATGCCCCAATT
GTGGAAGAGAAGAATATTCGGCGTATTATCGCCCATGCATTACAAAGAAGGGAAGGTACTGGGATGTCTCCTACATCGGAGATCCGTCGTCTTCGAGAGGAGAACCAACA
GCTGCGAGATCAGGTTCGAGAAGTCGTGCAACATATCTACAACTTGAGGGCTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTC
ATCCATCTTCCAGTACTGACACTGATCCTAGTCCACAACCTCCGACTTCATGA
Protein sequenceShow/hide protein sequence
MKSRKIMTELGFDLTLGDVPDDWRETARDKEWRPLIQPIQCEALELVREFYAAVHPQSHIAIVRGKEIRFDATQINYTFNIKNIRDAVGNKILVTPTLEQLDEALECVGK
PSATWDLTTHGKARLKLEDVSLTAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLQGIDVNYGELINTSIHECAHRTRGKLYHPRLVTSLCLRQGVQLLEDQIKRDAPI
VEEKNIRRIIAHALQRREGTGMSPTSEIRRLREENQQLRDQVREVVQHIYNLRASLDFAVLPSWPPALAAILGHPSSSTDTDPSPQPPTS