CuGenDBv2

Moc04g20800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g20800
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC111022007
Genome location: chr4:15129809..15131920
RNA-Seq Expression: Moc04g20800
Synteny: Moc04g20800
Gene Ontology terms: NA
InterPro domains: NA
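The genome location field uses a `chromosome:start..end` layout. A minimal sketch of how that string could be split into its parts is shown here; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end


def span_length(location: str) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


if __name__ == "__main__":
    # Location string taken from the record above.
    print(parse_location("chr4:15129809..15131920"))
    print(span_length("chr4:15129809..15131920"))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` would need to change.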


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]    8.6e-50    38.44
Query:  KKTPAQITPGEISKLCTRAAIAWLAAKKKAKAGPSNKAKRPRVH--------------------------------------------GEGVDDEQEPVP
        +++ +  + G +S+  TR A+A LAA+K+A+AGPS KAK  RV                                               G D+ QEPVP
Subjt:  KKTPAQITPGEISKLCTRAAIAWLAAKKKAKAGPSNKAKRPRVH--------------------------------------------GEGVDDEQEPVP

Query:  EYVKRRLVENGWEKLFAPNACVGGLV-------------------GNEILVYPTDEQLDEARNLICRPHKSWTVSTTGKLSLKPLVINEQG---------
        EYV++R+VENGWE LFAP   V   +                   GNEILV+P+DEQ++EAR LICRPHK+WT+ST GKLSLKPL INEQ          
Subjt:  EYVKRRLVENGWEKLFAPNACVGGLV-------------------GNEILVYPTDEQLDEARNLICRPHKSWTVSTTGKLSLKPLVINEQG---------

Query:  ---------------------MMKGIEFNFGEIVRKEIQSCSEKMIGPLVFPELITELCLKVGIEADDEDIVTPKKPTTSIKRVWGYSIVHEEDSLTTAT
                             ++KG+EFNFGE++R EIQSCSEK+                 G+EA D ++V PKKP  S+++V GYSIV EEDS  TA 
Subjt:  ---------------------MMKGIEFNFGEIVRKEIQSCSEKMIGPLVFPELITELCLKVGIEADDEDIVTPKKPTTSIKRVWGYSIVHEEDSLTTAT

Query:  DSET------------------------------QELYGDGAPSFLYELEAELPSSSR-PT---DDDSLDDD
        D ET                              +++YGD APSF  EL A+LPSSSR PT   DD+S DD+
Subjt:  DSET------------------------------QELYGDGAPSFLYELEAELPSSSR-PT---DDDSLDDD

XP_022156786.1 uncharacterized protein LOC111023620 [Momordica charantia]    5.8e-14    44.36
Query:  EEEVCLAKVVKKAKKKKTPAQITPGEISKLCTRAAIAWLAAKKKAKAGPSNKAKR-----------------------------PRVHGE----------
        EE+VCL KVVKKA+ KK   +I PG  S+ CTRA IA LAA+K+A+AGP  KAKR                              RV  E          
Subjt:  EEEVCLAKVVKKAKKKKTPAQITPGEISKLCTRAAIAWLAAKKKAKAGPSNKAKR-----------------------------PRVHGE----------

Query:  -----GVDDEQEPVPEYVKRRLVENGWEKLFAP
             G D+ QEPVP+Y+KRRL+ENGWE LFAP
Subjt:  -----GVDDEQEPVPEYVKRRLVENGWEKLFAP

XP_022158483.1 uncharacterized protein LOC111024964 [Momordica charantia]    9.3e-04    61.9
Query:  GIEADDEDIVTPKKPTTSIKRVWGYSIVHEEDSLTTATDSET
        G++A+DED+VTPKK  TS++RV GY IV EEDS  T  D ET
Subjt:  GIEADDEDIVTPKKPTTSIKRVWGYSIVHEEDSLTTATDSET

XP_022158850.1 uncharacterized protein LOC111025315 isoform X1 [Momordica charantia]    5.4e-04    54.1
Query:  ENPEEEVCLAKVVKKAKKKKTPAQITPGEISKLCTRAAIAWLAAKKKAKAGPSNKAKRPRV
        E+ +EEVCLA  VK  KK+K      P  ISK CTRA +A LAA+K A  GPS KAKR ++
Subjt:  ENPEEEVCLAKVVKKAKKKKTPAQITPGEISKLCTRAAIAWLAAKKKAKAGPSNKAKRPRV

XP_022159289.1 uncharacterized protein LOC111025702 [Momordica charantia]    8.3e-29    47.67
Query:  KSWTVSTTGKLSLKPL-VINEQGMMKGIEFNFGEIVRKEIQSCSEKMIGPLVFPELITELCLKVGIEADDEDIVTPKKPTTSIKRVWGYSIVHEEDSLTT
        K+  +ST+   S++ + V+    +MKGIEFNF E++R EI  C+EKM+GPL+FP  I ELCLK G+EAD ED+V  KK  TSI+RV GY IV EEDS  T
Subjt:  KSWTVSTTGKLSLKPL-VINEQGMMKGIEFNFGEIVRKEIQSCSEKMIGPLVFPELITELCLKVGIEADDEDIVTPKKPTTSIKRVWGYSIVHEEDSLTT

Query:  ATDSET---------------------------------QELYGDGAPSFLYELEAELPSSSRPTDDDSLDD
        A D +T                                 ++LYGDGAPS   EL A+LPSSSRPT DDSL D
Subjt:  ATDSET---------------------------------QELYGDGAPSFLYELEAELPSSSRPTDDDSLDD

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007    4.1e-50    38.44
Query:  KKTPAQITPGEISKLCTRAAIAWLAAKKKAKAGPSNKAKRPRVH--------------------------------------------GEGVDDEQEPVP
        +++ +  + G +S+  TR A+A LAA+K+A+AGPS KAK  RV                                               G D+ QEPVP
Subjt:  KKTPAQITPGEISKLCTRAAIAWLAAKKKAKAGPSNKAKRPRVH--------------------------------------------GEGVDDEQEPVP

Query:  EYVKRRLVENGWEKLFAPNACVGGLV-------------------GNEILVYPTDEQLDEARNLICRPHKSWTVSTTGKLSLKPLVINEQG---------
        EYV++R+VENGWE LFAP   V   +                   GNEILV+P+DEQ++EAR LICRPHK+WT+ST GKLSLKPL INEQ          
Subjt:  EYVKRRLVENGWEKLFAPNACVGGLV-------------------GNEILVYPTDEQLDEARNLICRPHKSWTVSTTGKLSLKPLVINEQG---------

Query:  ---------------------MMKGIEFNFGEIVRKEIQSCSEKMIGPLVFPELITELCLKVGIEADDEDIVTPKKPTTSIKRVWGYSIVHEEDSLTTAT
                             ++KG+EFNFGE++R EIQSCSEK+                 G+EA D ++V PKKP  S+++V GYSIV EEDS  TA 
Subjt:  ---------------------MMKGIEFNFGEIVRKEIQSCSEKMIGPLVFPELITELCLKVGIEADDEDIVTPKKPTTSIKRVWGYSIVHEEDSLTTAT

Query:  DSET------------------------------QELYGDGAPSFLYELEAELPSSSR-PT---DDDSLDDD
        D ET                              +++YGD APSF  EL A+LPSSSR PT   DD+S DD+
Subjt:  DSET------------------------------QELYGDGAPSFLYELEAELPSSSR-PT---DDDSLDDD

A0A6J1DW11 uncharacterized protein LOC111023620    2.8e-14    44.36
Query:  EEEVCLAKVVKKAKKKKTPAQITPGEISKLCTRAAIAWLAAKKKAKAGPSNKAKR-----------------------------PRVHGE----------
        EE+VCL KVVKKA+ KK   +I PG  S+ CTRA IA LAA+K+A+AGP  KAKR                              RV  E          
Subjt:  EEEVCLAKVVKKAKKKKTPAQITPGEISKLCTRAAIAWLAAKKKAKAGPSNKAKR-----------------------------PRVHGE----------

Query:  -----GVDDEQEPVPEYVKRRLVENGWEKLFAP
             G D+ QEPVP+Y+KRRL+ENGWE LFAP
Subjt:  -----GVDDEQEPVPEYVKRRLVENGWEKLFAP

A0A6J1DW79 uncharacterized protein LOC111024964    4.5e-04    61.9
Query:  GIEADDEDIVTPKKPTTSIKRVWGYSIVHEEDSLTTATDSET
        G++A+DED+VTPKK  TS++RV GY IV EEDS  T  D ET
Subjt:  GIEADDEDIVTPKKPTTSIKRVWGYSIVHEEDSLTTATDSET

A0A6J1DX02 uncharacterized protein LOC111025315 isoform X1    2.6e-04    54.1
Query:  ENPEEEVCLAKVVKKAKKKKTPAQITPGEISKLCTRAAIAWLAAKKKAKAGPSNKAKRPRV
        E+ +EEVCLA  VK  KK+K      P  ISK CTRA +A LAA+K A  GPS KAKR ++
Subjt:  ENPEEEVCLAKVVKKAKKKKTPAQITPGEISKLCTRAAIAWLAAKKKAKAGPSNKAKRPRV

A0A6J1E204 uncharacterized protein LOC111025702    4.0e-29    47.67
Query:  KSWTVSTTGKLSLKPL-VINEQGMMKGIEFNFGEIVRKEIQSCSEKMIGPLVFPELITELCLKVGIEADDEDIVTPKKPTTSIKRVWGYSIVHEEDSLTT
        K+  +ST+   S++ + V+    +MKGIEFNF E++R EI  C+EKM+GPL+FP  I ELCLK G+EAD ED+V  KK  TSI+RV GY IV EEDS  T
Subjt:  KSWTVSTTGKLSLKPL-VINEQGMMKGIEFNFGEIVRKEIQSCSEKMIGPLVFPELITELCLKVGIEADDEDIVTPKKPTTSIKRVWGYSIVHEEDSLTT

Query:  ATDSET---------------------------------QELYGDGAPSFLYELEAELPSSSRPTDDDSLDD
        A D +T                                 ++LYGDGAPS   EL A+LPSSSRPT DDSL D
Subjt:  ATDSET---------------------------------QELYGDGAPSFLYELEAELPSSSRPTDDDSLDD

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAACAACCCGGTTATGGTCCAACCAATAACCAGAAGTCCCTCTCGGCCAATGACAGGGTGGGGCCCCTTGTTCAAGTCTCGGAGTCAGCATTTAAGGGAACACTCATC
TACTCCCCTAAAGTCGGGGAGGAGTGAATTCCATCTTGTGAAGTTATGTTCCCAGCTCCCCACTCGGTCTCGTCCCCAAAATGGTAAGAATGTTGAGTCGGCGACTCGGG
CCACTCTCACCCATACAGATCAAAGGACGAGCCCTCACGGGCAGGAGTCCATAACTCACCCAGGATTGAGAATGAGTTGTCTGGTCATCCTAAGAAATAGCAATCTATTA
GTTAACGGTGTTACATCTAAAGATTGCCTATTTCGCGAGAACCCAGAAGAAGAGGTATGTTTGGCCAAAGTGGTGAAGAAAGCAAAGAAAAAGAAAACTCCGGCCCAAAT
TACGCCTGGCGAAATTTCGAAACTGTGCACCCGAGCTGCTATAGCTTGGTTGGCCGCCAAAAAAAAAGCTAAGGCTGGTCCATCTAATAAAGCCAAGAGGCCTAGGGTAC
ATGGAGAGGGTGTTGATGATGAGCAAGAGCCCGTGCCAGAGTATGTCAAGCGACGACTTGTGGAGAATGGTTGGGAGAAGTTGTTTGCCCCAAACGCGTGTGTCGGAGGC
CTTGTAGGTAATGAGATTTTGGTGTATCCGACAGATGAACAATTGGACGAGGCGCGTAACCTCATCTGTAGGCCACACAAGTCATGGACCGTCTCAACAACGGGAAAGCT
TTCTTTAAAACCACTGGTTATCAATGAGCAAGGCATGATGAAGGGCATTGAGTTCAACTTTGGCGAAATCGTAAGGAAAGAGATTCAGAGTTGCTCGGAGAAAATGATAG
GCCCACTTGTTTTTCCTGAACTTATAACAGAGCTATGTTTGAAAGTAGGTATAGAGGCCGATGATGAGGATATAGTGACGCCTAAGAAGCCGACCACATCCATAAAGAGA
GTTTGGGGATACTCGATCGTTCATGAAGAGGATTCCCTCACTACCGCCACGGATTCTGAGACTCAAGAGCTTTATGGAGATGGTGCACCTTCTTTTCTATATGAGCTTGA
GGCCGAATTGCCTTCTTCTTCACGACCTACCGACGATGATTCTTTGGATGATGATTAG
mRNA sequence
ATGAACAACCCGGTTATGGTCCAACCAATAACCAGAAGTCCCTCTCGGCCAATGACAGGGTGGGGCCCCTTGTTCAAGTCTCGGAGTCAGCATTTAAGGGAACACTCATC
TACTCCCCTAAAGTCGGGGAGGAGTGAATTCCATCTTGTGAAGTTATGTTCCCAGCTCCCCACTCGGTCTCGTCCCCAAAATGGTAAGAATGTTGAGTCGGCGACTCGGG
CCACTCTCACCCATACAGATCAAAGGACGAGCCCTCACGGGCAGGAGTCCATAACTCACCCAGGATTGAGAATGAGTTGTCTGGTCATCCTAAGAAATAGCAATCTATTA
GTTAACGGTGTTACATCTAAAGATTGCCTATTTCGCGAGAACCCAGAAGAAGAGGTATGTTTGGCCAAAGTGGTGAAGAAAGCAAAGAAAAAGAAAACTCCGGCCCAAAT
TACGCCTGGCGAAATTTCGAAACTGTGCACCCGAGCTGCTATAGCTTGGTTGGCCGCCAAAAAAAAAGCTAAGGCTGGTCCATCTAATAAAGCCAAGAGGCCTAGGGTAC
ATGGAGAGGGTGTTGATGATGAGCAAGAGCCCGTGCCAGAGTATGTCAAGCGACGACTTGTGGAGAATGGTTGGGAGAAGTTGTTTGCCCCAAACGCGTGTGTCGGAGGC
CTTGTAGGTAATGAGATTTTGGTGTATCCGACAGATGAACAATTGGACGAGGCGCGTAACCTCATCTGTAGGCCACACAAGTCATGGACCGTCTCAACAACGGGAAAGCT
TTCTTTAAAACCACTGGTTATCAATGAGCAAGGCATGATGAAGGGCATTGAGTTCAACTTTGGCGAAATCGTAAGGAAAGAGATTCAGAGTTGCTCGGAGAAAATGATAG
GCCCACTTGTTTTTCCTGAACTTATAACAGAGCTATGTTTGAAAGTAGGTATAGAGGCCGATGATGAGGATATAGTGACGCCTAAGAAGCCGACCACATCCATAAAGAGA
GTTTGGGGATACTCGATCGTTCATGAAGAGGATTCCCTCACTACCGCCACGGATTCTGAGACTCAAGAGCTTTATGGAGATGGTGCACCTTCTTTTCTATATGAGCTTGA
GGCCGAATTGCCTTCTTCTTCACGACCTACCGACGATGATTCTTTGGATGATGATTAG
Protein sequence
MNNPVMVQPITRSPSRPMTGWGPLFKSRSQHLREHSSTPLKSGRSEFHLVKLCSQLPTRSRPQNGKNVESATRATLTHTDQRTSPHGQESITHPGLRMSCLVILRNSNLL
VNGVTSKDCLFRENPEEEVCLAKVVKKAKKKKTPAQITPGEISKLCTRAAIAWLAAKKKAKAGPSNKAKRPRVHGEGVDDEQEPVPEYVKRRLVENGWEKLFAPNACVGG
LVGNEILVYPTDEQLDEARNLICRPHKSWTVSTTGKLSLKPLVINEQGMMKGIEFNFGEIVRKEIQSCSEKMIGPLVFPELITELCLKVGIEADDEDIVTPKKPTTSIKR
VWGYSIVHEEDSLTTATDSETQELYGDGAPSFLYELEAELPSSSRPTDDDSLDDD
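
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code applies to this nuclear gene; the helper name `translate` is hypothetical and not part of CuGenDB. Only the first and last codons of the CDS are used as sample input.

```python
# Standard genetic code, built from the conventional TCAG ordering.
# ASSUMPTION: the standard code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First ten codons and last five codons of the CDS listed above.
    print(translate("ATGAACAACCCGGTTATGGTCCAACCAATA"))
    print(translate("TTGGATGATGATTAG"))
```

Passing the full CDS, with its line breaks intact, to `translate` and comparing the result with the protein sequence listed above would complete the check.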