; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g14170 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g14170
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDUF4219 domain-containing protein
Genome locationchr6:11042427..11044262
RNA-Seq ExpressionMoc06g14170
SyntenyMoc06g14170
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAD1825768.1 unnamed protein product [Ananas comosus var. bracteatus]5.0e-3646.19Show/hide
Query:  DYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNL----------------
        D WEA+EQDYE+APL +NPT+ QIK HKER T+KA A++CLYAAVSPAI +RIMT +SAK  W+FLK EY+GDERI+ M  LNL                
Subjt:  DYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNL----------------

Query:  -----------------------------------RRLIRQEGSIEGALKARMQQGESGRENKWKGKKTSGNSSSESAAKDVG-----------SACKHC
                                           RRL+R EGS EGAL+A+ QQ  + +  KWKGKK +G+S SE AA +             S C+HC
Subjt:  -----------------------------------RRLIRQEGSIEGALKARMQQGESGRENKWKGKKTSGNSSSESAAKDVG-----------SACKHC

Query:  GKHNHPHFRC
        GK NHPHFRC
Subjt:  GKHNHPHFRC

XP_016669911.2 uncharacterized protein LOC107889873 [Gossypium hirsutum]2.4e-3843.21Show/hide
Query:  CDYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNL---------------
        CDYWEAIE+DYE+ PL +NPTM QIK HKER  +KA AR+CLYA+VSPAIFNRIM    AKE W++ K+EY+GDERIK M VLNL               
Subjt:  CDYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNL---------------

Query:  --------------------------------------------------------------------RRLIRQEG----SIEGALKARMQQGESGRENK
                                                                            RRL+ QEG    SIEGALKA+MQQGE   E K
Subjt:  --------------------------------------------------------------------RRLIRQEG----SIEGALKARMQQGESGRENK

Query:  WKGKKTSGNSSSESAAKDVG-------SACKHCGKHNHPHFRC
        W GKK+  NS SE+ AK          S+CK+CGKHNHPHFRC
Subjt:  WKGKKTSGNSSSESAAKDVG-------SACKHCGKHNHPHFRC

XP_022148138.1 uncharacterized protein LOC111016891 [Momordica charantia]1.2e-5052.59Show/hide
Query:  CDYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNL---------------
        CDYWEAI++DYEIAP+ DNPTM +I THKERVT+K  A ACL AAVSPAIFNRIM +KSAKE WEFLK+EYEG+ERIKGM VLNL               
Subjt:  CDYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNL---------------

Query:  --------------------------------------------------------------------RRLIRQEGSIEGALKARMQQGESGRENKWKGK
                                                                            RRLI QEGS+EGALKARMQ GE GRE KWKGK
Subjt:  --------------------------------------------------------------------RRLIRQEGSIEGALKARMQQGESGRENKWKGK

Query:  KTSGNSSSESAAKDVGSACKHCGKHNHPHFRC
        K SG+SSSE A+KDVGSACKHCGKHNHPHFRC
Subjt:  KTSGNSSSESAAKDVGSACKHCGKHNHPHFRC

XP_022151329.1 uncharacterized protein LOC111019291 [Momordica charantia]3.7e-3951.74Show/hide
Query:  CDYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNL---------------
        CDYWEAIEQDYEIAPL DNPTM QIKTHK RVT+KA ARACLYAAVSP IFNRIM +KSAKE WEFLKSEYEGDER KGM VLNL               
Subjt:  CDYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNL---------------

Query:  --------------------------------------------------------------------RRLIRQEGSIEGALKARMQQGESGRENKWKGK
                                                                            RRL+RQEGSIEGALKARMQQGE GRE KWKGK
Subjt:  --------------------------------------------------------------------RRLIRQEGSIEGALKARMQQGESGRENKWKGK

Query:  K
        K
Subjt:  K

XP_038889190.1 uncharacterized protein LOC120079069 [Benincasa hispida]1.8e-4957.08Show/hide
Query:  LRNAAAGAPCDYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNL------
        +R  A    CDYWE IEQDYEIAPL DNPT+ QIKTHKERVT+K  AR CLY AVSPAIFNRIM +KS KE WEFLKSEYEGDERIKGM VLNL      
Subjt:  LRNAAAGAPCDYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNL------

Query:  ------------------------------------------------RRLIRQEGSIEGALKARMQQGESGRENKWKGKKTSGNSSSESAAKDVGSACK
                                                         RLIRQEGSIEGALKAR+ QGE  RE     KK SG+SSSESA KD GS CK
Subjt:  ------------------------------------------------RRLIRQEGSIEGALKARMQQGESGRENKWKGKKTSGNSSSESAAKDVGSACK

Query:  HCGKHNHPHFRC
        HCGK NHPHFRC
Subjt:  HCGKHNHPHFRC

TrEMBL top hitse value%identityAlignment
A0A1U8HV73 uncharacterized protein LOC1078898731.4e-3943.62Show/hide
Query:  CDYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNL---------------
        CDYWEAIE+DYE+ PL +NPTM QIK HKER  +KA AR+CLYA+VSPAIFNRIM    AKE W++ K+EY+GDERIK M VLNL               
Subjt:  CDYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNL---------------

Query:  --------------------------------------------------------------------RRLIRQEG----SIEGALKARMQQGESGRENK
                                                                            RRL+ QEG    SIEGALKA+MQQGE G E K
Subjt:  --------------------------------------------------------------------RRLIRQEG----SIEGALKARMQQGESGRENK

Query:  WKGKKTSGNSSSESAAKDVG-------SACKHCGKHNHPHFRC
        W GKK+  NS SE+ AK          S+CK+CGKHNHPHFRC
Subjt:  WKGKKTSGNSSSESAAKDVG-------SACKHCGKHNHPHFRC

A0A1U8PA84 uncharacterized protein LOC1079558829.1e-3641.56Show/hide
Query:  CDYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLN----------------
        CDYWE IE+DYE+ PL  NPTM QIK H ER T+K  AR+CLYA+VSPAIFNRIM   S KE W++LK +Y+GDERIK M VLN                
Subjt:  CDYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLN----------------

Query:  -------------------------------------------------------------------LRRLIRQEG----SIEGALKARMQQGESGRENK
                                                                           LRRL+RQ+G    SIEGALKA MQQGE G E K
Subjt:  -------------------------------------------------------------------LRRLIRQEG----SIEGALKARMQQGESGRENK

Query:  WKGKKTSGNSSSESAAK-------DVGSACKHCGKHNHPHFRC
        W GKK+  NS  E+ AK       +  S+CK+C K NHPHFRC
Subjt:  WKGKKTSGNSSSESAAK-------DVGSACKHCGKHNHPHFRC

A0A6J1D394 uncharacterized protein LOC1110168915.9e-5152.59Show/hide
Query:  CDYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNL---------------
        CDYWEAI++DYEIAP+ DNPTM +I THKERVT+K  A ACL AAVSPAIFNRIM +KSAKE WEFLK+EYEG+ERIKGM VLNL               
Subjt:  CDYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNL---------------

Query:  --------------------------------------------------------------------RRLIRQEGSIEGALKARMQQGESGRENKWKGK
                                                                            RRLI QEGS+EGALKARMQ GE GRE KWKGK
Subjt:  --------------------------------------------------------------------RRLIRQEGSIEGALKARMQQGESGRENKWKGK

Query:  KTSGNSSSESAAKDVGSACKHCGKHNHPHFRC
        K SG+SSSE A+KDVGSACKHCGKHNHPHFRC
Subjt:  KTSGNSSSESAAKDVGSACKHCGKHNHPHFRC

A0A6J1DCR7 uncharacterized protein LOC1110192911.8e-3951.74Show/hide
Query:  CDYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNL---------------
        CDYWEAIEQDYEIAPL DNPTM QIKTHK RVT+KA ARACLYAAVSP IFNRIM +KSAKE WEFLKSEYEGDER KGM VLNL               
Subjt:  CDYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNL---------------

Query:  --------------------------------------------------------------------RRLIRQEGSIEGALKARMQQGESGRENKWKGK
                                                                            RRL+RQEGSIEGALKARMQQGE GRE KWKGK
Subjt:  --------------------------------------------------------------------RRLIRQEGSIEGALKARMQQGESGRENKWKGK

Query:  K
        K
Subjt:  K

A0A6V7P513 Uncharacterized protein2.4e-3646.19Show/hide
Query:  DYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNL----------------
        D WEA+EQDYE+APL +NPT+ QIK HKER T+KA A++CLYAAVSPAI +RIMT +SAK  W+FLK EY+GDERI+ M  LNL                
Subjt:  DYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNL----------------

Query:  -----------------------------------RRLIRQEGSIEGALKARMQQGESGRENKWKGKKTSGNSSSESAAKDVG-----------SACKHC
                                           RRL+R EGS EGAL+A+ QQ  + +  KWKGKK +G+S SE AA +             S C+HC
Subjt:  -----------------------------------RRLIRQEGSIEGALKARMQQGESGRENKWKGKKTSGNSSSESAAKDVG-----------SACKHC

Query:  GKHNHPHFRC
        GK NHPHFRC
Subjt:  GKHNHPHFRC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGCATTACCACCAAGACCGACTTCGTATAGATGTGGGTCACCTAGATCTAGCTCGTCAAGGTGTGGGTAGCCTAAATCGAGCTTCGAACGGATGTGGGTCACCGCC
AAACGTCCTAAACACCGCTGTAACGTACAATTCTGCGTCGTTCCGCTGCTGTACCGTCGTTCGCTGTGCGCCGCCGCGAGGGGGAGGTCGCGCTGGTTCGTGGCAGATTG
TCGAGGCGCGCCGCCGTTGCTCCAGGCCGTCGGAACGTCGGGGAGTCGCGCTGCCGGTCCGCCATTGTAGCAGTCCGTCGTGCGCCGCTGCGAGGACGGCCGCGCCGCCG
CTGCACCTGTTACGAAACGCCGCTGCTGGAGCGCCTTGTGATTATTGGGAAGCAATTGAGCAAGATTATGAAATTGCTCCACTCCTTGATAATCCAACAATGCGTCAGAT
CAAGACTCACAAGGAGAGGGTCACCAAGAAGGCAATGGCTCGAGCTTGCCTATATGCAGCTGTGTCTCCCGCCATATTCAATAGAATTATGACCATGAAGTCCGCAAAGG
AGAGTTGGGAGTTCCTCAAAAGTGAGTATGAAGGCGATGAGAGGATTAAAGGCATGAATGTGTTAAACTTGAGGAGGTTGATCAGGCAAGAAGGAAGCATTGAGGGGGCA
CTGAAAGCTAGAATGCAGCAGGGAGAAAGTGGAAGAGAGAATAAGTGGAAAGGGAAGAAGACGAGTGGAAACAGTAGCTCAGAATCTGCTGCAAAGGATGTTGGTAGTGC
ATGCAAGCACTGTGGAAAGCACAATCATCCACACTTCAGATGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACGCATTACCACCAAGACCGACTTCGTATAGATGTGGGTCACCTAGATCTAGCTCGTCAAGGTGTGGGTAGCCTAAATCGAGCTTCGAACGGATGTGGGTCACCGCC
AAACGTCCTAAACACCGCTGTAACGTACAATTCTGCGTCGTTCCGCTGCTGTACCGTCGTTCGCTGTGCGCCGCCGCGAGGGGGAGGTCGCGCTGGTTCGTGGCAGATTG
TCGAGGCGCGCCGCCGTTGCTCCAGGCCGTCGGAACGTCGGGGAGTCGCGCTGCCGGTCCGCCATTGTAGCAGTCCGTCGTGCGCCGCTGCGAGGACGGCCGCGCCGCCG
CTGCACCTGTTACGAAACGCCGCTGCTGGAGCGCCTTGTGATTATTGGGAAGCAATTGAGCAAGATTATGAAATTGCTCCACTCCTTGATAATCCAACAATGCGTCAGAT
CAAGACTCACAAGGAGAGGGTCACCAAGAAGGCAATGGCTCGAGCTTGCCTATATGCAGCTGTGTCTCCCGCCATATTCAATAGAATTATGACCATGAAGTCCGCAAAGG
AGAGTTGGGAGTTCCTCAAAAGTGAGTATGAAGGCGATGAGAGGATTAAAGGCATGAATGTGTTAAACTTGAGGAGGTTGATCAGGCAAGAAGGAAGCATTGAGGGGGCA
CTGAAAGCTAGAATGCAGCAGGGAGAAAGTGGAAGAGAGAATAAGTGGAAAGGGAAGAAGACGAGTGGAAACAGTAGCTCAGAATCTGCTGCAAAGGATGTTGGTAGTGC
ATGCAAGCACTGTGGAAAGCACAATCATCCACACTTCAGATGCTGA
Protein sequenceShow/hide protein sequence
MTHYHQDRLRIDVGHLDLARQGVGSLNRASNGCGSPPNVLNTAVTYNSASFRCCTVVRCAPPRGGGRAGSWQIVEARRRCSRPSERRGVALPVRHCSSPSCAAARTAAPP
LHLLRNAAAGAPCDYWEAIEQDYEIAPLLDNPTMRQIKTHKERVTKKAMARACLYAAVSPAIFNRIMTMKSAKESWEFLKSEYEGDERIKGMNVLNLRRLIRQEGSIEGA
LKARMQQGESGRENKWKGKKTSGNSSSESAAKDVGSACKHCGKHNHPHFRC