; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g02640 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g02640
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtein of unknown function (DUF1068)
Genome locationchr4:1674309..1675758
RNA-Seq ExpressionMoc04g02640
SyntenyMoc04g02640
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR010471 - Protein of unknown function DUF1068


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572284.1 hypothetical protein SDJN03_29012, partial [Cucurbita argyrosperma subsp. sororia]1.2e-8989.12Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQA+SC P LIKFGLA IAL IAGYI+GPPLYWHF EGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP+ELRNTTFGDC+K DPEVSQDTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA+WE RARQRGWKEG  +SR QKQ NIQTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

XP_022136333.1 uncharacterized protein LOC111008045 [Momordica charantia]1.2e-100100Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

XP_022952351.1 uncharacterized protein LOC111455059 isoform X1 [Cucurbita moschata]9.5e-9089.12Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQA+SC P+LIKFGLA IAL IAGYI+GPPLYWHF EGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP+ELRNTTFGDC+K DPEVSQDTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA+WE RARQRGWKEG  +SR QKQ NIQTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

XP_022969266.1 uncharacterized protein LOC111468321 [Cucurbita maxima]2.8e-8988.6Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQA+SC P+LIKFGLA IAL IAGYI+ PPLYWHF EGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP+ELRNTTFGDC+K DPEVSQDTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA+WE RARQRGWKEG  +SR QKQ NIQTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

XP_023554621.1 uncharacterized protein LOC111811814 isoform X1 [Cucurbita pepo subsp. pepo]1.0e-8888.08Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQA+SC P+LIKFGLA IAL IAGYI+GPPLYWHF EGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP+ELRNTTFGDC+K DPEVSQDTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA+WE RAR+RGWKEG  +SR QKQ NI TA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

TrEMBL top hitse value%identityAlignment
A0A1S3C135 uncharacterized protein LOC103495614 isoform X15.6e-8887.56Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQ TSC P+LIK GLA IA+ I GYI+GPPLYWHF EGLAVVS SS+SSSCPPCFCDCPSQPVISIPEELRN+TF DCVK DPEVS+DTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKEAEALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVL AQKRLTA WE RARQRGWKEG  +SR QKQGNIQTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

A0A6J1C7C2 uncharacterized protein LOC1110080455.8e-101100Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

A0A6J1ERF9 uncharacterized protein LOC111437126 isoform X11.1e-8889.64Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQA SC P+LIK GLA IAL IAGYI+GPPLYWH  EG AVVSRSSSSSSCPPCFCDCPSQPVISIPEELRN+TF DCVK DPEVSQDTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKEAEA E+ RRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA WE RARQRGWKEGA +SRIQKQGNIQTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

A0A6J1GK01 uncharacterized protein LOC111455059 isoform X14.6e-9089.12Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQA+SC P+LIKFGLA IAL IAGYI+GPPLYWHF EGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP+ELRNTTFGDC+K DPEVSQDTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA+WE RARQRGWKEG  +SR QKQ NIQTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

A0A6J1HXB7 uncharacterized protein LOC1114683211.3e-8988.6Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQA+SC P+LIKFGLA IAL IAGYI+ PPLYWHF EGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP+ELRNTTFGDC+K DPEVSQDTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA+WE RARQRGWKEG  +SR QKQ NIQTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05070.1 Protein of unknown function (DUF1068)4.4e-6162.3Show/hide
Query:  ALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLLEELKLKEAEA
        A +K GLA + L +AGYI+GPPLYWH  E LA V    S+SSCP C C+C +   ++IP+EL N +F DC K DPEV++DTEKN+A+LL EELKL+EAE+
Subjt:  ALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLLEELKLKEAEA

Query:  LENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        LE  +RADM LLEAKK+TS YQKEADKCNSGMETCEEAREKAE  LA QK+LT+ WE RARQ+GW+EG+ +  ++ + N+Q A
Subjt:  LENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

AT2G32580.1 Protein of unknown function (DUF1068)4.3e-5659.02Show/hide
Query:  ALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLLEELKLKEAEA
        A +K GLA +AL + GYI+GPPLYWH  E LAV     S++SC  C CDC S P+++IP  L N +F DC KRDPEV++DTEKN+A+LL EELK +EA +
Subjt:  ALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLLEELKLKEAEA

Query:  LENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        +E  +R D  LLEAKK+TS YQKEADKCNSGMETCEEAREKAE  L  QK+LT++WE RARQ+G+K+GA +S ++ +   + A
Subjt:  LENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

AT2G32580.2 Protein of unknown function (DUF1068)5.4e-3562.61Show/hide
Query:  DCVKRDPEVSQDTEKNFADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEG
        +C KRDPEV++DTEKN+A+LL EELK +EA ++E  +R D  LLEAKK+TS YQKEADKCNSGMETCEEAREKAE  L  QK+LT++WE RARQ+G+K+G
Subjt:  DCVKRDPEVSQDTEKNFADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEG

Query:  AGRSRIQKQGNIQTA
        A +S ++ +   + A
Subjt:  AGRSRIQKQGNIQTA

AT4G04360.1 Protein of unknown function (DUF1068)1.2e-4557.58Show/hide
Query:  IALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLLEELKLKEAEALENQRRADM
        + LCI  YI GP LYWH  E +A     S  SSCPPC CDC SQP++SIP+ L N +F DC++ + E S+++E +F +++ EELKL+EA+A E++ RAD 
Subjt:  IALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLLEELKLKEAEALENQRRADM

Query:  ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRI
         LL+AKK  SQYQKEADKC+ GMETCE AREKAEA L  Q+RL+ +WELRARQ GWKEG   S +
Subjt:  ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRI

AT4G30996.1 Protein of unknown function (DUF1068)1.9e-3548.45Show/hide
Query:  LATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQ-PVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLLEELKLKEAEALENQR
        L   A+  A  + GP LYW F +G   V  + ++S CPPC CDCP    ++ I   L N +  DC   DPE+ Q+ EK F DLL EELKL+EA A E+ R
Subjt:  LATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQ-PVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLLEELKLKEAEALENQR

Query:  RADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWK
          ++ L EAK++ SQYQKEA+KCN+  E CE ARE+AEA+L  ++++T++WE RARQ GW+
Subjt:  RADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATTCAGGCCACTTCTTGCACTCCGGCACTGATTAAGTTCGGATTGGCTACCATTGCCCTCTGTATCGCCGGCTACATTATCGGCCCTCCTCTCTATTGGCACTT
CGTCGAAGGTTTGGCCGTCGTTAGCCGCTCCTCCTCTTCTTCCTCCTGCCCTCCTTGTTTCTGCGACTGCCCTTCTCAGCCCGTCATCTCAATTCCCGAAGAATTGAGGA
ACACTACCTTTGGAGATTGTGTTAAGCGTGACCCAGAAGTGAGTCAAGACACCGAGAAGAACTTTGCAGACCTATTGTTAGAGGAGCTGAAGCTAAAGGAAGCCGAAGCG
TTGGAAAATCAGCGACGTGCTGATATGGCTCTACTCGAGGCGAAGAAGATGACATCTCAGTATCAAAAAGAGGCAGACAAGTGCAATTCCGGGATGGAAACATGTGAAGA
AGCGAGAGAGAAAGCCGAGGCAGTATTAGCTGCACAGAAGAGACTAACAGCAGTGTGGGAGCTAAGGGCTCGCCAAAGGGGATGGAAGGAAGGGGCTGGCAGGTCTCGTA
TCCAAAAGCAGGGAAATATTCAGACTGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAATTCAGGCCACTTCTTGCACTCCGGCACTGATTAAGTTCGGATTGGCTACCATTGCCCTCTGTATCGCCGGCTACATTATCGGCCCTCCTCTCTATTGGCACTT
CGTCGAAGGTTTGGCCGTCGTTAGCCGCTCCTCCTCTTCTTCCTCCTGCCCTCCTTGTTTCTGCGACTGCCCTTCTCAGCCCGTCATCTCAATTCCCGAAGAATTGAGGA
ACACTACCTTTGGAGATTGTGTTAAGCGTGACCCAGAAGTGAGTCAAGACACCGAGAAGAACTTTGCAGACCTATTGTTAGAGGAGCTGAAGCTAAAGGAAGCCGAAGCG
TTGGAAAATCAGCGACGTGCTGATATGGCTCTACTCGAGGCGAAGAAGATGACATCTCAGTATCAAAAAGAGGCAGACAAGTGCAATTCCGGGATGGAAACATGTGAAGA
AGCGAGAGAGAAAGCCGAGGCAGTATTAGCTGCACAGAAGAGACTAACAGCAGTGTGGGAGCTAAGGGCTCGCCAAAGGGGATGGAAGGAAGGGGCTGGCAGGTCTCGTA
TCCAAAAGCAGGGAAATATTCAGACTGCATAA
Protein sequenceShow/hide protein sequence
MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLLEELKLKEAEA
LENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA