; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC01g1675 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC01g1675
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionroot hair specific 4
Genome locationMC01:20903665..20904144
RNA-Seq ExpressionMC01g1675
SyntenyMC01g1675
Gene Ontology termsGO:0048564 - photosystem I assembly (biological process)
GO:0080183 - response to photooxidative stress (biological process)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domainsIPR040340 - Chloroplast enhancing stress tolerance protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011658372.1 uncharacterized protein LOC105435976 [Cucumis sativus]2.73e-4160.36Show/hide
Query:  GEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGA
        G + RC  GAL LL+ VL GF+ GKG    +EE+ +  +EG  CISISI SRRVSLEKFECGSWASSGMVVHED E SGSLYFDLP+ELIRNSVS     
Subjt:  GEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGA

Query:  QSPVKAAFVFD------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL
        QSPV AAFVF+      K KLA  + A+S                +ITPRLRKAR+EFNALLEAHT  L
Subjt:  QSPVKAAFVFD------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL

XP_022146694.1 uncharacterized protein LOC111015837 [Momordica charantia]1.96e-102100Show/hide
Query:  GEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSP
        GEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSP
Subjt:  GEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSP

Query:  VKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL
        VKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL
Subjt:  VKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL

XP_022925963.1 uncharacterized protein LOC111433224 [Cucurbita moschata]1.39e-4056.25Show/hide
Query:  GEKLRCGAGALWLLVAVLPG--FRKGKG--EREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEES---GSLYFDLPIELIRNSVSV
        G+  R   GAL LL+ VL G  F+ GKG  ER+EE + G+EG GCISISIS  RVSLEKFECGSWASSGMV HED E     GSLYFDLP+ELIRNSV  
Subjt:  GEKLRCGAGALWLLVAVLPG--FRKGKG--EREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEES---GSLYFDLPIELIRNSVSV

Query:  SVGAQSPVKAAFVFD-------------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAH
            QSP + AFVF+             KAKLA  + A+S                VITPRLR+AREEFNALLEAH
Subjt:  SVGAQSPVKAAFVFD-------------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAH

XP_022978627.1 uncharacterized protein LOC111478548 [Cucurbita maxima]1.75e-4156.82Show/hide
Query:  GEKLRCGAGALWLLVAVLPG--FRKGKG--EREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEES---GSLYFDLPIELIRNSVSV
        G+  R   GAL LL+ +L G  F+ GKG  ER+EE + G+EG GCISISIS  RVSLEKFECGSWASSGMV HED E     GSLYFDLP+ELIRNSV  
Subjt:  GEKLRCGAGALWLLVAVLPG--FRKGKG--EREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEES---GSLYFDLPIELIRNSVSV

Query:  SVGAQSPVKAAFVFD-------------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAH
            QSP +AAFVFD             KAKLA  + A+S                VITPRLR+AREEFNALLEAH
Subjt:  SVGAQSPVKAAFVFD-------------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAH

XP_023543164.1 uncharacterized protein LOC111803119 [Cucurbita pepo subsp. pepo]2.47e-4156.82Show/hide
Query:  GEKLRCGAGALWLLVAVLPG--FRKGKG--EREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEES---GSLYFDLPIELIRNSVSV
        G+  R   GAL LL+ VL G  F+ GKG  ER+EE + G+EG GCISISIS  RVSLEKFECGSWASSGMV HED E     GSLYFDLP+ELIRNSV  
Subjt:  GEKLRCGAGALWLLVAVLPG--FRKGKG--EREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEES---GSLYFDLPIELIRNSVSV

Query:  SVGAQSPVKAAFVFD-------------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAH
            QSP + AFVFD             KAKLA  + A+S                VITPRLR+AREEFNALLEAH
Subjt:  SVGAQSPVKAAFVFD-------------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAH

TrEMBL top hitse value%identityAlignment
A0A0A0KME4 Uncharacterized protein1.32e-4160.36Show/hide
Query:  GEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGA
        G + RC  GAL LL+ VL GF+ GKG    +EE+ +  +EG  CISISI SRRVSLEKFECGSWASSGMVVHED E SGSLYFDLP+ELIRNSVS     
Subjt:  GEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGA

Query:  QSPVKAAFVFD------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL
        QSPV AAFVF+      K KLA  + A+S                +ITPRLRKAR+EFNALLEAHT  L
Subjt:  QSPVKAAFVFD------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL

A0A1S3AZD3 uncharacterized protein LOC1034842322.24e-4058.82Show/hide
Query:  GEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGA
        G + +CGA  L LL+ VL GF+ GKG    +EE+ +  +EG  CISISI SRRVSL+KFECGSWASSGMVVHE+ E SGSLYFDLP+ELIRNSVS    +
Subjt:  GEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGA

Query:  QSPVKAAFVFD-------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL
        QSPV AAFVFD       K KLA  + A+S                +ITPRLRKAR+EFNALLEAHT  L
Subjt:  QSPVKAAFVFD-------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL

A0A6J1CYT9 uncharacterized protein LOC1110158379.48e-103100Show/hide
Query:  GEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSP
        GEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSP
Subjt:  GEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSP

Query:  VKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL
        VKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL
Subjt:  VKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL

A0A6J1EGQ7 uncharacterized protein LOC1114332246.74e-4156.25Show/hide
Query:  GEKLRCGAGALWLLVAVLPG--FRKGKG--EREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEES---GSLYFDLPIELIRNSVSV
        G+  R   GAL LL+ VL G  F+ GKG  ER+EE + G+EG GCISISIS  RVSLEKFECGSWASSGMV HED E     GSLYFDLP+ELIRNSV  
Subjt:  GEKLRCGAGALWLLVAVLPG--FRKGKG--EREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEES---GSLYFDLPIELIRNSVSV

Query:  SVGAQSPVKAAFVFD-------------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAH
            QSP + AFVF+             KAKLA  + A+S                VITPRLR+AREEFNALLEAH
Subjt:  SVGAQSPVKAAFVFD-------------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAH

A0A6J1ILL2 uncharacterized protein LOC1114785488.46e-4256.82Show/hide
Query:  GEKLRCGAGALWLLVAVLPG--FRKGKG--EREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEES---GSLYFDLPIELIRNSVSV
        G+  R   GAL LL+ +L G  F+ GKG  ER+EE + G+EG GCISISIS  RVSLEKFECGSWASSGMV HED E     GSLYFDLP+ELIRNSV  
Subjt:  GEKLRCGAGALWLLVAVLPG--FRKGKG--EREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEES---GSLYFDLPIELIRNSVSV

Query:  SVGAQSPVKAAFVFD-------------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAH
            QSP +AAFVFD             KAKLA  + A+S                VITPRLR+AREEFNALLEAH
Subjt:  SVGAQSPVKAAFVFD-------------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30850.1 root hair specific 45.9e-1132.43Show/hide
Query:  EKLRCGAGALWLLVAVLPGFRKGK-----GEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIR-NSVSVSV
        E  +C A  L      LPGF K K      +R+   +     +   + S  S R SLEKFECGSWAS+  ++    +++G L+FD P+E+ + NS   + 
Subjt:  EKLRCGAGALWLLVAVLPGFRKGK-----GEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIR-NSVSVSV

Query:  G--AQSPVKAAFVFDKAKLAAAAAA-----------------------SSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEA
        G   Q PV + F+FD+     A  +                        S S  +++   PTS    ITPRLRKAR++FN  L A
Subjt:  G--AQSPVKAAFVFDKAKLAAAAAA-----------------------SSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEA

AT2G34910.1 BEST Arabidopsis thaliana protein match is: root hair specific 4 (TAIR:AT1G30850.1)1.9e-0933.33Show/hide
Query:  EKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDE---GSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQ
        E  +C A  L      LPGF K      +  D   +    +   S S  S   SLEKFECGSWAS+  +      E+G LY DLP+E+I+         Q
Subjt:  EKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDE---GSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQ

Query:  SPVKAAFVFDK-----AKLAAAAAASSVSVR------------------ASAPPTPTSSSIVITPRLRKAREEFNALLEA
         PV + F FDK     A  +    +SS+S R                   ++   P S    ITPRL KAR++FN  L A
Subjt:  SPVKAAFVFDK-----AKLAAAAAASSVSVR------------------ASAPPTPTSSSIVITPRLRKAREEFNALLEA

AT4G20190.1 unknown protein2.8e-1336.78Show/hide
Query:  LVAVLPGFRKGKGER-EEEGDGGDEGSGCISIS-------------ISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQS
        L   LPGF KGK  R   +GD     +  ++ S             + S R SLE+FECGSW SS M +++DN + G  +FDLP ELI+     +     
Subjt:  LVAVLPGFRKGKGER-EEEGDGGDEGSGCISIS-------------ISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQS

Query:  PVKAAFVFDK-----------AKLAAAAAASS------VSVRASAPPT-PTSSSIVITPRLRKAREEFNALLEA
        PV AAFVFDK            K + + +  S      V    S+P + PTS +  ITPRL +A E+F++ LEA
Subjt:  PVKAAFVFDK-----------AKLAAAAAASS------VSVRASAPPT-PTSSSIVITPRLRKAREEFNALLEA

AT5G44660.1 unknown protein6.5e-1032.47Show/hide
Query:  EKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGD----------EGSGCISIS-------------ISSRRVSLEKFECGSWASSGMVVHEDNEESGSL
        ++ +C A     L   LPGF KGK  R  + D               S  I++S             + S R S+EKF+CGS+ S         EE G+ 
Subjt:  EKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGD----------EGSGCISIS-------------ISSRRVSLEKFECGSWASSGMVVHEDNEESGSL

Query:  YFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAA-------------AAAASSVSVR----ASAPPTPTSSSIVITPRLRKAREEFNALLEA
        +FDLP ELI+ S S       PV AAFVFDK  +                 A  S S+R    +++ P    +S  I+PRL +A + FNA LEA
Subjt:  YFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAA-------------AAAASSVSVR----ASAPPTPTSSSIVITPRLRKAREEFNALLEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GGTGAGAAATTGAGATGTGGGGCAGGGGCACTATGGTTGTTGGTAGCAGTACTGCCGGGGTTTAGGAAGGGGAAGGGTGAGAGAGAGGAAGAGGGAGATGGAGGTGATGA
GGGAAGCGGGTGCATATCCATATCCATATCATCGAGGAGAGTTTCTCTGGAAAAATTCGAATGCGGTTCATGGGCTTCGTCGGGCATGGTGGTTCATGAGGACAATGAGG
AAAGTGGGAGCCTCTATTTTGATCTGCCAATAGAGTTGATAAGGAACAGCGTCAGCGTCAGCGTGGGCGCACAATCACCAGTAAAAGCCGCATTTGTATTCGACAAAGCA
AAATTAGCTGCCGCCGCCGCCGCCTCGTCAGTATCAGTACGTGCATCTGCCCCGCCCACACCAACTTCATCTTCAATCGTCATTACCCCACGCTTGCGCAAAGCTAGGGA
AGAGTTCAATGCACTTCTGGAAGCGCATACTACTACTCTC
mRNA sequenceShow/hide mRNA sequence
GGTGAGAAATTGAGATGTGGGGCAGGGGCACTATGGTTGTTGGTAGCAGTACTGCCGGGGTTTAGGAAGGGGAAGGGTGAGAGAGAGGAAGAGGGAGATGGAGGTGATGA
GGGAAGCGGGTGCATATCCATATCCATATCATCGAGGAGAGTTTCTCTGGAAAAATTCGAATGCGGTTCATGGGCTTCGTCGGGCATGGTGGTTCATGAGGACAATGAGG
AAAGTGGGAGCCTCTATTTTGATCTGCCAATAGAGTTGATAAGGAACAGCGTCAGCGTCAGCGTGGGCGCACAATCACCAGTAAAAGCCGCATTTGTATTCGACAAAGCA
AAATTAGCTGCCGCCGCCGCCGCCTCGTCAGTATCAGTACGTGCATCTGCCCCGCCCACACCAACTTCATCTTCAATCGTCATTACCCCACGCTTGCGCAAAGCTAGGGA
AGAGTTCAATGCACTTCTGGAAGCGCATACTACTACTCTC
Protein sequenceShow/hide protein sequence
GEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFDKA
KLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL