; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS022520 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS022520
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionroot hair specific 4
Genome locationscaffold720:257614..258093
RNA-Seq ExpressionMS022520
SyntenyMS022520
Gene Ontology termsGO:0048564 - photosystem I assembly (biological process)
GO:0080183 - response to photooxidative stress (biological process)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domainsIPR040340 - Chloroplast enhancing stress tolerance protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052411.1 ycf3-interacting protein 1 [Cucumis melo var. makuwa]7.5e-3259.76Show/hide
Query:  GEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGA
        G + +C  GAL LL+ VL GF+ GKG    +EE+ +  +EG  CISISI SRRVSLEKFECGSWASSGMVVHE+  ESGSLYFDLP+ELIRNSVS    +
Subjt:  GEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGA

Query:  QSPVKAAFVFD------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL
        QSPV AAFVFD      K KLA  + A              +S  +ITPRLRKAR+EFNALLEAHT  L
Subjt:  QSPVKAAFVFD------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL

XP_011658372.1 uncharacterized protein LOC105435976 [Cucumis sativus]4.4e-3260.36Show/hide
Query:  GEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGA
        G + RC  GAL LL+ VL GF+ GKG    +EE+ +  +EG  CISISI SRRVSLEKFECGSWASSGMVVHED  ESGSLYFDLP+ELIRNSVS     
Subjt:  GEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGA

Query:  QSPVKAAFVF------DKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL
        QSPV AAFVF      +K KLA  + A              +S  +ITPRLRKAR+EFNALLEAHT  L
Subjt:  QSPVKAAFVF------DKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL

XP_022146694.1 uncharacterized protein LOC111015837 [Momordica charantia]1.1e-78100Show/hide
Query:  GEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSP
        GEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSP
Subjt:  GEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSP

Query:  VKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL
        VKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL
Subjt:  VKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL

XP_022978627.1 uncharacterized protein LOC111478548 [Cucurbita maxima]4.4e-3257.67Show/hide
Query:  GEKLRCGAGALWLLVAVLPGF----RKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEE---SGSLYFDLPIELIRNSVSV
        G+  R   GAL LL+ +L G      KGK ER+EE + G+EG GCISISIS  RVSLEKFECGSWASSGMV HED E     GSLYFDLP+ELIRNSV  
Subjt:  GEKLRCGAGALWLLVAVLPGF----RKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEE---SGSLYFDLPIELIRNSVSV

Query:  SVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAH
            QSP +AAFVFD+  +           +  A  +  +S  VITPRLR+AREEFNALLEAH
Subjt:  SVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAH

XP_023543164.1 uncharacterized protein LOC111803119 [Cucurbita pepo subsp. pepo]5.7e-3257.67Show/hide
Query:  GEKLRCGAGALWLLVAVLPGF----RKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEE---SGSLYFDLPIELIRNSVSV
        G+  R   GAL LL+ VL G      KGK ER+EE + G+EG GCISISIS  RVSLEKFECGSWASSGMV HED E     GSLYFDLP+ELIRNSV  
Subjt:  GEKLRCGAGALWLLVAVLPGF----RKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEE---SGSLYFDLPIELIRNSVSV

Query:  SVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAH
            QSP + AFVFD+  +           +  A  +  +S  VITPRLR+AREEFNALLEAH
Subjt:  SVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAH

TrEMBL top hitse value%identityAlignment
A0A0A0KME4 Uncharacterized protein2.1e-3260.36Show/hide
Query:  GEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGA
        G + RC  GAL LL+ VL GF+ GKG    +EE+ +  +EG  CISISI SRRVSLEKFECGSWASSGMVVHED  ESGSLYFDLP+ELIRNSVS     
Subjt:  GEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGA

Query:  QSPVKAAFVF------DKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL
        QSPV AAFVF      +K KLA  + A              +S  +ITPRLRKAR+EFNALLEAHT  L
Subjt:  QSPVKAAFVF------DKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL

A0A5A7UFW0 Ycf3-interacting protein 13.6e-3259.76Show/hide
Query:  GEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGA
        G + +C  GAL LL+ VL GF+ GKG    +EE+ +  +EG  CISISI SRRVSLEKFECGSWASSGMVVHE+  ESGSLYFDLP+ELIRNSVS    +
Subjt:  GEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGA

Query:  QSPVKAAFVFD------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL
        QSPV AAFVFD      K KLA  + A              +S  +ITPRLRKAR+EFNALLEAHT  L
Subjt:  QSPVKAAFVFD------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL

A0A6J1CYT9 uncharacterized protein LOC1110158375.1e-79100Show/hide
Query:  GEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSP
        GEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSP
Subjt:  GEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSP

Query:  VKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL
        VKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL
Subjt:  VKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL

A0A6J1EGQ7 uncharacterized protein LOC1114332241.1e-3157.06Show/hide
Query:  GEKLRCGAGALWLLVAVLPGF----RKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEE---SGSLYFDLPIELIRNSVSV
        G+  R   GAL LL+ VL G      KGK ER+EE + G+EG GCISISIS  RVSLEKFECGSWASSGMV HED E     GSLYFDLP+ELIRNSV  
Subjt:  GEKLRCGAGALWLLVAVLPGF----RKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEE---SGSLYFDLPIELIRNSVSV

Query:  SVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAH
            QSP + AFVF++  +           +  A  +  +S  VITPRLR+AREEFNALLEAH
Subjt:  SVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAH

A0A6J1ILL2 uncharacterized protein LOC1114785482.1e-3257.67Show/hide
Query:  GEKLRCGAGALWLLVAVLPGF----RKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEE---SGSLYFDLPIELIRNSVSV
        G+  R   GAL LL+ +L G      KGK ER+EE + G+EG GCISISIS  RVSLEKFECGSWASSGMV HED E     GSLYFDLP+ELIRNSV  
Subjt:  GEKLRCGAGALWLLVAVLPGF----RKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEE---SGSLYFDLPIELIRNSVSV

Query:  SVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAH
            QSP +AAFVFD+  +           +  A  +  +S  VITPRLR+AREEFNALLEAH
Subjt:  SVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30850.1 root hair specific 45.9e-1132.43Show/hide
Query:  EKLRCGAGALWLLVAVLPGFRKGK-----GEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIR-NSVSVSV
        E  +C A  L      LPGF K K      +R+   +     +   + S  S R SLEKFECGSWAS+  ++    +++G L+FD P+E+ + NS   + 
Subjt:  EKLRCGAGALWLLVAVLPGFRKGK-----GEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIR-NSVSVSV

Query:  G--AQSPVKAAFVFDKAKLAAAAAA-----------------------SSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEA
        G   Q PV + F+FD+     A  +                        S S  +++   PTS    ITPRLRKAR++FN  L A
Subjt:  G--AQSPVKAAFVFDKAKLAAAAAA-----------------------SSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEA

AT2G34910.1 BEST Arabidopsis thaliana protein match is: root hair specific 4 (TAIR:AT1G30850.1)1.9e-0933.33Show/hide
Query:  EKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDE---GSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQ
        E  +C A  L      LPGF K      +  D   +    +   S S  S   SLEKFECGSWAS+  +      E+G LY DLP+E+I+         Q
Subjt:  EKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDE---GSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQ

Query:  SPVKAAFVFDK-----AKLAAAAAASSVSVR------------------ASAPPTPTSSSIVITPRLRKAREEFNALLEA
         PV + F FDK     A  +    +SS+S R                   ++   P S    ITPRL KAR++FN  L A
Subjt:  SPVKAAFVFDK-----AKLAAAAAASSVSVR------------------ASAPPTPTSSSIVITPRLRKAREEFNALLEA

AT4G20190.1 unknown protein2.8e-1336.78Show/hide
Query:  LVAVLPGFRKGKGER-EEEGDGGDEGSGCISIS-------------ISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQS
        L   LPGF KGK  R   +GD     +  ++ S             + S R SLE+FECGSW SS M +++DN + G  +FDLP ELI+     +     
Subjt:  LVAVLPGFRKGKGER-EEEGDGGDEGSGCISIS-------------ISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQS

Query:  PVKAAFVFDK-----------AKLAAAAAASS------VSVRASAPPT-PTSSSIVITPRLRKAREEFNALLEA
        PV AAFVFDK            K + + +  S      V    S+P + PTS +  ITPRL +A E+F++ LEA
Subjt:  PVKAAFVFDK-----------AKLAAAAAASS------VSVRASAPPT-PTSSSIVITPRLRKAREEFNALLEA

AT5G44660.1 unknown protein6.5e-1032.47Show/hide
Query:  EKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGD----------EGSGCISIS-------------ISSRRVSLEKFECGSWASSGMVVHEDNEESGSL
        ++ +C A     L   LPGF KGK  R  + D               S  I++S             + S R S+EKF+CGS+ S         EE G+ 
Subjt:  EKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGD----------EGSGCISIS-------------ISSRRVSLEKFECGSWASSGMVVHEDNEESGSL

Query:  YFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAA-------------AAAASSVSVR----ASAPPTPTSSSIVITPRLRKAREEFNALLEA
        +FDLP ELI+ S S       PV AAFVFDK  +                 A  S S+R    +++ P    +S  I+PRL +A + FNA LEA
Subjt:  YFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAA-------------AAAASSVSVR----ASAPPTPTSSSIVITPRLRKAREEFNALLEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GGTGAGAAATTGAGATGTGGGGCAGGGGCACTATGGTTGTTGGTAGCAGTACTGCCGGGGTTTAGGAAGGGGAAGGGTGAGAGAGAGGAAGAGGGAGATGGAGGTGATGA
GGGAAGCGGGTGCATATCCATATCCATATCATCGAGGAGAGTTTCTCTGGAAAAATTCGAATGCGGTTCATGGGCTTCGTCGGGCATGGTGGTTCATGAGGACAATGAGG
AAAGTGGGAGCCTCTATTTTGATCTGCCAATAGAGTTGATAAGGAACAGCGTCAGCGTCAGCGTGGGCGCACAATCACCAGTAAAAGCCGCTTTTGTATTCGACAAAGCA
AAATTAGCTGCCGCCGCCGCCGCCTCGTCAGTATCAGTACGTGCATCTGCCCCGCCCACACCAACTTCATCTTCAATCGTCATTACCCCACGCTTGCGCAAAGCTAGGGA
AGAGTTCAATGCACTTCTGGAAGCGCATACTACTACTCTC
mRNA sequenceShow/hide mRNA sequence
GGTGAGAAATTGAGATGTGGGGCAGGGGCACTATGGTTGTTGGTAGCAGTACTGCCGGGGTTTAGGAAGGGGAAGGGTGAGAGAGAGGAAGAGGGAGATGGAGGTGATGA
GGGAAGCGGGTGCATATCCATATCCATATCATCGAGGAGAGTTTCTCTGGAAAAATTCGAATGCGGTTCATGGGCTTCGTCGGGCATGGTGGTTCATGAGGACAATGAGG
AAAGTGGGAGCCTCTATTTTGATCTGCCAATAGAGTTGATAAGGAACAGCGTCAGCGTCAGCGTGGGCGCACAATCACCAGTAAAAGCCGCTTTTGTATTCGACAAAGCA
AAATTAGCTGCCGCCGCCGCCGCCTCGTCAGTATCAGTACGTGCATCTGCCCCGCCCACACCAACTTCATCTTCAATCGTCATTACCCCACGCTTGCGCAAAGCTAGGGA
AGAGTTCAATGCACTTCTGGAAGCGCATACTACTACTCTC
Protein sequenceShow/hide protein sequence
GEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFDKA
KLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL