; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g34610 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g34610
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionroot hair specific 4
Genome locationchr1:24503918..24504523
RNA-Seq ExpressionMoc01g34610
SyntenyMoc01g34610
Gene Ontology termsGO:0048564 - photosystem I assembly (biological process)
GO:0080183 - response to photooxidative stress (biological process)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domainsIPR040340 - Chloroplast enhancing stress tolerance protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052411.1 ycf3-interacting protein 1 [Cucumis melo var. makuwa]3.6e-3152.09Show/hide
Query:  LNRRKWCG-------------HKERCWRSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRV
        LNR K CG             +K   W  E G++ + + G      G + +C  GAL LL+ VL GF+ GKG    +EE+ +  +EG  CISISI SRRV
Subjt:  LNRRKWCG-------------HKERCWRSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRV

Query:  SLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFD------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKA
        SLEKFECGSWASSGMVVHE+  ESGSLYFDLP+ELIRNSVS    +QSPV AAFVFD      K KLA  + A              +S  +ITPRLRKA
Subjt:  SLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFD------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKA

Query:  REEFNALLEAHTTTL
        R+EFNALLEAHT  L
Subjt:  REEFNALLEAHTTTL

XP_011658372.1 uncharacterized protein LOC105435976 [Cucumis sativus]2.1e-3152.31Show/hide
Query:  LNRRKWCG-------------HKERCW-RSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRR
        LNR + CG             +K   W + +  + EEGK        G + RC  GAL LL+ VL GF+ GKG    +EE+ +  +EG  CISISI SRR
Subjt:  LNRRKWCG-------------HKERCW-RSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRR

Query:  VSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVF------DKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRK
        VSLEKFECGSWASSGMVVHED  ESGSLYFDLP+ELIRNSVS     QSPV AAFVF      +K KLA  + A              +S  +ITPRLRK
Subjt:  VSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVF------DKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRK

Query:  AREEFNALLEAHTTTL
        AR+EFNALLEAHT  L
Subjt:  AREEFNALLEAHTTTL

XP_022146694.1 uncharacterized protein LOC111015837 [Momordica charantia]1.1e-104100Show/hide
Query:  MQSQCEGHLNRRKWCGHKERCWRSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECG
        MQSQCEGHLNRRKWCGHKERCWRSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECG
Subjt:  MQSQCEGHLNRRKWCGHKERCWRSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECG

Query:  SWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTT
        SWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTT
Subjt:  SWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTT

Query:  L
        L
Subjt:  L

XP_022978627.1 uncharacterized protein LOC111478548 [Cucurbita maxima]2.7e-3150Show/hide
Query:  LNRRKWCG-------------HKERCW-RSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKG--EREEEGDGGDEGSGCISISISSRRV
        LNR K CG             ++   W +    + EEGK          + RCGA  L L +    GF+ GKG  ER+EE + G+EG GCISISIS  RV
Subjt:  LNRRKWCG-------------HKERCW-RSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKG--EREEEGDGGDEGSGCISISISSRRV

Query:  SLEKFECGSWASSGMVVHEDNEE---SGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREE
        SLEKFECGSWASSGMV HED E     GSLYFDLP+ELIRNSV      QSP +AAFVFD+  +           +  A  +  +S  VITPRLR+AREE
Subjt:  SLEKFECGSWASSGMVVHEDNEE---SGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREE

Query:  FNALLEAH
        FNALLEAH
Subjt:  FNALLEAH

XP_023543164.1 uncharacterized protein LOC111803119 [Cucurbita pepo subsp. pepo]4.7e-3151.71Show/hide
Query:  LNRRKWCGH-----------KERCWRSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKG--EREEEGDGGDEGSGCISISISSRRVSLE
        LNR K CG            + R    E G + + +  GKA R     RCGA  L L V    GF+ GKG  ER+EE + G+EG GCISISIS  RVSLE
Subjt:  LNRRKWCGH-----------KERCWRSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKG--EREEEGDGGDEGSGCISISISSRRVSLE

Query:  KFECGSWASSGMVVHEDNEE---SGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNA
        KFECGSWASSGMV HED E     GSLYFDLP+ELIRNSV      QSP + AFVFD+  +           +  A  +  +S  VITPRLR+AREEFNA
Subjt:  KFECGSWASSGMVVHEDNEE---SGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNA

Query:  LLEAH
        LLEAH
Subjt:  LLEAH

TrEMBL top hitse value%identityAlignment
A0A0A0KME4 Uncharacterized protein1.0e-3152.31Show/hide
Query:  LNRRKWCG-------------HKERCW-RSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRR
        LNR + CG             +K   W + +  + EEGK        G + RC  GAL LL+ VL GF+ GKG    +EE+ +  +EG  CISISI SRR
Subjt:  LNRRKWCG-------------HKERCW-RSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRR

Query:  VSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVF------DKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRK
        VSLEKFECGSWASSGMVVHED  ESGSLYFDLP+ELIRNSVS     QSPV AAFVF      +K KLA  + A              +S  +ITPRLRK
Subjt:  VSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVF------DKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRK

Query:  AREEFNALLEAHTTTL
        AR+EFNALLEAHT  L
Subjt:  AREEFNALLEAHTTTL

A0A1S3AZD3 uncharacterized protein LOC1034842326.6e-3151.39Show/hide
Query:  LNRRKWCG-------------HKERCWRSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRV
        LNR K CG             +K   W  E G++ + + G      G + +C  GAL LL+ VL GF+ GKG    +EE+ +  +EG  CISISI SRRV
Subjt:  LNRRKWCG-------------HKERCWRSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRV

Query:  SLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFD-------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRK
        SL+KFECGSWASSGMVVHE+  ESGSLYFDLP+ELIRNSVS    +QSPV AAFVFD       K KLA  + A              +S  +ITPRLRK
Subjt:  SLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFD-------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRK

Query:  AREEFNALLEAHTTTL
        AR+EFNALLEAHT  L
Subjt:  AREEFNALLEAHTTTL

A0A5A7UFW0 Ycf3-interacting protein 11.7e-3152.09Show/hide
Query:  LNRRKWCG-------------HKERCWRSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRV
        LNR K CG             +K   W  E G++ + + G      G + +C  GAL LL+ VL GF+ GKG    +EE+ +  +EG  CISISI SRRV
Subjt:  LNRRKWCG-------------HKERCWRSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKGE---REEEGDGGDEGSGCISISISSRRV

Query:  SLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFD------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKA
        SLEKFECGSWASSGMVVHE+  ESGSLYFDLP+ELIRNSVS    +QSPV AAFVFD      K KLA  + A              +S  +ITPRLRKA
Subjt:  SLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFD------KAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKA

Query:  REEFNALLEAHTTTL
        R+EFNALLEAHT  L
Subjt:  REEFNALLEAHTTTL

A0A6J1CYT9 uncharacterized protein LOC1110158375.2e-105100Show/hide
Query:  MQSQCEGHLNRRKWCGHKERCWRSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECG
        MQSQCEGHLNRRKWCGHKERCWRSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECG
Subjt:  MQSQCEGHLNRRKWCGHKERCWRSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECG

Query:  SWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTT
        SWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTT
Subjt:  SWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTT

Query:  L
        L
Subjt:  L

A0A6J1ILL2 uncharacterized protein LOC1114785481.3e-3150Show/hide
Query:  LNRRKWCG-------------HKERCW-RSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKG--EREEEGDGGDEGSGCISISISSRRV
        LNR K CG             ++   W +    + EEGK          + RCGA  L L +    GF+ GKG  ER+EE + G+EG GCISISIS  RV
Subjt:  LNRRKWCG-------------HKERCW-RSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKG--EREEEGDGGDEGSGCISISISSRRV

Query:  SLEKFECGSWASSGMVVHEDNEE---SGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREE
        SLEKFECGSWASSGMV HED E     GSLYFDLP+ELIRNSV      QSP +AAFVFD+  +           +  A  +  +S  VITPRLR+AREE
Subjt:  SLEKFECGSWASSGMVVHEDNEE---SGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREE

Query:  FNALLEAH
        FNALLEAH
Subjt:  FNALLEAH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30850.1 root hair specific 41.7e-1032.43Show/hide
Query:  EKLRCGAGALWLLVAVLPGFRKGK-----GEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIR-NSVSVSV
        E  +C A  L      LPGF K K      +R+   +     +   + S  S R SLEKFECGSWAS+  ++    +++G L+FD P+E+ + NS   + 
Subjt:  EKLRCGAGALWLLVAVLPGFRKGK-----GEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIR-NSVSVSV

Query:  G--AQSPVKAAFVFDKAKLAAAAAA-----------------------SSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEA
        G   Q PV + F+FD+     A  +                        S S  +++   PTS    ITPRLRKAR++FN  L A
Subjt:  G--AQSPVKAAFVFDKAKLAAAAAA-----------------------SSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEA

AT2G34910.1 BEST Arabidopsis thaliana protein match is: root hair specific 4 (TAIR:AT1G30850.1)5.3e-0933.33Show/hide
Query:  EKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDE---GSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQ
        E  +C A  L      LPGF K      +  D   +    +   S S  S   SLEKFECGSWAS+  +      E+G LY DLP+E+I+         Q
Subjt:  EKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDE---GSGCISISISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQ

Query:  SPVKAAFVFDK-----AKLAAAAAASSVSVR------------------ASAPPTPTSSSIVITPRLRKAREEFNALLEA
         PV + F FDK     A  +    +SS+S R                   ++   P S    ITPRL KAR++FN  L A
Subjt:  SPVKAAFVFDK-----AKLAAAAAASSVSVR------------------ASAPPTPTSSSIVITPRLRKAREEFNALLEA

AT4G20190.1 unknown protein3.6e-1336.78Show/hide
Query:  LVAVLPGFRKGKGER-EEEGDGGDEGSGCISIS-------------ISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQS
        L   LPGF KGK  R   +GD     +  ++ S             + S R SLE+FECGSW SS M +++DN + G  +FDLP ELI+     +     
Subjt:  LVAVLPGFRKGKGER-EEEGDGGDEGSGCISIS-------------ISSRRVSLEKFECGSWASSGMVVHEDNEESGSLYFDLPIELIRNSVSVSVGAQS

Query:  PVKAAFVFDK-----------AKLAAAAAASS------VSVRASAPPT-PTSSSIVITPRLRKAREEFNALLEA
        PV AAFVFDK            K + + +  S      V    S+P + PTS +  ITPRL +A E+F++ LEA
Subjt:  PVKAAFVFDK-----------AKLAAAAAASS------VSVRASAPPT-PTSSSIVITPRLRKAREEFNALLEA

AT5G44660.1 unknown protein1.1e-0932.47Show/hide
Query:  EKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGD----------EGSGCISIS-------------ISSRRVSLEKFECGSWASSGMVVHEDNEESGSL
        ++ +C A     L   LPGF KGK  R  + D               S  I++S             + S R S+EKF+CGS+ S         EE G+ 
Subjt:  EKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGD----------EGSGCISIS-------------ISSRRVSLEKFECGSWASSGMVVHEDNEESGSL

Query:  YFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAA-------------AAAASSVSVR----ASAPPTPTSSSIVITPRLRKAREEFNALLEA
        +FDLP ELI+ S S       PV AAFVFDK  +                 A  S S+R    +++ P    +S  I+PRL +A + FNA LEA
Subjt:  YFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAA-------------AAAASSVSVR----ASAPPTPTSSSIVITPRLRKAREEFNALLEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGTCGCAGTGTGAAGGGCATTTGAATAGGAGGAAATGGTGTGGGCACAAGGAGAGGTGTTGGAGAAGCGAAAATGGGGAGAGAGAGGAAGGTAAAAGGGGAGGGAA
AGCGGGGCGTTGTGGTGAGAAATTGAGATGTGGGGCAGGGGCACTATGGTTGTTGGTAGCAGTACTGCCGGGGTTTAGGAAGGGGAAGGGTGAGAGAGAGGAAGAGGGAG
ATGGAGGTGATGAGGGAAGCGGGTGCATATCCATATCCATATCATCGAGGAGAGTTTCTCTGGAAAAATTCGAATGCGGTTCATGGGCTTCGTCGGGCATGGTGGTTCAT
GAGGACAATGAGGAAAGTGGGAGCCTCTATTTTGATCTGCCAATAGAGTTGATAAGGAACAGCGTCAGCGTCAGCGTGGGCGCACAATCACCAGTAAAAGCCGCATTTGT
ATTCGACAAAGCAAAATTAGCTGCCGCCGCCGCCGCCTCGTCAGTATCAGTACGTGCATCTGCCCCGCCCACACCAACTTCATCTTCAATCGTCATTACCCCACGCTTGC
GCAAAGCTAGGGAAGAGTTCAATGCACTTCTGGAAGCGCATACTACTACTCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGTCGCAGTGTGAAGGGCATTTGAATAGGAGGAAATGGTGTGGGCACAAGGAGAGGTGTTGGAGAAGCGAAAATGGGGAGAGAGAGGAAGGTAAAAGGGGAGGGAA
AGCGGGGCGTTGTGGTGAGAAATTGAGATGTGGGGCAGGGGCACTATGGTTGTTGGTAGCAGTACTGCCGGGGTTTAGGAAGGGGAAGGGTGAGAGAGAGGAAGAGGGAG
ATGGAGGTGATGAGGGAAGCGGGTGCATATCCATATCCATATCATCGAGGAGAGTTTCTCTGGAAAAATTCGAATGCGGTTCATGGGCTTCGTCGGGCATGGTGGTTCAT
GAGGACAATGAGGAAAGTGGGAGCCTCTATTTTGATCTGCCAATAGAGTTGATAAGGAACAGCGTCAGCGTCAGCGTGGGCGCACAATCACCAGTAAAAGCCGCATTTGT
ATTCGACAAAGCAAAATTAGCTGCCGCCGCCGCCGCCTCGTCAGTATCAGTACGTGCATCTGCCCCGCCCACACCAACTTCATCTTCAATCGTCATTACCCCACGCTTGC
GCAAAGCTAGGGAAGAGTTCAATGCACTTCTGGAAGCGCATACTACTACTCTCTGA
Protein sequenceShow/hide protein sequence
MQSQCEGHLNRRKWCGHKERCWRSENGEREEGKRGGKAGRCGEKLRCGAGALWLLVAVLPGFRKGKGEREEEGDGGDEGSGCISISISSRRVSLEKFECGSWASSGMVVH
EDNEESGSLYFDLPIELIRNSVSVSVGAQSPVKAAFVFDKAKLAAAAAASSVSVRASAPPTPTSSSIVITPRLRKAREEFNALLEAHTTTL