CuGenDBv2

MS002679 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS002679
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Nuclear transcription factor Y subunit B-3
Genome location: scaffold928:247852..248583
RNA-Seq Expression: MS002679
Synteny: MS002679
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG6607751.1 hypothetical protein SDJN03_01093, partial [Cucurbita argyrosperma subsp. sororia]; e value: 5.4e-50; % identity: 59.83
Query:  SRLNLALASTFLASIFFLTLRFSPSFLTL--LLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSSSNGGRLGSGDFDSNRSSALN
        SRLN+ + ST LASI FL+LRFSPSFL+L  LLLIPS  FL  K    S+S++ S  NP LI                  S  G L    FD      + 
Subjt:  SRLNLALASTFLASIFFLTLRFSPSFLTL--LLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSSSNGGRLGSGDFDSNRSSALN

Query:  PPLIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSSEDDDSLIEIALLPNSNAIRADSMNGLGGVRNLETCLPDLLPDSVV
           +   Q  EELSSD D+SSSNGG    GDF+SDLWMCLDELARNLP ++DSSSEDDDSLIEI LLP S+AIR DS   L  VRNLETCLPDLLPDSV+
Subjt:  PPLIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSSEDDDSLIEIALLPNSNAIRADSMNGLGGVRNLETCLPDLLPDSVV

Query:  RQRGFVELLEEIDEEENLIEIDISRGLNR
        +Q GFVELLEEI+EE+NLIEIDISRG NR
Subjt:  RQRGFVELLEEIDEEENLIEIDISRGLNR

XP_022941469.1 uncharacterized protein LOC111446758 [Cucurbita moschata]; e value: 2.7e-49; % identity: 59.39
Query:  SRLNLALASTFLASIFFLTLRFSPSFLTL--LLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSSSNGGRLGSGDFDSNRSSALN
        SRLN+ + ST LASI FL+LRFSPSFL+L  LLLIPS  FL  K    S+S++ S  NP LI                  S  G L    FD      + 
Subjt:  SRLNLALASTFLASIFFLTLRFSPSFLTL--LLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSSSNGGRLGSGDFDSNRSSALN

Query:  PPLIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSSEDDDSLIEIALLPNSNAIRADSMNGLGGVRNLETCLPDLLPDSVV
           +   Q  EELSSD D+SSSNGG    GDF+SDLWMCLDEL RNLP ++DSSSEDDDSLIEI LLP S+AIR DS   L  VRNLETCLPDLLPDSV+
Subjt:  PPLIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSSEDDDSLIEIALLPNSNAIRADSMNGLGGVRNLETCLPDLLPDSVV

Query:  RQRGFVELLEEIDEEENLIEIDISRGLNR
        +Q GFVELLEEI+EE+NLIEIDISRG NR
Subjt:  RQRGFVELLEEIDEEENLIEIDISRGLNR

XP_022981598.1 uncharacterized protein LOC111480668 [Cucurbita maxima]; e value: 2.4e-50; % identity: 60.09
Query:  SRLNLALASTFLASIFFLTLRFSPSFLT-LLLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSSSNGGRLGSGDFDSNRSSALNP
        SRLN+ + ST LASI FL+LRFSPSFL+ LLLLIPS  FL  K    S+S++ S  NP LI                  S  G L    FD      +  
Subjt:  SRLNLALASTFLASIFFLTLRFSPSFLT-LLLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSSSNGGRLGSGDFDSNRSSALNP

Query:  PLIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSSEDDDSLIEIALLPNSNAIRADSMNGLGGVRNLETCLPDLLPDSVVR
          +   Q  EELSSD D+SSSNGG    GDF+SDLWMCLDELARNLP ++DSSSEDDDSLIEI LLP S+AIR DS   L  VRNLETCLPDLLPDSV++
Subjt:  PLIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSSEDDDSLIEIALLPNSNAIRADSMNGLGGVRNLETCLPDLLPDSVVR

Query:  QRGFVELLEEIDEEENLIEIDISRGLNR
        Q GFVELLEEI+EE+NLIEIDISRG NR
Subjt:  QRGFVELLEEIDEEENLIEIDISRGLNR

XP_023524758.1 uncharacterized protein LOC111788599 isoform X1 [Cucurbita pepo subsp. pepo]; e value: 1.2e-49; % identity: 59.83
Query:  SRLNLALASTFLASIFFLTLRFSPSFLT--LLLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSSSNGGRLGSGDFDSNRSSALN
        SRLN+ + ST LASI FL+LRFSPSFLT  LLLLIPS  FL  K    S+S++ S  NP LI                  S  G L    FD      + 
Subjt:  SRLNLALASTFLASIFFLTLRFSPSFLT--LLLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSSSNGGRLGSGDFDSNRSSALN

Query:  PPLIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSSEDDDSLIEIALLPNSNAIRADSMNGLGGVRNLETCLPDLLPDSVV
           +   Q  EELSSD D+SSSNGG    GDF+SDLWMCLDELA NLP ++DSSSEDDDSLIEI LLP S+AIR DS   L  VRNLETCLPDLLPDSV+
Subjt:  PPLIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSSEDDDSLIEIALLPNSNAIRADSMNGLGGVRNLETCLPDLLPDSVV

Query:  RQRGFVELLEEIDEEENLIEIDISRGLNR
        +Q GFVELLEEI+EE+NLIEIDISRG NR
Subjt:  RQRGFVELLEEIDEEENLIEIDISRGLNR

XP_023524760.1 uncharacterized protein LOC111788599 isoform X2 [Cucurbita pepo subsp. pepo]; e value: 1.2e-49; % identity: 59.83
Query:  SRLNLALASTFLASIFFLTLRFSPSFLT--LLLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSSSNGGRLGSGDFDSNRSSALN
        SRLN+ + ST LASI FL+LRFSPSFLT  LLLLIPS  FL  K    S+S++ S  NP LI                  S  G L    FD      + 
Subjt:  SRLNLALASTFLASIFFLTLRFSPSFLT--LLLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSSSNGGRLGSGDFDSNRSSALN

Query:  PPLIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSSEDDDSLIEIALLPNSNAIRADSMNGLGGVRNLETCLPDLLPDSVV
           +   Q  EELSSD D+SSSNGG    GDF+SDLWMCLDELA NLP ++DSSSEDDDSLIEI LLP S+AIR DS   L  VRNLETCLPDLLPDSV+
Subjt:  PPLIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSSEDDDSLIEIALLPNSNAIRADSMNGLGGVRNLETCLPDLLPDSVV

Query:  RQRGFVELLEEIDEEENLIEIDISRGLNR
        +Q GFVELLEEI+EE+NLIEIDISRG NR
Subjt:  RQRGFVELLEEIDEEENLIEIDISRGLNR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K6W9 Uncharacterized protein; e value: 6.5e-41; % identity: 50.4
Query:  MENFTHRPSS----------SRLNLALASTFLASIFFLTLRFSPSFLT--LLLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSS
        ME FT   SS          SR NL + ST LASIFFLTLRFSPSF T  LLLLIP+ LF+  K  +            PL   L      ++   ES++
Subjt:  MENFTHRPSS----------SRLNLALASTFLASIFFLTLRFSPSFLT--LLLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSS

Query:  SNGGRLGSGDFDSNRSSALNPPLIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSS---EDDDSLIEIALLPNSNAIRADS
          G  + S D  S                          SSSN GG GSGDF SDLWM LDEL RNLP +ED SS   +DDDSLIEI LLPNSN      
Subjt:  SNGGRLGSGDFDSNRSSALNPPLIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSS---EDDDSLIEIALLPNSNAIRADS

Query:  MNGLGGVRNLETCLPDLLPDSVVRQRGFVELLEEIDEEENLIEIDISRGLNR
             G+RNLETCLP+LLPDSV+RQ GFVELLEEI+EE+NLIEIDIS+G NR
Subjt:  MNGLGGVRNLETCLPDLLPDSVVRQRGFVELLEEIDEEENLIEIDISRGLNR

A0A2I4E0W2 uncharacterized protein LOC108985259; e value: 1.6e-07; % identity: 31.08
Query:  LASIFFLTLRFSPSFLTLLLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSSSNGGRLGSGDFDSNRSSALNPPLIGVLQPSEEL
        L   +FL+  F P  +T+ +L+ STL +    +     + +S  +    G+    + L           G +L      S ++ A+    +G++    ++
Subjt:  LASIFFLTLRFSPSFLTLLLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSSSNGGRLGSGDFDSNRSSALNPPLIGVLQPSEEL

Query:  SSDGD-----ESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDS-SSEDDDSLIEIALLPNSNAIRADSMNGLGGVRNLETCLPDLLPDSVVRQRGFVE
         S  D     +S S+      G FD DL +  + + ++   ++ S S EDDDSLIEI+ LP S   ++  + G      L++ LP LLPDSV  Q+G +E
Subjt:  SSDGD-----ESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDS-SSEDDDSLIEIALLPNSNAIRADSMNGLGGVRNLETCLPDLLPDSVVRQRGFVE

Query:  LLEEI---DEEENLIEIDISRG
        LL EI   +EEENLIEID+S G
Subjt:  LLEEI---DEEENLIEIDISRG

A0A5D3C7W2 Uncharacterized protein; e value: 6.5e-41; % identity: 50.98
Query:  MENFTHRPSS----------SRLNLALASTFLASIFFL-TLRFSPSFLT--LLLLIPSTLFL-TKKSKS-DSNSNRSSGHNPPLIGVLRPSEELSSDGDE
        ME FT   SS          SR NL + ST LASIFFL TLRFSPSF T  LLLL+P+TLFL  +KS S D +      H  P+       E  +  G+ 
Subjt:  MENFTHRPSS----------SRLNLALASTFLASIFFL-TLRFSPSFLT--LLLLIPSTLFL-TKKSKS-DSNSNRSSGHNPPLIGVLRPSEELSSDGDE

Query:  SSSSNGGRLGSGDFDSNRSSALNPPLIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSS---EDDDSLIEIALLPNSNAIR
        SSS                              +E S     +SSN GG GSGDF SDLWM LDEL RNLP +ED SS   +DDDSLIEI LLPNSN   
Subjt:  SSSSNGGRLGSGDFDSNRSSALNPPLIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSS---EDDDSLIEIALLPNSNAIR

Query:  ADSMNGLGGVRNLETCLPDLLPDSVVRQRGFVELLEEIDEEENLIEIDISRGLNR
                G+RNLETCLP+LLPDSV+RQ GFVELLEEI+EE+NLIEIDIS+G NR
Subjt:  ADSMNGLGGVRNLETCLPDLLPDSVVRQRGFVELLEEIDEEENLIEIDISRGLNR

A0A6J1FNF5 uncharacterized protein LOC111446758; e value: 1.3e-49; % identity: 59.39
Query:  SRLNLALASTFLASIFFLTLRFSPSFLTL--LLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSSSNGGRLGSGDFDSNRSSALN
        SRLN+ + ST LASI FL+LRFSPSFL+L  LLLIPS  FL  K    S+S++ S  NP LI                  S  G L    FD      + 
Subjt:  SRLNLALASTFLASIFFLTLRFSPSFLTL--LLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSSSNGGRLGSGDFDSNRSSALN

Query:  PPLIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSSEDDDSLIEIALLPNSNAIRADSMNGLGGVRNLETCLPDLLPDSVV
           +   Q  EELSSD D+SSSNGG    GDF+SDLWMCLDEL RNLP ++DSSSEDDDSLIEI LLP S+AIR DS   L  VRNLETCLPDLLPDSV+
Subjt:  PPLIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSSEDDDSLIEIALLPNSNAIRADSMNGLGGVRNLETCLPDLLPDSVV

Query:  RQRGFVELLEEIDEEENLIEIDISRGLNR
        +Q GFVELLEEI+EE+NLIEIDISRG NR
Subjt:  RQRGFVELLEEIDEEENLIEIDISRGLNR

A0A6J1J2I9 uncharacterized protein LOC111480668; e value: 1.2e-50; % identity: 60.09
Query:  SRLNLALASTFLASIFFLTLRFSPSFLT-LLLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSSSNGGRLGSGDFDSNRSSALNP
        SRLN+ + ST LASI FL+LRFSPSFL+ LLLLIPS  FL  K    S+S++ S  NP LI                  S  G L    FD      +  
Subjt:  SRLNLALASTFLASIFFLTLRFSPSFLT-LLLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSSSNGGRLGSGDFDSNRSSALNP

Query:  PLIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSSEDDDSLIEIALLPNSNAIRADSMNGLGGVRNLETCLPDLLPDSVVR
          +   Q  EELSSD D+SSSNGG    GDF+SDLWMCLDELARNLP ++DSSSEDDDSLIEI LLP S+AIR DS   L  VRNLETCLPDLLPDSV++
Subjt:  PLIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSSEDDDSLIEIALLPNSNAIRADSMNGLGGVRNLETCLPDLLPDSVVR

Query:  QRGFVELLEEIDEEENLIEIDISRGLNR
        Q GFVELLEEI+EE+NLIEIDISRG NR
Subjt:  QRGFVELLEEIDEEENLIEIDISRGLNR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGAGAATTTCACTCACCGCCCATCATCGTCTCGCCTCAATCTCGCCCTCGCTTCAACTTTTCTCGCCTCCATTTTCTTCCTCACTCTCCGATTCTCCCCTTCATTCCT
CACGCTTCTTCTTCTCATCCCTTCCACTCTCTTCCTCACCAAGAAATCCAAATCCGATTCCAATTCCAATCGATCTTCCGGCCACAATCCGCCTCTAATCGGAGTTCTCC
GGCCGAGCGAAGAGCTTTCTTCCGACGGCGATGAGAGTAGTAGCAGTAATGGCGGCCGATTAGGAAGTGGAGACTTCGATTCCAATCGATCTTCCGCCCTGAATCCGCCT
CTAATCGGAGTTCTCCAGCCGAGCGAAGAGCTTTCTTCCGACGGCGACGAGAGTAGCAGCAATGGCGGCGGATTCGGAAGTGGAGACTTCGATTCCGATCTCTGGATGTG
TCTGGACGAACTGGCGCGGAATCTGCCGTGCGCGGAGGATTCGAGTTCGGAAGACGACGATAGCTTGATCGAAATCGCTCTTCTTCCGAACTCGAACGCGATTCGAGCGG
ATTCGATGAACGGATTAGGAGGAGTTCGAAATCTGGAGACGTGTTTGCCTGATTTGTTGCCGGATTCTGTTGTGAGACAGCGAGGGTTTGTGGAGCTGTTGGAAGAGATC
GATGAGGAAGAGAACTTGATTGAGATCGATATTTCCAGAGGTTTGAATCGAGATTTTTCACTGTTTGCAGTC
mRNA sequence
ATGGAGAATTTCACTCACCGCCCATCATCGTCTCGCCTCAATCTCGCCCTCGCTTCAACTTTTCTCGCCTCCATTTTCTTCCTCACTCTCCGATTCTCCCCTTCATTCCT
CACGCTTCTTCTTCTCATCCCTTCCACTCTCTTCCTCACCAAGAAATCCAAATCCGATTCCAATTCCAATCGATCTTCCGGCCACAATCCGCCTCTAATCGGAGTTCTCC
GGCCGAGCGAAGAGCTTTCTTCCGACGGCGATGAGAGTAGTAGCAGTAATGGCGGCCGATTAGGAAGTGGAGACTTCGATTCCAATCGATCTTCCGCCCTGAATCCGCCT
CTAATCGGAGTTCTCCAGCCGAGCGAAGAGCTTTCTTCCGACGGCGACGAGAGTAGCAGCAATGGCGGCGGATTCGGAAGTGGAGACTTCGATTCCGATCTCTGGATGTG
TCTGGACGAACTGGCGCGGAATCTGCCGTGCGCGGAGGATTCGAGTTCGGAAGACGACGATAGCTTGATCGAAATCGCTCTTCTTCCGAACTCGAACGCGATTCGAGCGG
ATTCGATGAACGGATTAGGAGGAGTTCGAAATCTGGAGACGTGTTTGCCTGATTTGTTGCCGGATTCTGTTGTGAGACAGCGAGGGTTTGTGGAGCTGTTGGAAGAGATC
GATGAGGAAGAGAACTTGATTGAGATCGATATTTCCAGAGGTTTGAATCGAGATTTTTCACTGTTTGCAGTC
Protein sequence
MENFTHRPSSSRLNLALASTFLASIFFLTLRFSPSFLTLLLLIPSTLFLTKKSKSDSNSNRSSGHNPPLIGVLRPSEELSSDGDESSSSNGGRLGSGDFDSNRSSALNPP
LIGVLQPSEELSSDGDESSSNGGGFGSGDFDSDLWMCLDELARNLPCAEDSSSEDDDSLIEIALLPNSNAIRADSMNGLGGVRNLETCLPDLLPDSVVRQRGFVELLEEI
DEEENLIEIDISRGLNRDFSLFAV