CuGenDBv2

MS019111 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS019111
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: agamous-like MADS-box protein AGL14
Genome location: scaffold20:1566512..1566979
RNA-Seq Expression: MS019111
Synteny: MS019111
Gene Ontology terms:
  GO:0045944 - positive regulation of transcription by RNA polymerase II (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
  GO:0000987 - proximal promoter sequence-specific DNA binding (molecular function)
  GO:0046983 - protein dimerization activity (molecular function)
InterPro domains:
  IPR002100 - Transcription factor, MADS-box
  IPR036879 - Transcription factor, MADS-box superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_022143271.1 uncharacterized protein LOC111013182 isoform X1 [Momordica charantia], e value 2.6e-66, 99.21% identity
Query:  MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIKYPSWDER
        MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIK PSWDER
Subjt:  MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIKYPSWDER

Query:  LECLSGHQLRLVMEDLGSKSEVAEGMI
        LECLSGHQLRLVMEDLGSKSEVAEGMI
Subjt:  LECLSGHQLRLVMEDLGSKSEVAEGMI

XP_022143272.1 uncharacterized protein LOC111013182 isoform X2 [Momordica charantia], e value 2.6e-66, 99.21% identity
Query:  MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIKYPSWDER
        MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIK PSWDER
Subjt:  MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIKYPSWDER

Query:  LECLSGHQLRLVMEDLGSKSEVAEGMI
        LECLSGHQLRLVMEDLGSKSEVAEGMI
Subjt:  LECLSGHQLRLVMEDLGSKSEVAEGMI

XP_022143273.1 uncharacterized protein LOC111013182 isoform X3 [Momordica charantia], e value 2.6e-66, 99.21% identity
Query:  MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIKYPSWDER
        MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIK PSWDER
Subjt:  MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIKYPSWDER

Query:  LECLSGHQLRLVMEDLGSKSEVAEGMI
        LECLSGHQLRLVMEDLGSKSEVAEGMI
Subjt:  LECLSGHQLRLVMEDLGSKSEVAEGMI

XP_022143274.1 uncharacterized protein LOC111013182 isoform X4 [Momordica charantia], e value 2.6e-66, 99.21% identity
Query:  MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIKYPSWDER
        MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIK PSWDER
Subjt:  MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIKYPSWDER

Query:  LECLSGHQLRLVMEDLGSKSEVAEGMI
        LECLSGHQLRLVMEDLGSKSEVAEGMI
Subjt:  LECLSGHQLRLVMEDLGSKSEVAEGMI

XP_022143403.1 uncharacterized protein LOC111013282 [Momordica charantia], e value 5.2e-54, 75.33% identity
Query:  LLLSRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKK
        L L   ++ + RTA FRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGD S VESHTWP NREEIEGIIRAYKT+Y +KP MKSF LY +FS RMKK
Subjt:  LLLSRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKK

Query:  IEAERAKFRNDAEDIKYPSWDERLECLSGHQLRLVMEDLGSKSEVAEGMI
        IE ERAKF   +E+IKY  WDERL CLS  QLRL M++LGSK E AEGMI
Subjt:  IEAERAKFRNDAEDIKYPSWDERLECLSGHQLRLVMEDLGSKSEVAEGMI

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CNU9 uncharacterized protein LOC111013182 isoform X4, e value 1.3e-66, 99.21% identity
Query:  MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIKYPSWDER
        MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIK PSWDER
Subjt:  MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIKYPSWDER

Query:  LECLSGHQLRLVMEDLGSKSEVAEGMI
        LECLSGHQLRLVMEDLGSKSEVAEGMI
Subjt:  LECLSGHQLRLVMEDLGSKSEVAEGMI

A0A6J1CPS5 uncharacterized protein LOC111013182 isoform X1, e value 1.3e-66, 99.21% identity
Query:  MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIKYPSWDER
        MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIK PSWDER
Subjt:  MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIKYPSWDER

Query:  LECLSGHQLRLVMEDLGSKSEVAEGMI
        LECLSGHQLRLVMEDLGSKSEVAEGMI
Subjt:  LECLSGHQLRLVMEDLGSKSEVAEGMI

A0A6J1CQB2 uncharacterized protein LOC111013182 isoform X3, e value 1.3e-66, 99.21% identity
Query:  MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIKYPSWDER
        MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIK PSWDER
Subjt:  MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIKYPSWDER

Query:  LECLSGHQLRLVMEDLGSKSEVAEGMI
        LECLSGHQLRLVMEDLGSKSEVAEGMI
Subjt:  LECLSGHQLRLVMEDLGSKSEVAEGMI

A0A6J1CQB5 uncharacterized protein LOC111013182 isoform X2, e value 1.3e-66, 99.21% identity
Query:  MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIKYPSWDER
        MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIK PSWDER
Subjt:  MKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRNDAEDIKYPSWDER

Query:  LECLSGHQLRLVMEDLGSKSEVAEGMI
        LECLSGHQLRLVMEDLGSKSEVAEGMI
Subjt:  LECLSGHQLRLVMEDLGSKSEVAEGMI

A0A6J1CQM2 uncharacterized protein LOC111013282, e value 2.5e-54, 75.33% identity
Query:  LLLSRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKK
        L L   ++ + RTA FRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGD S VESHTWP NREEIEGIIRAYKT+Y +KP MKSF LY +FS RMKK
Subjt:  LLLSRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKK

Query:  IEAERAKFRNDAEDIKYPSWDERLECLSGHQLRLVMEDLGSKSEVAEGMI
        IE ERAKF   +E+IKY  WDERL CLS  QLRL M++LGSK E AEGMI
Subjt:  IEAERAKFRNDAEDIKYPSWDERLECLSGHQLRLVMEDLGSKSEVAEGMI

SwissProt top hits (e value, % identity, alignment)
Q9FIM0 Agamous-like MADS-box protein AGL82, e value 2.8e-10, 34.27% identity
Query:  LSRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIE
        L R A  + R   ++KR  SL KKA E STLC V TC++++GP    D   S  E   WP +  ++  IIR YK +       K  ++  + +D  K  E
Subjt:  LSRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIE

Query:  A---ERAKFRNDAEDIKYPSWDERLECLSGHQLRLVMEDLGSK
            +R K  N     KY SW+E+L+  S  QL  +   + SK
Subjt:  A---ERAKFRNDAEDIKYPSWDERLECLSGHQLRLVMEDLGSK

Q9FIX0 Agamous-like MADS-box protein AGL81, e value 3.8e-07, 30.97% identity
Query:  SRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEA
        S ++++   + +   R +++ KKA EL TLCD+  CV+ +GP    DG     E  TWP  RE++E I   Y          KS  LY + + +  K   
Subjt:  SRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEA

Query:  ERAKFRNDAEDIK
        E+     D +D+K
Subjt:  ERAKFRNDAEDIK

Q9FJK3 Agamous-like MADS-box protein AGL80, e value 3.2e-06, 28.36% identity
Query:  RTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRND
        R A F+KR K LMKK  ELSTLC +  C +I+ P               WP N   ++ ++  ++T        K  D  G+   R+ K      + R D
Subjt:  RTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAERAKFRND

Query:  AEDIKYPSWDERLECLSGH----QLRLV-MEDLG
        + +++    +   +CL G+     L +V + DLG
Subjt:  AEDIKYPSWDERLECLSGH----QLRLV-MEDLG

Q9FLL0 Agamous-like MADS-box protein AGL75, e value 2.7e-05, 27.87% identity
Query:  SRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKK-IE
        S ++++     +   R +++ KKA EL TLCD+  CV+ +GP    DG     E  TWP  +E++  I   Y          KS +L+G+ + +  K ++
Subjt:  SRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKK-IE

Query:  AERAKFRNDAEDI---KYPSWD
            K +   +++   KYP  D
Subjt:  AERAKFRNDAEDI---KYPSWD

Q9LSB2 Agamous-like MADS-box protein AGL103, e value 9.0e-09, 31.76% identity
Query:  KTRNLLLSRTAAF-RKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYF-
        K +       +AF   R  +  KR +++ KKA ELS LCD+  CV+ +G         S  E  TWP  RE+++ I R Y      K    S DL+ +  
Subjt:  KTRNLLLSRTAAF-RKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYF-

Query:  ---SDRMKKIEAERAKFRNDAEDIKYPSWDERLECLSGHQLRLVMEDL
            D  +K E ++ K R     +KYP WD R +  S  QL  +++ L
Subjt:  ---SDRMKKIEAERAKFRNDAEDIKYPSWDERLECLSGHQLRLVMEDL

Arabidopsis top hits (e value, % identity, alignment)
AT3G18650.1 AGAMOUS-like 103, e value 6.4e-10, 31.76% identity
Query:  KTRNLLLSRTAAF-RKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYF-
        K +       +AF   R  +  KR +++ KKA ELS LCD+  CV+ +G         S  E  TWP  RE+++ I R Y      K    S DL+ +  
Subjt:  KTRNLLLSRTAAF-RKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYF-

Query:  ---SDRMKKIEAERAKFRNDAEDIKYPSWDERLECLSGHQLRLVMEDL
            D  +K E ++ K R     +KYP WD R +  S  QL  +++ L
Subjt:  ---SDRMKKIEAERAKFRNDAEDIKYPSWDERLECLSGHQLRLVMEDL

AT5G39750.1 AGAMOUS-like 81, e value 2.7e-08, 30.97% identity
Query:  SRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEA
        S ++++   + +   R +++ KKA EL TLCD+  CV+ +GP    DG     E  TWP  RE++E I   Y          KS  LY + + +  K   
Subjt:  SRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEA

Query:  ERAKFRNDAEDIK
        E+     D +D+K
Subjt:  ERAKFRNDAEDIK

AT5G40220.1 AGAMOUS-like 43, e value 7.8e-08, 29.69% identity
Query:  RNLLLSRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDR-
        R+ L S ++A+   + +   R +++ KKA EL TLCD+  CV+ +GP    DG     E  TWP  RE++  I   Y          KS +L+G+ + + 
Subjt:  RNLLLSRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDR-

Query:  ----MKKIEAERAKFRNDAEDIKYPSWD
            +K  + +R         +KYP  D
Subjt:  ----MKKIEAERAKFRNDAEDIKYPSWD

AT5G55690.1 MADS-box transcription factor family protein, e value 3.7e-10, 27.78% identity
Query:  LSRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIE
        ++R    + R   ++KR   L KKA E STLC V TCV+++GP+  + GD   +E   WP +  ++  I+  Y+ +       K++ +        + +E
Subjt:  LSRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIE

Query:  AERAKFRNDAEDIKYPSWDERLECLSGHQLRLVMEDLGSKSEVA
            K        KYP+WD++L+  S + L  V   + +K + A
Subjt:  AERAKFRNDAEDIKYPSWDERLECLSGHQLRLVMEDLGSKSEVA

AT5G58890.1 AGAMOUS-like 82, e value 2.0e-11, 34.27% identity
Query:  LSRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIE
        L R A  + R   ++KR  SL KKA E STLC V TC++++GP    D   S  E   WP +  ++  IIR YK +       K  ++  + +D  K  E
Subjt:  LSRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIE

Query:  A---ERAKFRNDAEDIKYPSWDERLECLSGHQLRLVMEDLGSK
            +R K  N     KY SW+E+L+  S  QL  +   + SK
Subjt:  A---ERAKFRNDAEDIKYPSWDERLECLSGHQLRLVMEDLGSK


Sequences
CDS sequence
CGCGCCAAGACAAGAAACCTATTGCTCTCCCGCACCGCAGCATTCCGCAAGAGAACCGCAGCATTCCGCAAGAGAACCAAGAGTTTGATGAAGAAAGCCTTCGAGTTATC
CACGCTCTGCGATGTTCGGACCTGCGTTTTGATCCATGGCCCTGCTCCTCCACAAGATGGTGATTGCTCGGCAGTGGAATCTCACACGTGGCCTCTCAATCGCGAAGAAA
TCGAAGGTATTATCCGGGCCTATAAGACTAGTTATTTTCAGAAACCATTGATGAAGTCTTTCGATTTGTACGGTTATTTCTCCGATCGTATGAAGAAAATCGAAGCAGAG
AGGGCTAAATTTCGCAATGATGCTGAAGATATCAAGTACCCGAGTTGGGACGAGCGGCTTGAGTGTTTGTCCGGACATCAGTTGCGGTTGGTCATGGAGGATTTGGGTTC
GAAGAGTGAAGTTGCAGAGGGAATGATT
mRNA sequence
CGCGCCAAGACAAGAAACCTATTGCTCTCCCGCACCGCAGCATTCCGCAAGAGAACCGCAGCATTCCGCAAGAGAACCAAGAGTTTGATGAAGAAAGCCTTCGAGTTATC
CACGCTCTGCGATGTTCGGACCTGCGTTTTGATCCATGGCCCTGCTCCTCCACAAGATGGTGATTGCTCGGCAGTGGAATCTCACACGTGGCCTCTCAATCGCGAAGAAA
TCGAAGGTATTATCCGGGCCTATAAGACTAGTTATTTTCAGAAACCATTGATGAAGTCTTTCGATTTGTACGGTTATTTCTCCGATCGTATGAAGAAAATCGAAGCAGAG
AGGGCTAAATTTCGCAATGATGCTGAAGATATCAAGTACCCGAGTTGGGACGAGCGGCTTGAGTGTTTGTCCGGACATCAGTTGCGGTTGGTCATGGAGGATTTGGGTTC
GAAGAGTGAAGTTGCAGAGGGAATGATT
Protein sequence
RAKTRNLLLSRTAAFRKRTAAFRKRTKSLMKKAFELSTLCDVRTCVLIHGPAPPQDGDCSAVESHTWPLNREEIEGIIRAYKTSYFQKPLMKSFDLYGYFSDRMKKIEAE
RAKFRNDAEDIKYPSWDERLECLSGHQLRLVMEDLGSKSEVAEGMI
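
The genome location and the sequences in this record can be cross-checked against each other. The sketch below is illustrative only: it parses the location string and compares the span length with the protein length. It assumes the coordinates are 1-based and inclusive at both ends, which this page does not state, and the function names are hypothetical. The value 156 is the number of residues counted in the protein sequence listed above.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(location: str) -> int:
    """Length of the span in bases.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    _, start, end = parse_location(location)
    return end - start + 1


LOCATION = "scaffold20:1566512..1566979"  # from the record above
PROTEIN_LENGTH = 156  # residues counted in the protein sequence above

length = span_length(LOCATION)
codons, remainder = divmod(length, 3)

print(f"span length: {length} bp")
print(f"whole codons: {codons}, leftover bases: {remainder}")
print(f"matches protein length: {codons == PROTEIN_LENGTH and remainder == 0}")
```

Under that assumption the span is a whole number of codons equal to the protein length, which is consistent with the CDS and mRNA sequences being identical in this record.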