; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS004251 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS004251
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionYae1_N domain-containing protein
Genome locationscaffold92:1185178..1187195
RNA-Seq ExpressionMS004251
SyntenyMS004251
Gene Ontology termsGO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0005681 - spliceosomal complex (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0004386 - helicase activity (molecular function)
InterPro domainsIPR019191 - Essential protein Yae1, N-terminal
IPR038881 - Yae1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601147.1 Crossover junction endonuclease MUS81, partial [Cucurbita argyrosperma subsp. sororia]8.8e-8082.61Show/hide
Query:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE
        GS V+D C+DDGSLWGGSDEGLEE SDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFKQSV IGYKLGLVRGVSSVLA LPDDLKEKL+G EE
Subjt:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE

Query:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ
        N SKFQSLYE  NSLSTADALRLF+D+I+AQ   EECVDA+T+S+TI LLKQN D+ RLG+FYGELQALLP SPAL +HLHE+Q
Subjt:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ

KAG7031946.1 hypothetical protein SDJN02_05988 [Cucurbita argyrosperma subsp. argyrosperma]8.8e-8082.61Show/hide
Query:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE
        GS V+D C+DDGSLWGGSDEGLEE SDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFKQSV IGYKLGLVRGVSSVLA LPDDLKEKL+G EE
Subjt:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE

Query:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ
        N SKFQSLYE  NSLSTADALRLF+D+I+AQ   EECVDA+T+S+TI LLKQN D+ RLG+FYGELQALLP SPAL +HLHE+Q
Subjt:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ

XP_022138890.1 uncharacterized protein LOC111009960 [Momordica charantia]5.0e-9998.37Show/hide
Query:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE
        GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLI+GKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE
Subjt:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE

Query:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ
        NSSKFQSLYECTNS+STADALRLFNDEILAQD TEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ
Subjt:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ

XP_023514282.1 uncharacterized protein LOC111778596 isoform X1 [Cucurbita pepo subsp. pepo]6.7e-8082.61Show/hide
Query:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE
        GS V+D C+DDGSLWGGSDEGLEE SDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFKQSV IGYKLGLVRGVSSVLA LPDDLKEKL+G EE
Subjt:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE

Query:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ
        N SKFQSLYE  NSLSTADALRLF+++I+AQ   EECVDA+T+S+TI LLKQN D+ RLGKFYGELQALLP SPAL +HLHE+Q
Subjt:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ

XP_023514291.1 uncharacterized protein LOC111778596 isoform X2 [Cucurbita pepo subsp. pepo]6.7e-8082.61Show/hide
Query:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE
        GS V+D C+DDGSLWGGSDEGLEE SDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFKQSV IGYKLGLVRGVSSVLA LPDDLKEKL+G EE
Subjt:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE

Query:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ
        N SKFQSLYE  NSLSTADALRLF+++I+AQ   EECVDA+T+S+TI LLKQN D+ RLGKFYGELQALLP SPAL +HLHE+Q
Subjt:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ

TrEMBL top hitse value%identityAlignment
A0A6J1CAS2 uncharacterized protein LOC1110099602.4e-9998.37Show/hide
Query:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE
        GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLI+GKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE
Subjt:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE

Query:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ
        NSSKFQSLYECTNS+STADALRLFNDEILAQD TEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ
Subjt:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ

A0A6J1H273 uncharacterized protein LOC1114589032.8e-7982.61Show/hide
Query:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE
        GS V+D C+DDGSLWGGSDE LEE SDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFKQSV IGYKLGLVRGVSSVLA LPDDLKEKL+G EE
Subjt:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE

Query:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ
          SKFQSLYE  NSLSTADALRLF+D+I+AQ   EECVDA+T+S+TI LLKQN D GRLG+FYGELQALLPKSPAL +HLHE+Q
Subjt:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ

A0A6J1JR95 uncharacterized protein LOC111487537 isoform X21.8e-7882.07Show/hide
Query:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE
        GS V+D C+DDGSLWGGSDEGLEE SDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFKQSV IGYKLGLVRGVSSVLA LPDDLKEKL+G EE
Subjt:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE

Query:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ
        N SKFQSLYE  NSLSTADALRLF+D+I+AQ   EE VDA+T+S+TI LLKQN D  RLG+FYGELQALLP SPAL +HLHE+Q
Subjt:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ

A0A6J1JRH1 uncharacterized protein LOC111487537 isoform X31.8e-7882.07Show/hide
Query:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE
        GS V+D C+DDGSLWGGSDEGLEE SDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFKQSV IGYKLGLVRGVSSVLA LPDDLKEKL+G EE
Subjt:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE

Query:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ
        N SKFQSLYE  NSLSTADALRLF+D+I+AQ   EE VDA+T+S+TI LLKQN D  RLG+FYGELQALLP SPAL +HLHE+Q
Subjt:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ

A0A6J1JSZ3 uncharacterized protein LOC111487537 isoform X11.8e-7882.07Show/hide
Query:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE
        GS V+D C+DDGSLWGGSDEGLEE SDLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFKQSV IGYKLGLVRGVSSVLA LPDDLKEKL+G EE
Subjt:  GSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEE

Query:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ
        N SKFQSLYE  NSLSTADALRLF+D+I+AQ   EE VDA+T+S+TI LLKQN D  RLG+FYGELQALLP SPAL +HLHE+Q
Subjt:  NSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ

SwissProt top hitse value%identityAlignment
Q9NRH1 Protein YAE1 homolog4.3e-0537.18Show/hide
Query:  LCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVL
        L Q  G      DE  +E     REWQ    +    GYRDG+ AGK    Q+GFN G+K+   +    G +RG  S L
Subjt:  LCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVL

Arabidopsis top hitse value%identityAlignment
AT1G34550.1 Protein of unknown function (DUF616)2.0e-0532.35Show/hide
Query:  SVLACLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLH
        +VLA LPD+L+EKL+  +E   KFQ L+   ++LST  A++ F   +     T+E +     + T             G +  EL +LL KSP ++  L 
Subjt:  SVLACLPDDLKEKLMGNEENSSKFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLH

Query:  ED
        +D
Subjt:  ED

AT1G34570.1 Essential protein Yae1, N-terminal1.4e-3041.42Show/hide
Query:  DGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEENSSKFQSLYE
        D   +G SDE   E   LD E ++R  +FH+ GYRDG++ GKEA AQEG+N G+K+SV  GYK G+VRGVSS LA LP + +EKL+  +E   KFQ L+ 
Subjt:  DGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEENSSKFQSLYE

Query:  CTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTI-GLLKQNPDDGRLGKFYGELQALLPKSPALKV
          ++LST  A++ F + +  +   E+  +   +S ++ G          LG +  EL +LL KSP ++V
Subjt:  CTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTI-GLLKQNPDDGRLGKFYGELQALLPKSPALKV

AT3G15750.1 Essential protein Yae1, N-terminal6.8e-3042.37Show/hide
Query:  LCQDDGSLWGGSD-EGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEENSSKF
        L + D   +G SD E   E   L  E ++R  +FH+ GYRDG++AGKEA AQEG+N G+K+SV  GY+ GLVRGVSS LA LPD+L+EKL+  +E   KF
Subjt:  LCQDDGSLWGGSD-EGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEENSSKF

Query:  QSLYECTNSLSTADALRLFNDEILA----QDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKV
        Q L+   ++LST  A++ F + +      +   EE  D+ ++S +   +    D   LG +  EL +LL KSP ++V
Subjt:  QSLYECTNSLSTADALRLFNDEILA----QDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTTACTGTCTGGTTTTAGGTTCTGGTGTGGATGACTTGTGCCAAGATGATGGATCTTTATGGGGTGGTTCTGATGAAGGCTTGGAGGAAATATCTGATTTGGACAG
GGAGTGGCAGAGGAGACATGACCAATTCCATACGATTGGATACCGTGATGGTTTAATCGCTGGTAAAGAAGCTGCAGCTCAAGAGGGATTTAATGTTGGCTTCAAGCAGT
CAGTCTTTATTGGGTATAAGTTGGGTCTTGTCAGAGGTGTTAGCAGTGTGCTTGCTTGCCTTCCTGATGACTTGAAAGAGAAGCTAATGGGAAATGAAGAGAACAGTAGT
AAATTCCAAAGCTTGTATGAATGTACGAACTCTCTTTCGACAGCAGATGCGCTTAGACTATTCAATGACGAGATTTTGGCACAAGACATGACAGAAGAGTGTGTCGACGC
AGATACTAATTCCCGAACGATAGGTTTGCTGAAGCAAAATCCAGATGATGGACGTCTAGGGAAGTTCTATGGAGAGCTTCAAGCACTTTTACCAAAATCACCTGCTCTGA
AAGTTCATCTACATGAAGACCAG
mRNA sequenceShow/hide mRNA sequence
ATGACTTACTGTCTGGTTTTAGGTTCTGGTGTGGATGACTTGTGCCAAGATGATGGATCTTTATGGGGTGGTTCTGATGAAGGCTTGGAGGAAATATCTGATTTGGACAG
GGAGTGGCAGAGGAGACATGACCAATTCCATACGATTGGATACCGTGATGGTTTAATCGCTGGTAAAGAAGCTGCAGCTCAAGAGGGATTTAATGTTGGCTTCAAGCAGT
CAGTCTTTATTGGGTATAAGTTGGGTCTTGTCAGAGGTGTTAGCAGTGTGCTTGCTTGCCTTCCTGATGACTTGAAAGAGAAGCTAATGGGAAATGAAGAGAACAGTAGT
AAATTCCAAAGCTTGTATGAATGTACGAACTCTCTTTCGACAGCAGATGCGCTTAGACTATTCAATGACGAGATTTTGGCACAAGACATGACAGAAGAGTGTGTCGACGC
AGATACTAATTCCCGAACGATAGGTTTGCTGAAGCAAAATCCAGATGATGGACGTCTAGGGAAGTTCTATGGAGAGCTTCAAGCACTTTTACCAAAATCACCTGCTCTGA
AAGTTCATCTACATGAAGACCAG
Protein sequenceShow/hide protein sequence
MTYCLVLGSGVDDLCQDDGSLWGGSDEGLEEISDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVFIGYKLGLVRGVSSVLACLPDDLKEKLMGNEENSS
KFQSLYECTNSLSTADALRLFNDEILAQDMTEECVDADTNSRTIGLLKQNPDDGRLGKFYGELQALLPKSPALKVHLHEDQ