; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0001627 (gene) of Chayote v1 genome

Gene IDSed0001627
OrganismSechium edule (Chayote v1)
DescriptionUPF0400 protein C337.03
Genome locationLG05:24989562..24992147
RNA-Seq ExpressionSed0001627
SyntenySed0001627
Gene Ontology termsGO:0031124 - mRNA 3'-end processing (biological process)
GO:0016591 - RNA polymerase II, holoenzyme (cellular component)
GO:0000993 - RNA polymerase II complex binding (molecular function)
InterPro domainsIPR008942 - ENTH/VHS


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602704.1 Regulation of nuclear pre-mRNA domain-containing protein 1B, partial [Cucurbita argyrosperma subsp. sororia]2.7e-3481.25Show/hide
Query:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQSSSVSLDKIVSDYQVV
        SR K SEFVGEFWKVLP+A R VIENGDDFGRNAAL +IGI EERKVFGSRGQSLKEEIMGK+ME+ NRNGKQ ++KLKQS+S SLDKIVS YQ+V
Subjt:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQSSSVSLDKIVSDYQVV

XP_004138638.1 UPF0400 protein C337.03 [Cucumis sativus]1.0e-3380.41Show/hide
Query:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQSSSVSLDKIVSDYQVVH
        SR K SEFVGEFWKVLP A R VI NGD+FGRNAAL +IGI EERKVFGSRGQSLKEEIMGK++E+ NRNGK FN KLKQS+SVSLDKIVS YQVV+
Subjt:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQSSSVSLDKIVSDYQVVH

XP_008441251.1 PREDICTED: UPF0400 protein C337.03 isoform X1 [Cucumis melo]1.0e-3380.41Show/hide
Query:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQSSSVSLDKIVSDYQVVH
        SR K SEFVGEFWKVLP A R VI NGD+FGRNAAL +IGI EERKVFGSRGQSLKEEIMGK++E+ NRNGK FN KLKQS+SVSLDKIVS YQVV+
Subjt:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQSSSVSLDKIVSDYQVVH

XP_008441252.1 PREDICTED: UPF0400 protein C337.03 isoform X2 [Cucumis melo]1.0e-3380.41Show/hide
Query:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQSSSVSLDKIVSDYQVVH
        SR K SEFVGEFWKVLP A R VI NGD+FGRNAAL +IGI EERKVFGSRGQSLKEEIMGK++E+ NRNGK FN KLKQS+SVSLDKIVS YQVV+
Subjt:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQSSSVSLDKIVSDYQVVH

XP_022152479.1 UPF0400 protein C337.03 [Momordica charantia]7.0e-3570.69Show/hide
Query:  ASREQQNLFSTDSGSPSAISRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQS
        A REQ+  +   +      SR K SEFVGEFWKVLP A R VIENGDDFGRNAAL +IGI EERKVFGSRGQSLKEEIMGK++E+ NRNGKQF++KLKQS
Subjt:  ASREQQNLFSTDSGSPSAISRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQS

Query:  SSVSLDKIVSDYQVVH
        +S SLDKIV+ YQVV+
Subjt:  SSVSLDKIVSDYQVVH

TrEMBL top hitse value%identityAlignment
A0A0A0LMU3 CID domain-containing protein4.9e-3480.41Show/hide
Query:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQSSSVSLDKIVSDYQVVH
        SR K SEFVGEFWKVLP A R VI NGD+FGRNAAL +IGI EERKVFGSRGQSLKEEIMGK++E+ NRNGK FN KLKQS+SVSLDKIVS YQVV+
Subjt:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQSSSVSLDKIVSDYQVVH

A0A1S3B3S0 UPF0400 protein C337.03 isoform X24.9e-3480.41Show/hide
Query:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQSSSVSLDKIVSDYQVVH
        SR K SEFVGEFWKVLP A R VI NGD+FGRNAAL +IGI EERKVFGSRGQSLKEEIMGK++E+ NRNGK FN KLKQS+SVSLDKIVS YQVV+
Subjt:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQSSSVSLDKIVSDYQVVH

A0A5D3C5Q5 UPF0400 protein isoform X14.9e-3480.41Show/hide
Query:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQSSSVSLDKIVSDYQVVH
        SR K SEFVGEFWKVLP A R VI NGD+FGRNAAL +IGI EERKVFGSRGQSLKEEIMGK++E+ NRNGK FN KLKQS+SVSLDKIVS YQVV+
Subjt:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQSSSVSLDKIVSDYQVVH

A0A6J1DG44 UPF0400 protein C337.033.4e-3570.69Show/hide
Query:  ASREQQNLFSTDSGSPSAISRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQS
        A REQ+  +   +      SR K SEFVGEFWKVLP A R VIENGDDFGRNAAL +IGI EERKVFGSRGQSLKEEIMGK++E+ NRNGKQF++KLKQS
Subjt:  ASREQQNLFSTDSGSPSAISRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQS

Query:  SSVSLDKIVSDYQVVH
        +S SLDKIV+ YQVV+
Subjt:  SSVSLDKIVSDYQVVH

E5GB39 CID domain-containing protein4.9e-3480.41Show/hide
Query:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQSSSVSLDKIVSDYQVVH
        SR K SEFVGEFWKVLP A R VI NGD+FGRNAAL +IGI EERKVFGSRGQSLKEEIMGK++E+ NRNGK FN KLKQS+SVSLDKIVS YQVV+
Subjt:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQSSSVSLDKIVSDYQVVH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G26990.1 ENTH/VHS family protein3.1e-2550.83Show/hide
Query:  ASREQQNLFSTDSGSPSAISRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKL---
        A REQ+  +   +      SR K SEFVGEFWKVLP A R +IENGDDFGR +A  ++ I EERKVFGSRGQ LKEE++G+  E+  RNG    +KL   
Subjt:  ASREQQNLFSTDSGSPSAISRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKL---

Query:  -KQSSSVSLDKIVSDYQVVH
         +Q +  +L+K+VS  +V+H
Subjt:  -KQSSSVSLDKIVSDYQVVH

AT5G10060.1 ENTH/VHS family protein1.2e-1339.09Show/hide
Query:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYM--------------ESSNRNGKQFNIKLKQSSSVSL
        S+ + +EFV EFW VLP A + ++  GDD G++A   +I I EER+VFGSR +SLK+ ++G+ +              +SS R  K    KL  S  V+ 
Subjt:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYM--------------ESSNRNGKQFNIKLKQSSSVSL

Query:  DKIVSDYQVV
        +KI S Y +V
Subjt:  DKIVSDYQVV

AT5G65180.1 ENTH/VHS family protein3.1e-1235.19Show/hide
Query:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMG------------KYMESSNRNGKQFNIKLKQSSSVSLDK
        S+ + +EFV EFWKVLP A + ++  GDD+G+     ++ I EER+VFGSR +SLK+ ++             ++  S +      + K K SS    +K
Subjt:  SRHKDSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMG------------KYMESSNRNGKQFNIKLKQSSSVSLDK

Query:  IVSDYQVV
        IVS + +V
Subjt:  IVSDYQVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCACCTCTTCCACGATGTCGCCGGTCCACTCGTCGGCTCGGAAGGTTCCTTTGATGGATTCGATGGATAGAACCATCACTCTGCTGCTCGGAGATTCGGAGGTCTC
CAGAATCCGGTCCATGGGCATTGAATTAGAGTTAGGGCTCAGCCTCAGAAGTGGGGTTTCCGTCACCAAGGGTAAGCGCAGTGGTTGCAGTGACCGCGGTCGGATATTGA
CGGCGAAGGATTTTCCCTATGCGATCTCCCTACAGTTTCCGGCGTCAAGAGAGCAACAAAACTTGTTTTCCACCGACAGTGGATCTCCTTCGGCTATCAGTAGGCATAAA
GACTCAGAGTTTGTTGGTGAATTTTGGAAAGTCCTTCCCAGTGCATTTCGTCATGTAATTGAAAATGGGGATGATTTTGGAAGAAATGCTGCCCTACCAATGATTGGCAT
CTGTGAAGAGAGAAAAGTTTTTGGATCTCGAGGGCAGAGTCTTAAGGAAGAGATAATGGGAAAATATATGGAAAGTAGTAATCGAAATGGGAAGCAATTCAACATTAAAC
TGAAACAATCTTCGAGCGTATCATTGGATAAAATAGTTTCTGATTACCAAGTTGTTCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCACCTCTTCCACGATGTCGCCGGTCCACTCGTCGGCTCGGAAGGTTCCTTTGATGGATTCGATGGATAGAACCATCACTCTGCTGCTCGGAGATTCGGAGGTCTC
CAGAATCCGGTCCATGGGCATTGAATTAGAGTTAGGGCTCAGCCTCAGAAGTGGGGTTTCCGTCACCAAGGGTAAGCGCAGTGGTTGCAGTGACCGCGGTCGGATATTGA
CGGCGAAGGATTTTCCCTATGCGATCTCCCTACAGTTTCCGGCGTCAAGAGAGCAACAAAACTTGTTTTCCACCGACAGTGGATCTCCTTCGGCTATCAGTAGGCATAAA
GACTCAGAGTTTGTTGGTGAATTTTGGAAAGTCCTTCCCAGTGCATTTCGTCATGTAATTGAAAATGGGGATGATTTTGGAAGAAATGCTGCCCTACCAATGATTGGCAT
CTGTGAAGAGAGAAAAGTTTTTGGATCTCGAGGGCAGAGTCTTAAGGAAGAGATAATGGGAAAATATATGGAAAGTAGTAATCGAAATGGGAAGCAATTCAACATTAAAC
TGAAACAATCTTCGAGCGTATCATTGGATAAAATAGTTTCTGATTACCAAGTTGTTCATTGA
Protein sequenceShow/hide protein sequence
MVTSSTMSPVHSSARKVPLMDSMDRTITLLLGDSEVSRIRSMGIELELGLSLRSGVSVTKGKRSGCSDRGRILTAKDFPYAISLQFPASREQQNLFSTDSGSPSAISRHK
DSEFVGEFWKVLPSAFRHVIENGDDFGRNAALPMIGICEERKVFGSRGQSLKEEIMGKYMESSNRNGKQFNIKLKQSSSVSLDKIVSDYQVVH