; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh01G020590 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh01G020590
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionLarge proline-rich protein bag6-B isoform X1
Genome locationCmo_Chr01:14377269..14378521
RNA-Seq ExpressionCmoCh01G020590
SyntenyCmoCh01G020590
Gene Ontology termsGO:0030433 - ubiquitin-dependent ERAD pathway (biological process)
GO:0071818 - BAT3 complex (cellular component)
GO:0031593 - polyubiquitin modification-dependent protein binding (molecular function)
GO:0051787 - misfolded protein binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7037937.1 hypothetical protein SDJN02_01570 [Cucurbita argyrosperma subsp. argyrosperma]6.6e-8397.6Show/hide
Query:  SYNVGALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQST
        S N GALPRFELP NPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQST
Subjt:  SYNVGALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQST

Query:  IDHKDLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN
        IDHKDLAQRITSTNSPMDVFRAVVESSAQVSGCN+EDIADELCSHDTLAEEYLGILSNDIKRRLQGN
Subjt:  IDHKDLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN

XP_022940175.1 uncharacterized protein LOC111445881 isoform X1 [Cucurbita moschata]6.0e-8498.8Show/hide
Query:  SYNVGALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQST
        S N GALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQST
Subjt:  SYNVGALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQST

Query:  IDHKDLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN
        IDHKDLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN
Subjt:  IDHKDLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN

XP_022940176.1 uncharacterized protein LOC111445881 isoform X2 [Cucurbita moschata]3.9e-83100Show/hide
Query:  GALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQSTIDHK
        GALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQSTIDHK
Subjt:  GALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQSTIDHK

Query:  DLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN
        DLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN
Subjt:  DLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN

XP_023525662.1 uncharacterized protein LOC111789201 isoform X1 [Cucurbita pepo subsp. pepo]1.1e-8299.39Show/hide
Query:  GALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQSTIDHK
        GALPRFELP NPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQSTIDHK
Subjt:  GALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQSTIDHK

Query:  DLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN
        DLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN
Subjt:  DLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN

XP_023525663.1 uncharacterized protein LOC111789201 isoform X2 [Cucurbita pepo subsp. pepo]1.7e-8398.2Show/hide
Query:  SYNVGALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQST
        S N GALPRFELP NPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQST
Subjt:  SYNVGALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQST

Query:  IDHKDLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN
        IDHKDLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN
Subjt:  IDHKDLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN

TrEMBL top hitse value%identityAlignment
A0A5A7VE91 Large proline-rich protein bag6-B isoform X13.9e-6577.51Show/hide
Query:  TFSYNVGALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQ
        + S N G  PRFELP NPQ+LQ+P+SLSIVLRNAQ LLS+YAI+SLPRIAERFEQDGSS+DP VRGQIQEELVQVGLRMQQFGALLLDLGSSILTL LEQ
Subjt:  TFSYNVGALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQ

Query:  STIDHKDLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN
        ST+D +DLA+R+ STN PMDVFRAVVESSA++SG +  DIA+ LC+HD+LAEEYL IL++DI RRLQ N
Subjt:  STIDHKDLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN

A0A6J1FHQ5 uncharacterized protein LOC111445881 isoform X21.9e-83100Show/hide
Query:  GALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQSTIDHK
        GALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQSTIDHK
Subjt:  GALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQSTIDHK

Query:  DLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN
        DLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN
Subjt:  DLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN

A0A6J1FPT9 uncharacterized protein LOC111445881 isoform X12.9e-8498.8Show/hide
Query:  SYNVGALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQST
        S N GALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQST
Subjt:  SYNVGALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQST

Query:  IDHKDLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN
        IDHKDLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN
Subjt:  IDHKDLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN

A0A6J1IWC0 uncharacterized protein LOC111481190 isoform X13.9e-8195.21Show/hide
Query:  SYNVGALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQST
        S N G LPRFELP NPQDLQTPKSLSIVLRNAQHLLS+YAI+SLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQST
Subjt:  SYNVGALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQST

Query:  IDHKDLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN
        IDHKDLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHD LAEEYL ILSNDIKRRLQGN
Subjt:  IDHKDLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN

A0A6J1J4J7 uncharacterized protein LOC111481190 isoform X22.5e-8096.32Show/hide
Query:  GALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQSTIDHK
        G LPRFELP NPQDLQTPKSLSIVLRNAQHLLS+YAI+SLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQSTIDHK
Subjt:  GALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQSTIDHK

Query:  DLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN
        DLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHD LAEEYL ILSNDIKRRLQGN
Subjt:  DLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN

SwissProt top hitse value%identityAlignment
D5LXJ0 Ubiquitin-like domain-containing protein CIP731.2e-0531.82Show/hide
Query:  LPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQS
        L + P+ L +P SL+ VL + + ++ E A   L ++A + E      DP+ R   Q   ++ G+     GA LL+LG + +TL L Q+
Subjt:  LPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQS

Arabidopsis top hitse value%identityAlignment
AT5G42220.1 Ubiquitin-like superfamily protein1.9e-2459.41Show/hide
Query:  TFSYNVGALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQ
        T S   G  PR ELP N +   TP++LS+VLRNAQHLLS   + SL  IA R EQDGSS+DP +R QIQ E VQVGL MQ  GALLL+LG +ILTL +  
Subjt:  TFSYNVGALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQQFGALLLDLGSSILTLHLEQ

Query:  S
        S
Subjt:  S


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCTCAAAATGGTGGTAAATCTTAAACCCCTGTATCCTCCATTACTTGATTTTGTTCCAATTAATCTTTTTACATTTTCTTACAACGTAGGAGCTCTACCA
AGGTTCGAACTCCCCGCCAATCCCCAAGATCTGCAGACTCCTAAATCTTTGAGTATCGTTTTGCGAAATGCACAACACCTCCTAAGTGAATATGCAATATATTCA
CTTCCGCGCATCGCTGAGCGTTTCGAGCAAGACGGTTCCTCGACCGATCCAATGGTGAGGGGTCAAATACAGGAAGAGTTAGTGCAAGTAGGACTTAGGATGCAA
CAGTTCGGCGCGCTTCTTCTCGATCTCGGCAGTTCAATCTTGACGCTCCATTTGGAACAATCCACGATTGACCATAAAGATCTTGCTCAAAGGATTACATCTACC
AATTCTCCTATGGATGTGTTTCGCGCTGTGGTTGAGAGTTCGGCTCAGGTTTCTGGCTGCAACAATGAAGATATTGCAGATGAGTTGTGCAGCCATGACACACTT
GCTGAGGAATATTTAGGGATATTATCAAATGATATCAAACGACGTTTGCAAGGCAATTGA
mRNA sequenceShow/hide mRNA sequence
TGAAGATTAAAAAATAAAATAAAAAATGTAAGCCTTCCATTTTTCTTGGCCTCTCAATTTTGTTGCCTCTTCCTGCGCTTTCATTACAAACCAAATAGGATCCCC
TTTGATTTTCCGGGGAAAGGTCCCTAAGGATGAACGCCCATCCTCTAAGTATCATACCTCTGTATTCCTCTCTTTCCCTGCCCTAATTCTCATTTCATCTCCTTT
CTTTCCCCCTTTTCTTCTCCATCCTTCAACAACTCCTCCGCGTTCATGAACCGTTTCGAGCGAGCAATCTCTCAAAATGTTTCTCAAAATGGTGGTAAATCTTAA
ACCCCTGTATCCTCCATTACTTGATTTTGTTCCAATTAATCTTTTTACATTTTCTTACAACGTAGGAGCTCTACCAAGGTTCGAACTCCCCGCCAATCCCCAAGA
TCTGCAGACTCCTAAATCTTTGAGTATCGTTTTGCGAAATGCACAACACCTCCTAAGTGAATATGCAATATATTCACTTCCGCGCATCGCTGAGCGTTTCGAGCA
AGACGGTTCCTCGACCGATCCAATGGTGAGGGGTCAAATACAGGAAGAGTTAGTGCAAGTAGGACTTAGGATGCAACAGTTCGGCGCGCTTCTTCTCGATCTCGG
CAGTTCAATCTTGACGCTCCATTTGGAACAATCCACGATTGACCATAAAGATCTTGCTCAAAGGATTACATCTACCAATTCTCCTATGGATGTGTTTCGCGCTGT
GGTTGAGAGTTCGGCTCAGGTTTCTGGCTGCAACAATGAAGATATTGCAGATGAGTTGTGCAGCCATGACACACTTGCTGAGGAATATTTAGGGATATTATCAAA
TGATATCAAACGACGTTTGCAAGGCAATTGAGAGCAGGAAAAATAAGAAGGTGTTGAAGGATATTTGCAGATTTGAGGGCGGAATTCCATCAATGCCTTGCCTTC
CTCAACTTTATTTTTCTTCCTTTCTTTTTTTTTTTTCATTCTACTTTTGCTTTCATTATAGATTCCTTAGGTTTGTAATA
Protein sequenceShow/hide protein sequence
MFLKMVVNLKPLYPPLLDFVPINLFTFSYNVGALPRFELPANPQDLQTPKSLSIVLRNAQHLLSEYAIYSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQ
QFGALLLDLGSSILTLHLEQSTIDHKDLAQRITSTNSPMDVFRAVVESSAQVSGCNNEDIADELCSHDTLAEEYLGILSNDIKRRLQGN