; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021945 (gene) of Snake gourd v1 genome

Gene IDTan0021945
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionLarge proline-rich protein bag6-B isoform X1
Genome locationLG01:116274694..116277390
RNA-Seq ExpressionTan0021945
SyntenyTan0021945
Gene Ontology termsGO:0030433 - ubiquitin-dependent ERAD pathway (biological process)
GO:0071818 - BAT3 complex (cellular component)
GO:0031593 - polyubiquitin modification-dependent protein binding (molecular function)
GO:0051787 - misfolded protein binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7037937.1 hypothetical protein SDJN02_01570 [Cucurbita argyrosperma subsp. argyrosperma]4.4e-7386.59Show/hide
Query:  MNRFERAI----SQNGGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLG
        MNRFERAI    SQNGG LPRFELPTNPQDLQTP+SL IVLRNAQ LLS+YAI SLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQ FGALLLDLG
Subjt:  MNRFERAI----SQNGGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLG

Query:  SSILTLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSN
        SSILTL LEQS ID +DLAQR+ STNSPMDVFRAVVESSAQVSG +SEDIADELCS +TLAEEYL ILSNDI RRLQ N
Subjt:  SSILTLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSN

XP_022982322.1 uncharacterized protein LOC111481190 isoform X1 [Cucurbita maxima]2.6e-7386.59Show/hide
Query:  MNRFERAI----SQNGGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLG
        MNRFERAI    SQNGGVLPRFELPTNPQDLQTP+SL IVLRNAQ LLSDYAI SLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQ FGALLLDLG
Subjt:  MNRFERAI----SQNGGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLG

Query:  SSILTLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSN
        SSILTL LEQS ID +DLAQR+ STNSPMDVFRAVVESSAQVSG ++EDIADELCS + LAEEYL ILSNDI RRLQ N
Subjt:  SSILTLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSN

XP_022982323.1 uncharacterized protein LOC111481190 isoform X2 [Cucurbita maxima]9.8e-7386.52Show/hide
Query:  MNRFERAISQN---GGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLGS
        MNRFERAISQN    GVLPRFELPTNPQDLQTP+SL IVLRNAQ LLSDYAI SLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQ FGALLLDLGS
Subjt:  MNRFERAISQN---GGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLGS

Query:  SILTLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSN
        SILTL LEQS ID +DLAQR+ STNSPMDVFRAVVESSAQVSG ++EDIADELCS + LAEEYL ILSNDI RRLQ N
Subjt:  SILTLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSN

XP_023525663.1 uncharacterized protein LOC111789201 isoform X2 [Cucurbita pepo subsp. pepo]9.8e-7386.03Show/hide
Query:  MNRFERAI----SQNGGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLG
        MNRFERAI    SQNGG LPRFELPTNPQDLQTP+SL IVLRNAQ LLS+YAI SLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQ FGALLLDLG
Subjt:  MNRFERAI----SQNGGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLG

Query:  SSILTLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSN
        SSILTL LEQS ID +DLAQR+ STNSPMDVFRAVVESSAQVSG ++EDIADELCS +TLAEEYL ILSNDI RRLQ N
Subjt:  SSILTLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSN

XP_038899964.1 uncharacterized protein LOC120087140 isoform X1 [Benincasa hispida]1.2e-7888.27Show/hide
Query:  MNRFERAISQNGGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLGSSIL
        MNRFERAISQNGG LPRFELPTNPQ+LQTPESLGIVLRNAQ+LLSDYAISSLPRIAERFEQDGSSTDP VRGQIQEELVQ+GLRMQ FGALLLDLGSSIL
Subjt:  MNRFERAISQNGGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLGSSIL

Query:  TLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSNSNQE
        TLRLEQS IDLQDLAQRV+STN PMDVFRAVVESSA++SGSS +D+A ELC+DETLAEEYLE+ ++DINRRLQSNS QE
Subjt:  TLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSNSNQE

TrEMBL top hitse value%identityAlignment
A0A1S3BV17 uncharacterized protein LOC103493795 isoform X15.2e-7282.87Show/hide
Query:  MNRFERA--ISQNGGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLGSS
        MNRFERA  ISQNGG  PRFELPTNPQ+LQ+PESL IVLRNAQ+LLSDYAI SLPRIAERFEQDGSS+DP VRGQIQEELVQVGLRMQ FGALLLDLGSS
Subjt:  MNRFERA--ISQNGGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLGSS

Query:  ILTLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSNSNQE
        ILTLRLEQS +DLQDLA+RV+STN PMDVFRAVVESSA++SGSS+ DIA+ LC+ ++LAEEYLEIL++DI RRLQSNS QE
Subjt:  ILTLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSNSNQE

A0A5A7VE91 Large proline-rich protein bag6-B isoform X15.2e-7282.87Show/hide
Query:  MNRFERA--ISQNGGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLGSS
        MNRFERA  ISQNGG  PRFELPTNPQ+LQ+PESL IVLRNAQ+LLSDYAI SLPRIAERFEQDGSS+DP VRGQIQEELVQVGLRMQ FGALLLDLGSS
Subjt:  MNRFERA--ISQNGGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLGSS

Query:  ILTLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSNSNQE
        ILTLRLEQS +DLQDLA+RV+STN PMDVFRAVVESSA++SGSS+ DIA+ LC+ ++LAEEYLEIL++DI RRLQSNS QE
Subjt:  ILTLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSNSNQE

A0A6J1FPT9 uncharacterized protein LOC111445881 isoform X11.8e-7285.47Show/hide
Query:  MNRFERAI----SQNGGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLG
        MNRFERAI    SQNGG LPRFELP NPQDLQTP+SL IVLRNAQ LLS+YAI SLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQ FGALLLDLG
Subjt:  MNRFERAI----SQNGGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLG

Query:  SSILTLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSN
        SSILTL LEQS ID +DLAQR+ STNSPMDVFRAVVESSAQVSG ++EDIADELCS +TLAEEYL ILSNDI RRLQ N
Subjt:  SSILTLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSN

A0A6J1IWC0 uncharacterized protein LOC111481190 isoform X11.2e-7386.59Show/hide
Query:  MNRFERAI----SQNGGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLG
        MNRFERAI    SQNGGVLPRFELPTNPQDLQTP+SL IVLRNAQ LLSDYAI SLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQ FGALLLDLG
Subjt:  MNRFERAI----SQNGGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLG

Query:  SSILTLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSN
        SSILTL LEQS ID +DLAQR+ STNSPMDVFRAVVESSAQVSG ++EDIADELCS + LAEEYL ILSNDI RRLQ N
Subjt:  SSILTLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSN

A0A6J1J4J7 uncharacterized protein LOC111481190 isoform X24.7e-7386.52Show/hide
Query:  MNRFERAISQN---GGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLGS
        MNRFERAISQN    GVLPRFELPTNPQDLQTP+SL IVLRNAQ LLSDYAI SLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQ FGALLLDLGS
Subjt:  MNRFERAISQN---GGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLGS

Query:  SILTLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSN
        SILTL LEQS ID +DLAQR+ STNSPMDVFRAVVESSAQVSG ++EDIADELCS + LAEEYL ILSNDI RRLQ N
Subjt:  SILTLRLEQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSN

SwissProt top hitse value%identityAlignment
D5LXJ0 Ubiquitin-like domain-containing protein CIP732.8e-0631.82Show/hide
Query:  LPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLGSSILTLRLEQS
        L + P+ L +P SL  VL + ++++ + A   L ++A + E      DP+ R   Q   ++ G+   + GA LL+LG + +TLRL Q+
Subjt:  LPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLGSSILTLRLEQS

Arabidopsis top hitse value%identityAlignment
AT5G42220.1 Ubiquitin-like superfamily protein4.6e-2858.12Show/hide
Query:  MNRFERAISQNG----------GVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGA
        +NR E+A+SQNG          G  PR ELP N +   TPE+L +VLRNAQ LLS   +SSL  IA R EQDGSS+DP +R QIQ E VQVGL MQH GA
Subjt:  MNRFERAISQNG----------GVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGA

Query:  LLLDLGSSILTLRLEQS
        LLL+LG +ILTLR+  S
Subjt:  LLLDLGSSILTLRLEQS

AT5G42220.1 Ubiquitin-like superfamily protein1.5e-0739.44Show/hide
Query:  EQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSN
        + S++++Q +AQ +  ++ P DVFRA+VE++A     S +++ +ELC DE L++EY E+L  DI  RL+ +
Subjt:  EQSKIDLQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACCGTTTCGAGCGTGCAATATCTCAGAATGGTGGGGTTCTACCAAGGTTTGAACTACCCACCAATCCTCAAGATTTGCAAACCCCAGAATCTTTGGGTATTGTTTT
GCGAAATGCACAACAACTCCTAAGTGATTATGCTATATCTTCACTACCACGCATTGCTGAGCGATTCGAGCAAGATGGCTCTTCGACTGATCCCATGGTGAGGGGTCAAA
TACAGGAAGAGTTAGTGCAAGTAGGACTTAGGATGCAACATTTTGGAGCCCTTCTTCTCGATCTTGGTAGTTCAATCTTGACGCTTCGCTTAGAACAATCAAAGATTGAC
CTTCAAGATCTTGCTCAAAGGGTTATATCTACCAATTCTCCTATGGATGTGTTTCGCGCTGTGGTAGAGAGTTCAGCTCAAGTTTCTGGCAGTAGTAGTGAAGATATTGC
AGATGAGTTGTGCAGTGATGAGACACTTGCTGAGGAATATTTAGAGATATTATCAAATGATATAAACCGACGTTTACAAAGCAATTCAAATCAGGAAAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGAACCGTTTCGAGCGTGCAATATCTCAGAATGGTGGGGTTCTACCAAGGTTTGAACTACCCACCAATCCTCAAGATTTGCAAACCCCAGAATCTTTGGGTATTGTTTT
GCGAAATGCACAACAACTCCTAAGTGATTATGCTATATCTTCACTACCACGCATTGCTGAGCGATTCGAGCAAGATGGCTCTTCGACTGATCCCATGGTGAGGGGTCAAA
TACAGGAAGAGTTAGTGCAAGTAGGACTTAGGATGCAACATTTTGGAGCCCTTCTTCTCGATCTTGGTAGTTCAATCTTGACGCTTCGCTTAGAACAATCAAAGATTGAC
CTTCAAGATCTTGCTCAAAGGGTTATATCTACCAATTCTCCTATGGATGTGTTTCGCGCTGTGGTAGAGAGTTCAGCTCAAGTTTCTGGCAGTAGTAGTGAAGATATTGC
AGATGAGTTGTGCAGTGATGAGACACTTGCTGAGGAATATTTAGAGATATTATCAAATGATATAAACCGACGTTTACAAAGCAATTCAAATCAGGAAAAATAA
Protein sequenceShow/hide protein sequence
MNRFERAISQNGGVLPRFELPTNPQDLQTPESLGIVLRNAQQLLSDYAISSLPRIAERFEQDGSSTDPMVRGQIQEELVQVGLRMQHFGALLLDLGSSILTLRLEQSKID
LQDLAQRVISTNSPMDVFRAVVESSAQVSGSSSEDIADELCSDETLAEEYLEILSNDINRRLQSNSNQEK