CuGenDBv2

Tan0016858 (gene) of Snake gourd v1 genome

Gene ID: Tan0016858
Organism: Trichosanthes anguina (Snake gourd v1)
Description: COPII coat assembly protein SEC16, putative
Genome location: LG03:68814534..68814995
RNA-Seq Expression: Tan0016858
Synteny: Tan0016858
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022926106.1 uncharacterized protein LOC111433319 [Cucurbita moschata] | e-value: 1.1e-56 | % identity: 82.5
Query:  MADSAALKTYPPPSPSPS--NVRRRNSIPSSAIMPPLNGTVSP-PPPSPIDLRLLSKSSSQSYTSLKDILPSAA-AVNSPTA---ANSGYEISIRNRLVK
        MA SAA KTYPPPSPSPS  N+ RR+ IPSSA MPP NGT+SP PPPSPIDLRLLSKSSSQSYTSLKDILPS+A AVNSPTA   ANSGYEI IRNRLVK
Subjt:  MADSAALKTYPPPSPSPS--NVRRRNSIPSSAIMPPLNGTVSP-PPPSPIDLRLLSKSSSQSYTSLKDILPSAA-AVNSPTA---ANSGYEISIRNRLVK

Query:  QAAWAYLQPMSASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL
        QAAWAYLQPMSASSYSAGPN FH  WLRFS GNPI+ CLGFIRG IIP+IIRV R CICL
Subjt:  QAAWAYLQPMSASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL

XP_022963126.1 uncharacterized protein LOC111463426 [Cucurbita moschata] | e-value: 6.0e-55 | % identity: 81.76
Query:  MADSAALKTYPPPSPSPSNVRRRNSIPSSAIMPPLNGTVSPPPPSPIDLRLL-SKSSSQSYTSLKDILP--SAAAVNSPTA---ANSGYEISIRNRLVKQ
        MADSAA KTY  PS   S + RR SIPSSAIMPP N  VSPP  SPIDLR+L SKSSSQSYTSLKDILP  SAAA NSPTA   ANSGYEISIRNRLVKQ
Subjt:  MADSAALKTYPPPSPSPSNVRRRNSIPSSAIMPPLNGTVSPPPPSPIDLRLL-SKSSSQSYTSLKDILP--SAAAVNSPTA---ANSGYEISIRNRLVKQ

Query:  AAWAYLQPMSASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL
        AAWAYLQPMSASSYS G NFFHRFWLRFSA NPIA  LGFIRGSIIPAI+RVIRFCICL
Subjt:  AAWAYLQPMSASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL

XP_022979075.1 uncharacterized protein LOC111478820 [Cucurbita maxima] | e-value: 5.8e-58 | % identity: 83.12
Query:  MADSAALKTYPPPSPSPS--NVRRRNSIPSSAIMPPLNGTVSP-PPPSPIDLRLLSKSSSQSYTSLKDILPSAA-AVNSPTA---ANSGYEISIRNRLVK
        MA SAA KTYPPPSPSPS  N+ RR+ IPSSA MPP NGT+SP PPPSPIDLRLLSKSSSQSYTSLKDILPS+A AVNSPTA   ANSGYEI IRNRLVK
Subjt:  MADSAALKTYPPPSPSPS--NVRRRNSIPSSAIMPPLNGTVSP-PPPSPIDLRLLSKSSSQSYTSLKDILPSAA-AVNSPTA---ANSGYEISIRNRLVK

Query:  QAAWAYLQPMSASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL
        QAAWAYLQPMSASSYSAGPN FHRFWLRFS GNPI+ CLGFIRG IIP+IIRV R CIC+
Subjt:  QAAWAYLQPMSASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL

XP_023544532.1 uncharacterized protein LOC111804081 [Cucurbita pepo subsp. pepo] | e-value: 3.8e-57 | % identity: 82.1
Query:  MADSAALKTYPPPSPSPS----NVRRRNSIPSSAIMPPLNGTVSP-PPPSPIDLRLLSKSSSQSYTSLKDILPSAA-AVNSPTA---ANSGYEISIRNRL
        MA SAA KTYPPPSPSPS    N+ RR+ IPSSA MPP NGT+SP PPPSPIDLRLLSKSSSQSYTSLKDILPS+A AVNSPTA   ANSGYEI IRNRL
Subjt:  MADSAALKTYPPPSPSPS----NVRRRNSIPSSAIMPPLNGTVSP-PPPSPIDLRLLSKSSSQSYTSLKDILPSAA-AVNSPTA---ANSGYEISIRNRL

Query:  VKQAAWAYLQPMSASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL
        VKQAAWAYLQPMSASSYSAGPN FH FWLRFS GNPI+ CLGFIRG IIP+IIRV R CICL
Subjt:  VKQAAWAYLQPMSASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL

XP_038883302.1 uncharacterized protein LOC120074291 [Benincasa hispida] | e-value: 7.6e-58 | % identity: 83.95
Query:  MADSAALKTYPPPSPSPSNVRRRNSIPSSAIMPPLNGTVS----PPPPSPIDLRLLS-KSSSQSYTSLKDILPSA-AAVNSPTA---ANSGYEISIRNRL
        MA SAA  TYPPPSPS SN+R R  IPSS  MPP NGTVS    PPPPSPI+LRLLS KSSSQSYTSLKDILPSA AAVNSPTA   ANSGYEISIRNRL
Subjt:  MADSAALKTYPPPSPSPSNVRRRNSIPSSAIMPPLNGTVS----PPPPSPIDLRLLS-KSSSQSYTSLKDILPSA-AAVNSPTA---ANSGYEISIRNRL

Query:  VKQAAWAYLQPMSASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL
        VKQAAWAYLQPMSASSYSAGPNFFHRFWLRFS  NPI GCLGFIRG+IIPAIIRV RFCICL
Subjt:  VKQAAWAYLQPMSASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1BX34 uncharacterized protein LOC111005510 | e-value: 8.7e-52 | % identity: 82
Query:  PPPSPSPSNVRRRNS--IPSSAIMPPLNGTV-SPPPPSPIDLRLLS-KSSSQSYTSLKDILPSAAAVNSPTAA---NSGYEISIRNRLVKQAAWAYLQPM
        P PS SP  +RRRNS  IPSSA M P +GTV  PPPPSPIDLRLLS KSSSQSYTSLKDILPS AAVNSPTAA   NSGYEISIRNRLVKQAAWAYLQPM
Subjt:  PPPSPSPSNVRRRNS--IPSSAIMPPLNGTV-SPPPPSPIDLRLLS-KSSSQSYTSLKDILPSAAAVNSPTAA---NSGYEISIRNRLVKQAAWAYLQPM

Query:  SASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL
        SASSYSAGPNFFHR  LR SAGNPI  CLGFIRG++IPAI+RVIR CICL
Subjt:  SASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL

A0A6J1EDY2 uncharacterized protein LOC111433319 | e-value: 5.3e-57 | % identity: 82.5
Query:  MADSAALKTYPPPSPSPS--NVRRRNSIPSSAIMPPLNGTVSP-PPPSPIDLRLLSKSSSQSYTSLKDILPSAA-AVNSPTA---ANSGYEISIRNRLVK
        MA SAA KTYPPPSPSPS  N+ RR+ IPSSA MPP NGT+SP PPPSPIDLRLLSKSSSQSYTSLKDILPS+A AVNSPTA   ANSGYEI IRNRLVK
Subjt:  MADSAALKTYPPPSPSPS--NVRRRNSIPSSAIMPPLNGTVSP-PPPSPIDLRLLSKSSSQSYTSLKDILPSAA-AVNSPTA---ANSGYEISIRNRLVK

Query:  QAAWAYLQPMSASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL
        QAAWAYLQPMSASSYSAGPN FH  WLRFS GNPI+ CLGFIRG IIP+IIRV R CICL
Subjt:  QAAWAYLQPMSASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL

A0A6J1HF81 uncharacterized protein LOC111463426 | e-value: 2.9e-55 | % identity: 81.76
Query:  MADSAALKTYPPPSPSPSNVRRRNSIPSSAIMPPLNGTVSPPPPSPIDLRLL-SKSSSQSYTSLKDILP--SAAAVNSPTA---ANSGYEISIRNRLVKQ
        MADSAA KTY  PS   S + RR SIPSSAIMPP N  VSPP  SPIDLR+L SKSSSQSYTSLKDILP  SAAA NSPTA   ANSGYEISIRNRLVKQ
Subjt:  MADSAALKTYPPPSPSPSNVRRRNSIPSSAIMPPLNGTVSPPPPSPIDLRLL-SKSSSQSYTSLKDILP--SAAAVNSPTA---ANSGYEISIRNRLVKQ

Query:  AAWAYLQPMSASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL
        AAWAYLQPMSASSYS G NFFHRFWLRFSA NPIA  LGFIRGSIIPAI+RVIRFCICL
Subjt:  AAWAYLQPMSASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL

A0A6J1I5R0 uncharacterized protein LOC111471276 | e-value: 2.7e-53 | % identity: 80.25
Query:  MADSAALKTYPPPSPSPSNVRRRNSIPSSAIMPPLNGTVSPPPPSPIDLRLL-SKSSSQSYTSLKDILPSAAAVNSPTA---ANSGYEISIRNRLVKQAA
        MADSAA +TY  PS   S + RR SIPSSAIMPP N  VSPP  SPIDLR+L SKSS QSYTSLKDILPS AA NSPTA   ANSGYEISIRNRLVKQAA
Subjt:  MADSAALKTYPPPSPSPSNVRRRNSIPSSAIMPPLNGTVSPPPPSPIDLRLL-SKSSSQSYTSLKDILPSAAAVNSPTA---ANSGYEISIRNRLVKQAA

Query:  WAYLQPMSASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL
        WAYLQPMSASSYS G N FHRFWLRFSA NPIA  LGFIRGSIIPAI+RVIRFCICL
Subjt:  WAYLQPMSASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL

A0A6J1IS66 uncharacterized protein LOC111478820 | e-value: 2.8e-58 | % identity: 83.12
Query:  MADSAALKTYPPPSPSPS--NVRRRNSIPSSAIMPPLNGTVSP-PPPSPIDLRLLSKSSSQSYTSLKDILPSAA-AVNSPTA---ANSGYEISIRNRLVK
        MA SAA KTYPPPSPSPS  N+ RR+ IPSSA MPP NGT+SP PPPSPIDLRLLSKSSSQSYTSLKDILPS+A AVNSPTA   ANSGYEI IRNRLVK
Subjt:  MADSAALKTYPPPSPSPS--NVRRRNSIPSSAIMPPLNGTVSP-PPPSPIDLRLLSKSSSQSYTSLKDILPSAA-AVNSPTA---ANSGYEISIRNRLVK

Query:  QAAWAYLQPMSASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL
        QAAWAYLQPMSASSYSAGPN FHRFWLRFS GNPI+ CLGFIRG IIP+IIRV R CIC+
Subjt:  QAAWAYLQPMSASSYSAGPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT3G52520.1 unknown protein | e-value: 1.5e-08 | % identity: 47.87
Query:  NGTVSPPPPSPIDLRLLSKSSSQS-----YTSLKDILPSAA-AVNSP------TAANSGYEISIRNRLVKQAAWAYLQPMSASSYSAGPNFFHR
        NG VS      +DL L+S + S       YTSLKDILPS++  V+SP       +A SG  I+IRNRLVKQAA +YLQP S  + S+ P+F  R
Subjt:  NGTVSPPPPSPIDLRLLSKSSSQS-----YTSLKDILPSAA-AVNSP------TAANSGYEISIRNRLVKQAAWAYLQPMSASSYSAGPNFFHR

AT5G06280.1 unknown protein | e-value: 2.7e-13 | % identity: 43.51
Query:  PPSPSP-SNVRRRNSIPSSAIM--PPLNGTVSPPPPSPIDLRLLS--KSSSQSYTSLKDI----------LPSAAAVNSPTAANSGYEISIRNRLVKQAA
        PPS SP + +RR  SI +   +  P +   ++  PPS  D  L+S   SS  +YTSL+DI          LPS     SP  + +  +ISIRNRLVKQAA
Subjt:  PPSPSP-SNVRRRNSIPSSAIM--PPLNGTVSPPPPSPIDLRLLS--KSSSQSYTSLKDI----------LPSAAAVNSPTAANSGYEISIRNRLVKQAA

Query:  WAYLQP--MSASSYSAGPNFFHRFWLRFSAG
         +YLQP  +++S  SAG  FF R WL  SAG
Subjt:  WAYLQP--MSASSYSAGPNFFHRFWLRFSAG

AT5G06280.3 unknown protein | e-value: 2.7e-13 | % identity: 43.51
Query:  PPSPSP-SNVRRRNSIPSSAIM--PPLNGTVSPPPPSPIDLRLLS--KSSSQSYTSLKDI----------LPSAAAVNSPTAANSGYEISIRNRLVKQAA
        PPS SP + +RR  SI +   +  P +   ++  PPS  D  L+S   SS  +YTSL+DI          LPS     SP  + +  +ISIRNRLVKQAA
Subjt:  PPSPSP-SNVRRRNSIPSSAIM--PPLNGTVSPPPPSPIDLRLLS--KSSSQSYTSLKDI----------LPSAAAVNSPTAANSGYEISIRNRLVKQAA

Query:  WAYLQP--MSASSYSAGPNFFHRFWLRFSAG
         +YLQP  +++S  SAG  FF R WL  SAG
Subjt:  WAYLQP--MSASSYSAGPNFFHRFWLRFSAG


Sequences
CDS sequence
ATGGCCGATTCCGCCGCACTAAAAACCTATCCACCGCCGTCGCCATCACCCTCAAATGTTCGCCGCAGAAACTCGATTCCCTCCTCCGCCATCATGCCTCCTCTTAACGG
CACCGTTTCTCCTCCTCCGCCTTCGCCGATTGACTTGAGGCTCCTCTCCAAGTCCTCTTCTCAATCCTACACCTCTCTCAAGGACATCCTCCCCTCCGCCGCCGCCGTCA
ACTCTCCCACCGCCGCCAACTCCGGCTACGAAATCTCTATCCGCAACCGCCTCGTTAAGCAGGCCGCTTGGGCTTATCTCCAACCTATGTCCGCTTCTTCCTATTCCGCC
GGCCCAAATTTCTTCCACCGTTTCTGGCTCCGATTCTCCGCCGGAAATCCGATCGCCGGTTGTCTTGGATTTATCAGAGGAAGCATAATTCCGGCTATAATCCGAGTCAT
TCGGTTCTGCATTTGCTTGTGA
mRNA sequence
ATGGCCGATTCCGCCGCACTAAAAACCTATCCACCGCCGTCGCCATCACCCTCAAATGTTCGCCGCAGAAACTCGATTCCCTCCTCCGCCATCATGCCTCCTCTTAACGG
CACCGTTTCTCCTCCTCCGCCTTCGCCGATTGACTTGAGGCTCCTCTCCAAGTCCTCTTCTCAATCCTACACCTCTCTCAAGGACATCCTCCCCTCCGCCGCCGCCGTCA
ACTCTCCCACCGCCGCCAACTCCGGCTACGAAATCTCTATCCGCAACCGCCTCGTTAAGCAGGCCGCTTGGGCTTATCTCCAACCTATGTCCGCTTCTTCCTATTCCGCC
GGCCCAAATTTCTTCCACCGTTTCTGGCTCCGATTCTCCGCCGGAAATCCGATCGCCGGTTGTCTTGGATTTATCAGAGGAAGCATAATTCCGGCTATAATCCGAGTCAT
TCGGTTCTGCATTTGCTTGTGA
Protein sequence
MADSAALKTYPPPSPSPSNVRRRNSIPSSAIMPPLNGTVSPPPPSPIDLRLLSKSSSQSYTSLKDILPSAAAVNSPTAANSGYEISIRNRLVKQAAWAYLQPMSASSYSA
GPNFFHRFWLRFSAGNPIAGCLGFIRGSIIPAIIRVIRFCICL
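
As a quick consistency check on this record, the minimal sketch below compares the genomic span with the CDS and protein lengths. The lengths (462 nucleotides and 153 residues) were counted from the sequences shown above. The helper name `span_length` is illustrative and not part of CuGenDB, and the interval is assumed to be 1-based and inclusive.

```python
# Values taken from the record above. The two lengths were counted from the
# CDS and protein sequences as displayed; they are not fields of the database.
location = "LG03:68814534..68814995"
cds_length = 462       # nucleotides, including the terminal TGA stop codon
protein_length = 153   # amino-acid residues


def span_length(loc: str) -> int:
    """Return the length of an interval written as 'chrom:start..end'.

    Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
    """
    _, coords = loc.split(":")
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


locus_length = span_length(location)

# True when the annotated locus is exactly as long as the CDS.
locus_matches_cds = locus_length == cds_length

# True when the CDS encodes exactly the protein plus one stop codon.
codons_match = cds_length == 3 * (protein_length + 1)

print(locus_length, locus_matches_cds, codons_match)
```

If both checks hold, the locus, CDS, and protein lengths agree with one another, which is consistent with the mRNA and CDS sequences being identical in this record. The page itself does not state the exon structure.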