; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007687 (gene) of Snake gourd v1 genome

Gene IDTan0007687
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionExostosin domain-containing protein
Genome locationLG09:66618819..66619819
RNA-Seq ExpressionTan0007687
SyntenyTan0007687
Gene Ontology termsGO:0006486 - protein glycosylation (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR004263 - Exostosin-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008443936.1 PREDICTED: probable glycosyltransferase At5g20260 [Cucumis melo]1.0e-3264.75Show/hide
Query:  ASLDFSHKLPLFLLPPFF-LLLLLLCLFPPNQ--NPFPLIISQNISFFHQSKQP----PPPQLLLQFPPTPAPSAVEPPSP----HFSGHKKREKIDEDL
        +S +F HKL  FLL PFF LLLLLLC FPPN+  NPF  I+S+N+  FH  KQP     PPQ  LQFPP  APSA+ PPSP      +  KK E I+E L
Subjt:  ASLDFSHKLPLFLLPPFF-LLLLLLCLFPPNQ--NPFPLIISQNISFFHQSKQP----PPPQLLLQFPPTPAPSAVEPPSP----HFSGHKKREKIDEDL

Query:  ARARAAIREAIVTQNYTSEKVESFIPRGRVYRNAYAFHQ
        A ARAAIR+AIVT+NYTSEK ESFIPRGRVYRNAYAFHQ
Subjt:  ARARAAIREAIVTQNYTSEKVESFIPRGRVYRNAYAFHQ

XP_011655344.1 probable glycosyltransferase At5g20260 [Cucumis sativus]1.4e-2964.96Show/hide
Query:  ASLDFSHKLPLFLLPPFF-LLLLLLCLFPPNQ--NPFPLIISQNISFFHQSKQP----PPPQLLLQFPPTPA-PSAVEPPSPHFSGHKKR-EKIDEDLAR
        +SL+F HKL  FLL PFF LLLLLLC FPPN   NPF  I+S+N+  FH SKQP     PPQ  LQFPPT A  +A   P  + S  KK+ E I+E LA 
Subjt:  ASLDFSHKLPLFLLPPFF-LLLLLLCLFPPNQ--NPFPLIISQNISFFHQSKQP----PPPQLLLQFPPTPA-PSAVEPPSPHFSGHKKR-EKIDEDLAR

Query:  ARAAIREAIVTQNYTSEKVESFIPRGRVYRNAYAFHQ
        ARAAIR AIVT+NYTSEK ESFIPRGRVYRNAYAFHQ
Subjt:  ARAAIREAIVTQNYTSEKVESFIPRGRVYRNAYAFHQ

XP_022135540.1 probable glycosyltransferase At5g20260 [Momordica charantia]2.7e-3672.09Show/hide
Query:  FSHK-LPLFLLPPFFLLLLLLCLFPPNQNPFPLIISQNISFFHQSKQPPPPQLLLQFPPTPAPSAVEPPSPHFSGHKK----REKIDEDLARARAAIREA
        FSH+  PLFLLP FFLLLLLLC FPPNQN FP  ISQNI FFH  K PPP     Q  P  APS+VEPP    SGHKK     EKI+EDLARARAAIREA
Subjt:  FSHK-LPLFLLPPFFLLLLLLCLFPPNQNPFPLIISQNISFFHQSKQPPPPQLLLQFPPTPAPSAVEPPSPHFSGHKK----REKIDEDLARARAAIREA

Query:  IVTQNYTSEKVESFIPRGRVYRNAYAFHQ
        IV +NYTSE+ ESFIPRGRVYRNAYAFHQ
Subjt:  IVTQNYTSEKVESFIPRGRVYRNAYAFHQ

XP_022987712.1 probable glycosyltransferase At5g20260 isoform X1 [Cucurbita maxima]3.6e-3372.87Show/hide
Query:  SHKLPLFLLPPFFLLLLLLCLFPPN-QNPFPLIISQNISFFHQSKQ--PPPPQLLLQFPPTPAPSAVEP---PSPHFSGHKKREKIDEDLARARAAIREA
        SH+LPLFLLPP   LLLLL LFPPN  NPFPLI +QNI FFH SKQ  PPPPQL  QFP    P+AVEP   P P  S +K   +I+EDLARARAAIREA
Subjt:  SHKLPLFLLPPFFLLLLLLCLFPPN-QNPFPLIISQNISFFHQSKQ--PPPPQLLLQFPPTPAPSAVEP---PSPHFSGHKKREKIDEDLARARAAIREA

Query:  IVTQNYTSEKVESFIPRGRVYRNAYAFHQ
        IV +NYTSEKVESFIPRGRVYRNAYAFHQ
Subjt:  IVTQNYTSEKVESFIPRGRVYRNAYAFHQ

XP_038879941.1 probable glycosyltransferase At5g20260 [Benincasa hispida]4.4e-3975.76Show/hide
Query:  MASLDFSHKLPLFLLPP-FFLLLLLLCLFPP-NQNPFPLIISQNISFFHQSKQPPPP-QLLLQFPPTPAPSAVEPPSPHFSGHKKREKIDEDLARARAAI
        MASLDFSHKLP FLL P FFLLLLLLC FPP NQNPF  II QN SFFH SKQPPPP Q  LQFPP  APSAV  P  H S HKK + I++ LA ARAAI
Subjt:  MASLDFSHKLPLFLLPP-FFLLLLLLCLFPP-NQNPFPLIISQNISFFHQSKQPPPP-QLLLQFPPTPAPSAVEPPSPHFSGHKKREKIDEDLARARAAI

Query:  REAIVTQNYTSEKVESFIPRGRVYRNAYAFHQ
        R AI+T+NYTSEK ESFIPRGRVYRNAYAFHQ
Subjt:  REAIVTQNYTSEKVESFIPRGRVYRNAYAFHQ

TrEMBL top hitse value%identityAlignment
A0A0A0LTL1 Exostosin domain-containing protein6.8e-3064.96Show/hide
Query:  ASLDFSHKLPLFLLPPFF-LLLLLLCLFPPNQ--NPFPLIISQNISFFHQSKQP----PPPQLLLQFPPTPA-PSAVEPPSPHFSGHKKR-EKIDEDLAR
        +SL+F HKL  FLL PFF LLLLLLC FPPN   NPF  I+S+N+  FH SKQP     PPQ  LQFPPT A  +A   P  + S  KK+ E I+E LA 
Subjt:  ASLDFSHKLPLFLLPPFF-LLLLLLCLFPPNQ--NPFPLIISQNISFFHQSKQP----PPPQLLLQFPPTPA-PSAVEPPSPHFSGHKKR-EKIDEDLAR

Query:  ARAAIREAIVTQNYTSEKVESFIPRGRVYRNAYAFHQ
        ARAAIR AIVT+NYTSEK ESFIPRGRVYRNAYAFHQ
Subjt:  ARAAIREAIVTQNYTSEKVESFIPRGRVYRNAYAFHQ

A0A1S3B8R4 probable glycosyltransferase At5g202605.0e-3364.75Show/hide
Query:  ASLDFSHKLPLFLLPPFF-LLLLLLCLFPPNQ--NPFPLIISQNISFFHQSKQP----PPPQLLLQFPPTPAPSAVEPPSP----HFSGHKKREKIDEDL
        +S +F HKL  FLL PFF LLLLLLC FPPN+  NPF  I+S+N+  FH  KQP     PPQ  LQFPP  APSA+ PPSP      +  KK E I+E L
Subjt:  ASLDFSHKLPLFLLPPFF-LLLLLLCLFPPNQ--NPFPLIISQNISFFHQSKQP----PPPQLLLQFPPTPAPSAVEPPSP----HFSGHKKREKIDEDL

Query:  ARARAAIREAIVTQNYTSEKVESFIPRGRVYRNAYAFHQ
        A ARAAIR+AIVT+NYTSEK ESFIPRGRVYRNAYAFHQ
Subjt:  ARARAAIREAIVTQNYTSEKVESFIPRGRVYRNAYAFHQ

A0A6J1C1B6 probable glycosyltransferase At5g202601.3e-3672.09Show/hide
Query:  FSHK-LPLFLLPPFFLLLLLLCLFPPNQNPFPLIISQNISFFHQSKQPPPPQLLLQFPPTPAPSAVEPPSPHFSGHKK----REKIDEDLARARAAIREA
        FSH+  PLFLLP FFLLLLLLC FPPNQN FP  ISQNI FFH  K PPP     Q  P  APS+VEPP    SGHKK     EKI+EDLARARAAIREA
Subjt:  FSHK-LPLFLLPPFFLLLLLLCLFPPNQNPFPLIISQNISFFHQSKQPPPPQLLLQFPPTPAPSAVEPPSPHFSGHKK----REKIDEDLARARAAIREA

Query:  IVTQNYTSEKVESFIPRGRVYRNAYAFHQ
        IV +NYTSE+ ESFIPRGRVYRNAYAFHQ
Subjt:  IVTQNYTSEKVESFIPRGRVYRNAYAFHQ

A0A6J1H9Z5 probable glycosyltransferase At5g20260 isoform X12.2e-2866.94Show/hide
Query:  SHKLPLFLLPPFFLLLLLLCLFPPN-QNPFPLIISQNISFFHQSKQPPPPQLLLQFPPTPAPSAVEPPSPHFSGHKKREKIDEDLARARAAIREAIVTQN
        SH+LPLFLLPP  LLLLLL LF PN  NPFPLII+QNI FFH SK PPP       PP P PSA              ++I+EDLARARA IREAIV +N
Subjt:  SHKLPLFLLPPFFLLLLLLCLFPPN-QNPFPLIISQNISFFHQSKQPPPPQLLLQFPPTPAPSAVEPPSPHFSGHKKREKIDEDLARARAAIREAIVTQN

Query:  YTSEKVESFIPRGRVYRNAYAFHQ
        YTSE VESFIPRGRVYRNAYAFHQ
Subjt:  YTSEKVESFIPRGRVYRNAYAFHQ

A0A6J1JB42 probable glycosyltransferase At5g20260 isoform X11.7e-3372.87Show/hide
Query:  SHKLPLFLLPPFFLLLLLLCLFPPN-QNPFPLIISQNISFFHQSKQ--PPPPQLLLQFPPTPAPSAVEP---PSPHFSGHKKREKIDEDLARARAAIREA
        SH+LPLFLLPP   LLLLL LFPPN  NPFPLI +QNI FFH SKQ  PPPPQL  QFP    P+AVEP   P P  S +K   +I+EDLARARAAIREA
Subjt:  SHKLPLFLLPPFFLLLLLLCLFPPN-QNPFPLIISQNISFFHQSKQ--PPPPQLLLQFPPTPAPSAVEP---PSPHFSGHKKREKIDEDLARARAAIREA

Query:  IVTQNYTSEKVESFIPRGRVYRNAYAFHQ
        IV +NYTSEKVESFIPRGRVYRNAYAFHQ
Subjt:  IVTQNYTSEKVESFIPRGRVYRNAYAFHQ

SwissProt top hitse value%identityAlignment
Q3E9A4 Probable glycosyltransferase At5g202606.4e-0937.4Show/hide
Query:  LPLFLLPPFFLLLLLLCLFPPNQNPFPLIISQNISFFHQSKQ---PPPPQLLLQFPPTPAPSAVEPPSPHFSGHKKREKIDEDLARARAAIREAIVTQNY
        + L L+P   LLL+LL  +  + +  P + S  +S F  +      P P L ++F    +  +     P   G+ KR  I+E LA++R+AIREA+  + +
Subjt:  LPLFLLPPFFLLLLLLCLFPPNQNPFPLIISQNISFFHQSKQ---PPPPQLLLQFPPTPAPSAVEPPSPHFSGHKKREKIDEDLARARAAIREAIVTQNY

Query:  TSEKVESFIPRGRVYRNAYAFHQ
         S+K E+F+PRG VYRNA+AFHQ
Subjt:  TSEKVESFIPRGRVYRNAYAFHQ

Arabidopsis top hitse value%identityAlignment
AT3G07620.1 Exostosin family protein7.5e-0540.98Show/hide
Query:  SGHKKRE-KIDEDLARARAAIREAIVTQNYTSEKV---ESFIPRGRVYRNAYAFHQLRFLL
        SG+ KR+ K++ +LA AR  IREA +  + T+      E ++P G +YRN YAFH+   L+
Subjt:  SGHKKRE-KIDEDLARARAAIREAIVTQNYTSEKV---ESFIPRGRVYRNAYAFHQLRFLL

AT3G42180.1 Exostosin family protein7.5e-0533.06Show/hide
Query:  PLFLLPPFFLLLLLLCLFPPNQNP----FPLIISQNISFFHQSKQPPPPQLLLQFPPTPAPSAVEPPSPHFSGHKKREKIDEDLARARAAIREAIVTQNY
        PL L+    L  LL   FP N++P    F  +   ++     + Q       L  PP                    EK +E+L +ARAAIR A+  +N 
Subjt:  PLFLLPPFFLLLLLLCLFPPNQNP----FPLIISQNISFFHQSKQPPPPQLLLQFPPTPAPSAVEPPSPHFSGHKKREKIDEDLARARAAIREAIVTQNY

Query:  TS-EKVESFIPRGRVYRNAYAFHQ
        TS E+V ++IP G++YRN++AFHQ
Subjt:  TS-EKVESFIPRGRVYRNAYAFHQ

AT3G42180.3 Exostosin family protein7.5e-0533.06Show/hide
Query:  PLFLLPPFFLLLLLLCLFPPNQNP----FPLIISQNISFFHQSKQPPPPQLLLQFPPTPAPSAVEPPSPHFSGHKKREKIDEDLARARAAIREAIVTQNY
        PL L+    L  LL   FP N++P    F  +   ++     + Q       L  PP                    EK +E+L +ARAAIR A+  +N 
Subjt:  PLFLLPPFFLLLLLLCLFPPNQNP----FPLIISQNISFFHQSKQPPPPQLLLQFPPTPAPSAVEPPSPHFSGHKKREKIDEDLARARAAIREAIVTQNY

Query:  TS-EKVESFIPRGRVYRNAYAFHQ
        TS E+V ++IP G++YRN++AFHQ
Subjt:  TS-EKVESFIPRGRVYRNAYAFHQ

AT5G20260.1 Exostosin family protein4.5e-1037.4Show/hide
Query:  LPLFLLPPFFLLLLLLCLFPPNQNPFPLIISQNISFFHQSKQ---PPPPQLLLQFPPTPAPSAVEPPSPHFSGHKKREKIDEDLARARAAIREAIVTQNY
        + L L+P   LLL+LL  +  + +  P + S  +S F  +      P P L ++F    +  +     P   G+ KR  I+E LA++R+AIREA+  + +
Subjt:  LPLFLLPPFFLLLLLLCLFPPNQNPFPLIISQNISFFHQSKQ---PPPPQLLLQFPPTPAPSAVEPPSPHFSGHKKREKIDEDLARARAAIREAIVTQNY

Query:  TSEKVESFIPRGRVYRNAYAFHQ
         S+K E+F+PRG VYRNA+AFHQ
Subjt:  TSEKVESFIPRGRVYRNAYAFHQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTCCTTAGATTTCTCTCACAAACTCCCTTTATTTTTACTTCCACCTTTCTTCCTCCTCCTTCTTCTTCTTTGCCTTTTCCCACCAAATCAAAACCCTTTCCCTCT
GATAATATCTCAAAATATTTCCTTTTTCCACCAATCCAAACAACCCCCACCGCCGCAACTCCTCCTTCAATTTCCTCCCACCCCCGCCCCATCCGCCGTGGAGCCGCCGT
CGCCTCACTTCTCCGGCCACAAGAAGCGGGAAAAGATAGACGAAGATTTGGCTCGAGCTCGAGCAGCGATTAGAGAAGCGATTGTGACTCAGAACTATACGTCGGAAAAG
GTGGAGAGTTTCATACCTAGAGGACGAGTTTACAGAAACGCATACGCTTTTCATCAGTTAAGATTCCTACTGGCTTTTTTCATTTACTTTCCATCTTTGGCTCTATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCTCCTTAGATTTCTCTCACAAACTCCCTTTATTTTTACTTCCACCTTTCTTCCTCCTCCTTCTTCTTCTTTGCCTTTTCCCACCAAATCAAAACCCTTTCCCTCT
GATAATATCTCAAAATATTTCCTTTTTCCACCAATCCAAACAACCCCCACCGCCGCAACTCCTCCTTCAATTTCCTCCCACCCCCGCCCCATCCGCCGTGGAGCCGCCGT
CGCCTCACTTCTCCGGCCACAAGAAGCGGGAAAAGATAGACGAAGATTTGGCTCGAGCTCGAGCAGCGATTAGAGAAGCGATTGTGACTCAGAACTATACGTCGGAAAAG
GTGGAGAGTTTCATACCTAGAGGACGAGTTTACAGAAACGCATACGCTTTTCATCAGTTAAGATTCCTACTGGCTTTTTTCATTTACTTTCCATCTTTGGCTCTATAA
Protein sequenceShow/hide protein sequence
MASLDFSHKLPLFLLPPFFLLLLLLCLFPPNQNPFPLIISQNISFFHQSKQPPPPQLLLQFPPTPAPSAVEPPSPHFSGHKKREKIDEDLARARAAIREAIVTQNYTSEK
VESFIPRGRVYRNAYAFHQLRFLLAFFIYFPSLAL