CuGenDBv2

Tan0002198 (gene) of Snake gourd v1 genome

Gene ID: Tan0002198
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein of unknown function (DUF1218)
Genome location: LG05:84644119..84646129
RNA-Seq Expression: Tan0002198
Synteny: Tan0002198
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009606 - Modifying wall lignin-1/2
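
The genome location field packs a sequence ID and a coordinate range into one string. A minimal sketch of splitting it apart and computing the span is given here; the helper names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption that the record itself does not state.

```python
import re


def parse_location(loc):
    """Split 'SEQID:START..END' into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both
    # endpoints count toward the length.
    return end - start + 1


seqid, start, end = parse_location("LG05:84644119..84646129")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 2011 bp on LG05, which is longer than the mRNA shown under Sequences and would be consistent with the presence of introns.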


Homology
GenBank top hits: e-value, % identity, alignment
KAG6573252.1 hypothetical protein SDJN03_27139, partial [Cucurbita argyrosperma subsp. sororia] (e-value 4.2e-84, identity 91.11%)
Query:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL
        MGMLV+V VF+ DLVAFALAVAAEQRRTTA VVQSGNSKFCAY+SDIATGLGVGSLLVLFASQVI+MVASRCLCCGKALRPS SRAWAITLFITCWVCF+
Subjt:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL

Query:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL
        IAEICLLAASVRNAYHTKY+SS+MNE+ISCKMLRRGVFGAGAAFIVFTCVASELFY+SFSKAH+QTSSFAKDTGIRMA++
Subjt:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL

XP_022139805.1 uncharacterized protein LOC111010632 [Momordica charantia] (e-value 2.1e-83, identity 91.11%)
Query:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL
        M MLVLV VF+FDLVAFALAVAAEQRRTTA VVQSGN++FCAY+SDIATGLGVGSLL LFASQVIIMVASRCLCCGK+LRPS SRAWA+TLFITCWVCFL
Subjt:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL

Query:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL
        IAE CLLAASVRNAYHTKY+SS++NEQISCKMLR+GVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL
Subjt:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL

XP_022954617.1 uncharacterized protein LOC111456828 [Cucurbita moschata] (e-value 2.5e-84, identity 91.67%)
Query:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL
        MGMLV+V VF+ DLVAFALAVAAEQRRTTA VVQSGNSKFCAY+SDIATGLGVGSLLVLFASQVI+MVASRCLCCGKALRPS SRAWAITLFITCWVCFL
Subjt:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL

Query:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL
        IAEICLLAASVRNAYHTKY+SS+MNE+ISCKMLRRGVFGAGAAFIVFTCVASELFY+SFSKAH+QTSSFAKDTGIRMA++
Subjt:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL

XP_023542784.1 uncharacterized protein LOC111802591 [Cucurbita pepo subsp. pepo] (e-value 5.5e-84, identity 91.11%)
Query:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL
        MGMLV+V VF+ DLVAFALAVAAEQRRTTA VVQSGNSKFCAY+SDIATGLGVGSLLVLFA+QVI+MVASRCLCCGKALRPS SRAWAITLFITCWVCFL
Subjt:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL

Query:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL
        IAEICLLAASVRNAYHTKY+SS+MNE+ISCKMLRRGVFGAGAAFIVFTCVASELFY+SFSKAH+QTSSFAKDTGIRMA++
Subjt:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL

XP_038905949.1 uncharacterized protein LOC120091872 [Benincasa hispida] (e-value 1.4e-82, identity 90%)
Query:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL
        MGMLVL+ VFIFDLVAFALAVAAEQRRTTA VVQSG SKFCAY+SD+ATGLGVGSLL+LFASQVI+MVASRCLCCG+ LRP  SRAWAITLFITCW+CF 
Subjt:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL

Query:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL
        IAEICLLAASVRNAYHTKY+SSLMNEQISCKMLRRGVFGAGAAFIVFTC ASELFYVSFSKAH QTSSFAKDTGIRMA+L
Subjt:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL

TrEMBL top hits: e-value, % identity, alignment
A0A1S3B5C0 uncharacterized protein LOC103486188 (e-value 7.3e-82, identity 88.33%)
Query:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL
        MGMLVL+ VFIFDLVAFALAVAAEQRRTTA V QSGNS+FCAY+SDIATGLGVGS L+LFASQVI+MVASRCLCCG+ LRP  SRAWAITLFITCW+CF 
Subjt:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL

Query:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL
        IAEICLLAASVRNAYHTKY+SS+MNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAH +T+SFAKDTGIRMA+L
Subjt:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL

A0A5A7UTY8 DnaJ subfamily C member 22 (e-value 7.3e-82, identity 88.33%)
Query:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL
        MGMLVL+ VFIFDLVAFALAVAAEQRRTTA V QSGNS+FCAY+SDIATGLGVGS L+LFASQVI+MVASRCLCCG+ LRP  SRAWAITLFITCW+CF 
Subjt:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL

Query:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL
        IAEICLLAASVRNAYHTKY+SS+MNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAH +T+SFAKDTGIRMA+L
Subjt:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL

A0A6J1CEZ1 uncharacterized protein LOC111010632 (e-value 1.0e-83, identity 91.11%)
Query:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL
        M MLVLV VF+FDLVAFALAVAAEQRRTTA VVQSGN++FCAY+SDIATGLGVGSLL LFASQVIIMVASRCLCCGK+LRPS SRAWA+TLFITCWVCFL
Subjt:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL

Query:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL
        IAE CLLAASVRNAYHTKY+SS++NEQISCKMLR+GVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL
Subjt:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL

A0A6J1GRF5 uncharacterized protein LOC111456828 (e-value 1.2e-84, identity 91.67%)
Query:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL
        MGMLV+V VF+ DLVAFALAVAAEQRRTTA VVQSGNSKFCAY+SDIATGLGVGSLLVLFASQVI+MVASRCLCCGKALRPS SRAWAITLFITCWVCFL
Subjt:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL

Query:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL
        IAEICLLAASVRNAYHTKY+SS+MNE+ISCKMLRRGVFGAGAAFIVFTCVASELFY+SFSKAH+QTSSFAKDTGIRMA++
Subjt:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL

A0A6J1K1A5 uncharacterized protein LOC111490189 (e-value 1.1e-82, identity 90.56%)
Query:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL
        MGMLV+V VF+ DLVAFALAVAAEQRRTTA VV SGNSKFCAY+SDIATGLGVGSLLVLFASQVI+MVASRCLC GKALRPS SRAWAITLFITCWVCFL
Subjt:  MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFL

Query:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL
        IAEICLLAASVRNAYHTKY+SS+MNE+ISCKMLRRG+FGAGAAFIVFTCVASELFYVSFSKAH+QTSSFAKDTGIRMA++
Subjt:  IAEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL

SwissProt top hits: e-value, % identity, alignment
No hits found

Arabidopsis top hits: e-value, % identity, alignment
AT1G13380.1 Protein of unknown function (DUF1218) (e-value 4.3e-34, identity 45.34%)
Query:  LVLVAVFIFDLVAFALAVAAEQRRTTAMVVQS--GNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFLI
        LV + V    LVAF  ++AAE+RR+    +Q    N+ FC Y+SD+ATG GVG+ L L +S+ ++M  ++C+C G+ L P   RAW+I  FI+ W+ FL+
Subjt:  LVLVAVFIFDLVAFALAVAAEQRRTTAMVVQS--GNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFLI

Query:  AEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKA
        AE C++A + +NAYHTKY+SS   +  SC  LR+G+F AGA FIV T V +  +Y+ F+K+
Subjt:  AEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKA

AT1G52910.1 Protein of unknown function (DUF1218) (e-value 3.2e-45, identity 56.71%)
Query:  LVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKF--CAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFLI
        LV++ VFI DL+A  LA+AAEQRR+   VV  G  +F  C Y SDIAT  G G+ ++LF SQVIIMVASRC CCGKAL+P  SRA  I LF+ CWV FLI
Subjt:  LVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKF--CAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFLI

Query:  AEICLLAASVRNAYHTKYISSL-MNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHD
        AE+CLLA S+RNAYHT Y     +    SC+++R+GVF AGA+F +FT + S+ +Y+S+S+A D
Subjt:  AEICLLAASVRNAYHTKYISSL-MNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHD

AT1G61065.1 Protein of unknown function (DUF1218) (e-value 2.9e-54, identity 63.69%)
Query:  MLVLVAVFIFDLVAFALAVAAEQRRTTAMVV-QSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFLI
        +L+L+ VF+FDL+AF LAVAAEQRRTT  +  +S +  +C Y+ DIATGLGVGS LVL ASQ++IMVASRCLCCG+AL PS SR+WAI LFIT WV F I
Subjt:  MLVLVAVFIFDLVAFALAVAAEQRRTTAMVV-QSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFLI

Query:  AEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL
        A++CLLA SVRNAYHTKY     N   SC+ LR+GVFGAGAAFIV T + SEL+YV+ S+A D   S  +D GIRM++L
Subjt:  AEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL

AT3G15480.1 Protein of unknown function (DUF1218) (e-value 5.1e-43, identity 54.27%)
Query:  LVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSK--FCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFLI
        LV++ VFI DL+A  LA+AAEQRR+   V    + +  +C Y +DIAT  G G+ ++LF SQV+IM ASRC CCGK+L P  SRA AI LF+ CWV FLI
Subjt:  LVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSK--FCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFLI

Query:  AEICLLAASVRNAYHTKYISS-LMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHD
        AE+CLLAAS+RNAYHT+Y     + +  SC+++R+GVF AGAAF +FT + S+ +YV +S+A D
Subjt:  AEICLLAASVRNAYHTKYISS-LMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHD

AT4G27435.1 Protein of unknown function (DUF1218) (e-value 7.3e-42, identity 53.29%)
Query:  LVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSK--FCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFLI
        +V   VF+F+L+AF LAVAAEQRR+TA VVQ    +  +C Y+SD ATG GVG+ L   ASQ++IM+ SRC CCGK L+P  SRA A+ LFI  W+ FLI
Subjt:  LVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSK--FCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFLI

Query:  AEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSS
        AEICLLA SV NAYHTKY +  M+    C+ LR+GVF AGA+F+ F  + S+ +Y  +  A + + S
Subjt:  AEICLLAASVRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSS


Sequences
CDS sequence
ATGGGTATGCTCGTTTTGGTTGCGGTGTTTATCTTTGATCTGGTTGCTTTTGCGCTTGCTGTTGCTGCAGAGCAGAGAAGAACCACTGCCATGGTCGTTCAATCAGGCAA
TTCTAAATTCTGTGCCTATAATTCTGATATTGCAACTGGCTTAGGTGTGGGTTCACTTCTGGTCCTATTTGCTAGCCAAGTGATCATAATGGTGGCAAGTCGATGCTTAT
GCTGTGGGAAAGCTTTGCGACCGAGTCGTTCGAGGGCTTGGGCAATTACCCTTTTCATCACTTGCTGGGTATGTTTTCTCATTGCTGAGATCTGTCTGCTGGCTGCTTCG
GTGCGAAATGCGTATCATACCAAGTACATTAGTTCTCTGATGAACGAGCAAATTTCGTGCAAGATGCTGAGAAGAGGAGTGTTCGGGGCCGGGGCTGCTTTTATCGTCTT
CACATGTGTAGCATCAGAGCTGTTCTATGTTAGCTTTTCCAAGGCTCATGACCAGACTTCCTCCTTTGCCAAAGACACTGGCATTCGAATGGCAAACCTATAG
mRNA sequence
AAAAAAAATAAAAACAAAAAGCAGAGTTATTGCAGAGCTCACGGCCATTTGATTTTCCCATTCGTTCTTCTTATAAACCTCTGCAAGTGAAATCCATTTGCGAGCTGTTT
CTTTAGGTTGTTGGATCGTTGAGACATGGGTATGCTCGTTTTGGTTGCGGTGTTTATCTTTGATCTGGTTGCTTTTGCGCTTGCTGTTGCTGCAGAGCAGAGAAGAACCA
CTGCCATGGTCGTTCAATCAGGCAATTCTAAATTCTGTGCCTATAATTCTGATATTGCAACTGGCTTAGGTGTGGGTTCACTTCTGGTCCTATTTGCTAGCCAAGTGATC
ATAATGGTGGCAAGTCGATGCTTATGCTGTGGGAAAGCTTTGCGACCGAGTCGTTCGAGGGCTTGGGCAATTACCCTTTTCATCACTTGCTGGGTATGTTTTCTCATTGC
TGAGATCTGTCTGCTGGCTGCTTCGGTGCGAAATGCGTATCATACCAAGTACATTAGTTCTCTGATGAACGAGCAAATTTCGTGCAAGATGCTGAGAAGAGGAGTGTTCG
GGGCCGGGGCTGCTTTTATCGTCTTCACATGTGTAGCATCAGAGCTGTTCTATGTTAGCTTTTCCAAGGCTCATGACCAGACTTCCTCCTTTGCCAAAGACACTGGCATT
CGAATGGCAAACCTATAGAAGGTGTGAGGAGCTTCTTTATCAGAGTTGGGTTTTATAGGATTTTGTCAAACATGTGTGACTTGTTGATGTGAAAAATGAACAACTGAGAG
TAACAGATTACTTATTGAATACAGATGTACAAATATGATTTGATAGTTGAGTGAAGATTTCATGTTGTGGTTTGTGTCTTATGCTGTGTCAATGACTATGTTTTTCTTTA
GTATTTGAGCTAATCCTTGTCCGTC
Protein sequence
MGMLVLVAVFIFDLVAFALAVAAEQRRTTAMVVQSGNSKFCAYNSDIATGLGVGSLLVLFASQVIIMVASRCLCCGKALRPSRSRAWAITLFITCWVCFLIAEICLLAAS
VRNAYHTKYISSLMNEQISCKMLRRGVFGAGAAFIVFTCVASELFYVSFSKAHDQTSSFAKDTGIRMANL
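
The protein sequence is the conceptual translation of the CDS. A minimal sketch of that relationship follows, assuming the standard nuclear genetic code; the `translate` helper is hypothetical and is not part of CuGenDB. It translates the first 42 bases of the CDS listed above, which should correspond to the first 14 residues of the protein.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). Assumed to apply to this nuclear gene.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 bases of the CDS sequence shown in this record.
print(translate("ATGGGTATGCTCGTTTTGGTTGCGGTGTTTATCTTTGATCTG"))
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick consistency check on the record.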