CuGenDBv2

Tan0021033 (gene) of Snake gourd v1 genome

Gene ID: Tan0021033
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein of unknown function (DUF1195)
Genome location: LG03:74777627..74781386
RNA-Seq Expression: Tan0021033
Synteny: Tan0021033
Gene Ontology terms: GO:0008643 - carbohydrate transport (biological process)
                     GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR010608 - Protein of unknown function DUF1195
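
The genome location string above can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` is a hypothetical helper, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split 'SEQID:START..END' into its parts and compute the span.

    Assumes 1-based, inclusive coordinates, so span = end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1

seqid, start, end, span = parse_location("LG03:74777627..74781386")
print(seqid, start, end, span)  # LG03 74777627 74781386 3760
```

Under that assumption the gene model spans 3760 bp on LG03.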


Homology
GenBank top hits
XP_008440698.1 PREDICTED: uncharacterized protein LOC103485038 [Cucumis melo]  e value: 3.7e-85  % identity: 93.18
Query:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDD+DF+IHDDLDVLEMEEREKIVKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL
        RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSV S+V+DTPS+QST+AREFTK QK+ADKS Q A+KTGSKL
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL

XP_011658007.1 uncharacterized protein LOC101216891 [Cucumis sativus]  e value: 1.4e-84  % identity: 93.18
Query:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDD+DFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL
        RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSV S+V+DTPS+QST+AREFTK QK+ADKS Q A+KT SKL
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL

XP_022132386.1 uncharacterized protein LOC111005257 [Momordica charantia]  e value: 1.5e-81  % identity: 91.48
Query:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSD VDF+I+DDLDVLEMEEREK+VKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL
        RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSV SIVID  SEQSTSAR F KMQK+ADK+ +TA+ TGSKL
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL

XP_022978567.1 uncharacterized protein LOC111478506 [Cucurbita maxima]  e value: 2.3e-82  % identity: 90.91
Query:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTTIASAGKK+SSVSS+FGKGRYKFWAL AILLLAFWSMFTGTVSLRWSAGNLNGLSDD+DF++ +DLDVLEMEEREKIVKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL
        RLPRFWQEAFEAAYEDLTSE+PGDREAAISEIARMS+ SIV+DTPSEQSTSAREFTKMQK+ADKS  TA KTGSKL
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL

XP_038883913.1 uncharacterized protein LOC120074752 [Benincasa hispida]  e value: 7.6e-86  % identity: 94.32
Query:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDD+DF+IHDDLDVLEMEEREKIVKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL
        RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSV SIV+DTPSEQST+AREFTK QK+ADKS +TA+KTGSKL
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL
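
The % identity values in the hit list can be recomputed from the middle line of each alignment. The sketch below uses the two middle-line blocks of the first GenBank hit (XP_008440698.1) and assumes the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. `percent_identity` is a hypothetical helper written for illustration.

```python
def percent_identity(midline_blocks):
    """Return (identical, aligned_length, percent) for BLAST-style midlines.

    Each block is one middle line with its leading indentation removed.
    Letters count as identities; '+' and spaces do not.
    """
    midline = "".join(midline_blocks)
    identical = sum(ch.isalpha() for ch in midline)
    return identical, len(midline), 100.0 * identical / len(midline)

# Middle lines of the XP_008440698.1 alignment, copied from the record above.
blocks = [
    "MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDD+DF+IHDDLDVLEMEEREKIVKHMWDVYTNNRRI",
    "RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSV S+V+DTPS+QST+AREFTK QK+ADKS Q A+KTGSKL",
]

identical, length, pct = percent_identity(blocks)
print(identical, length, round(pct, 2))  # 164 176 93.18
```

This reproduces the listed 93.18% for that hit (164 identical positions over 176 aligned positions). A middle line that ends in a mismatch would need its trailing space preserved for the length to stay correct.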

TrEMBL top hits
A0A0A0KGF8 Uncharacterized protein  e value: 6.9e-85  % identity: 93.18
Query:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDD+DFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL
        RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSV S+V+DTPS+QST+AREFTK QK+ADKS Q A+KT SKL
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL

A0A1S3B2C0 uncharacterized protein LOC103485038  e value: 1.8e-85  % identity: 93.18
Query:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDD+DF+IHDDLDVLEMEEREKIVKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL
        RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSV S+V+DTPS+QST+AREFTK QK+ADKS Q A+KTGSKL
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL

A0A6J1BS47 uncharacterized protein LOC111005257  e value: 7.1e-82  % identity: 91.48
Query:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSD VDF+I+DDLDVLEMEEREK+VKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL
        RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSV SIVID  SEQSTSAR F KMQK+ADK+ +TA+ TGSKL
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL

A0A6J1GEZ2 uncharacterized protein LOC111453325  e value: 2.1e-81  % identity: 90.96
Query:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTTIASAGKKESSVSS+FGKGRYKFWAL AILLLAFWSMFTGTVSLRWS GNLNGLSDD+DF++ +DLDVLEMEEREKIVKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKV-ADKSWQTASKTGSKL
        RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMS+ SIV+DTPSEQSTSAREFTKMQK+ ADKS  TA KTGSKL
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKV-ADKSWQTASKTGSKL

A0A6J1ILF2 uncharacterized protein LOC111478506  e value: 1.1e-82  % identity: 90.91
Query:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI
        MKDDDVLPTTIASAGKK+SSVSS+FGKGRYKFWAL AILLLAFWSMFTGTVSLRWSAGNLNGLSDD+DF++ +DLDVLEMEEREKIVKHMWDVYTNNRRI
Subjt:  MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRI

Query:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL
        RLPRFWQEAFEAAYEDLTSE+PGDREAAISEIARMS+ SIV+DTPSEQSTSAREFTKMQK+ADKS  TA KTGSKL
Subjt:  RLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL

SwissProt top hits
No hits found
Arabidopsis top hits
AT1G19380.1 Protein of unknown function (DUF1195)  e value: 2.1e-25  % identity: 51.61
Query:  YKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRIRLPRFWQEAFEAAYEDLTSEVPGDREAAI
        YK W L A+LLLAF SM TG+VSL+   G  +       FS  DDLDVLE+EEREK+V+ MWDVY  +  +++PRFW+EAFEAAYE L S+    R AA+
Subjt:  YKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRIRLPRFWQEAFEAAYEDLTSEVPGDREAAI

Query:  SEIARMSVRSIVIDTPSEQSTSAR
        S+IA++S+   V      +STSA+
Subjt:  SEIARMSVRSIVIDTPSEQSTSAR

AT4G36660.1 Protein of unknown function (DUF1195)  e value: 6.0e-57  % identity: 67.46
Query:  MKDDDVLPTTIASA---------GKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMW
        MK+DD LPTT  +           KKESS S LFG+GRYKFWA AAILLLAFWSMFTGTV+LR S GNLN LS+D+    +D+LDVLEMEEREK+VKHMW
Subjt:  MKDDDVLPTTIASA---------GKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMW

Query:  DVYTNNRRIRLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQK
        DVYTN+RRI+LPRFWQEAF AAYE+LTS+VPG REAAI EIA+MS RSI +D P  +S SAR+  +  K
Subjt:  DVYTNNRRIRLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQK

AT5G65650.1 Protein of unknown function (DUF1195)  e value: 1.6e-62  % identity: 69.78
Query:  MKDDDVLPTTIAS------AGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVY
        MKD D LP + +S       GKKE+  S+LF KGRYKFWALAAILLLAFWSM TGTV+LRWSAGN+N  +DD+ F IH+DLDVLEMEEREK+VKHMWDVY
Subjt:  MKDDDVLPTTIAS------AGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVY

Query:  TNNRRIRLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQT-ASKTGSK
         N RRIRLPRFWQEAFEAAYE+LTS+VP   EAAISEIARMS+RSIVID P   ST+ RE TK  K+ADK   T  SK  S+
Subjt:  TNNRRIRLPRFWQEAFEAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQT-ASKTGSK


Sequences
CDS sequence
ATGAAGGACGACGATGTCCTTCCAACGACGATAGCGAGTGCCGGGAAGAAAGAGAGCTCGGTTTCGAGTTTATTCGGCAAAGGCCGCTACAAGTTCTGGGCATTGGCTGC
CATTTTGTTGCTCGCATTTTGGTCCATGTTCACCGGCACCGTTTCTCTTCGATGGTCCGCCGGTAATCTCAACGGCCTATCTGATGATGTCGATTTTAGCATTCATGACG
ATCTCGATGTGCTTGAGATGGAGGAAAGAGAGAAGATAGTGAAGCACATGTGGGACGTTTACACAAATAATCGACGGATCAGGTTGCCGCGTTTCTGGCAAGAGGCATTT
GAGGCTGCGTACGAGGACCTGACTAGTGAAGTGCCTGGTGATAGAGAGGCTGCTATCTCCGAGATCGCCCGGATGTCCGTGCGCTCCATTGTTATTGATACGCCTTCGGA
GCAATCAACGAGTGCACGAGAGTTCACAAAGATGCAGAAAGTAGCAGATAAAAGCTGGCAGACAGCTTCCAAAACTGGGAGTAAGCTGTGA
mRNA sequence
ATTTCTTTTAAGGAAGCCTATCTTTCTCTCACTTCTCTTTTCTCCTTTCGCTATTTCTCTCTCTATCTCTCTCTCTTCGGTGAGAGAAGAAACTCTCTTCTCCGTCCAAA
TCTGGGTCTTTTAGTGCAAAAGAGCGATAGCATGATTTCTCATTTTTTCTGAAATCGATTCTGCAGCAGTGATTCCAGCATTTTGATTTTTCTTTTTCCTTCTTATTGTC
GTTGATCGATATGAAGGACGACGATGTCCTTCCAACGACGATAGCGAGTGCCGGGAAGAAAGAGAGCTCGGTTTCGAGTTTATTCGGCAAAGGCCGCTACAAGTTCTGGG
CATTGGCTGCCATTTTGTTGCTCGCATTTTGGTCCATGTTCACCGGCACCGTTTCTCTTCGATGGTCCGCCGGTAATCTCAACGGCCTATCTGATGATGTCGATTTTAGC
ATTCATGACGATCTCGATGTGCTTGAGATGGAGGAAAGAGAGAAGATAGTGAAGCACATGTGGGACGTTTACACAAATAATCGACGGATCAGGTTGCCGCGTTTCTGGCA
AGAGGCATTTGAGGCTGCGTACGAGGACCTGACTAGTGAAGTGCCTGGTGATAGAGAGGCTGCTATCTCCGAGATCGCCCGGATGTCCGTGCGCTCCATTGTTATTGATA
CGCCTTCGGAGCAATCAACGAGTGCACGAGAGTTCACAAAGATGCAGAAAGTAGCAGATAAAAGCTGGCAGACAGCTTCCAAAACTGGGAGTAAGCTGTGACATGGATGA
TGTTCATGATGATCCAAGGTTCGAGGCTTCTCTATTCTACCCTTTCTGCCACTAGATTGCTGTCTGAAGTTTGTAGGAGGCAGTGATTGGCAAATCTATATTCTTTAAAA
TTTTGATTTTTTCCCCTTTGCTTTACACTATTGCAAGATGGGTCAAACAGCCTTTTTGCCTAGAATGTTAGAGTGATATTTGTTAATGTTATTTATAAAGGTTAAAGATA
AATTTTGAAAGTCTCATATTCAACGATCTTTTAAGCAGGTAGA
Protein sequence
MKDDDVLPTTIASAGKKESSVSSLFGKGRYKFWALAAILLLAFWSMFTGTVSLRWSAGNLNGLSDDVDFSIHDDLDVLEMEEREKIVKHMWDVYTNNRRIRLPRFWQEAF
EAAYEDLTSEVPGDREAAISEIARMSVRSIVIDTPSEQSTSAREFTKMQKVADKSWQTASKTGSKL
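
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and, to stay short, translates only the first 42 and last 18 nucleotides of the CDS; `translate` is a hypothetical helper, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame nucleotide sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 42 nt and last 18 nt of the CDS sequence listed above.
cds_start = "ATGAAGGACGACGATGTCCTTCCAACGACGATAGCGAGTGCC"
cds_end = "ACTGGGAGTAAGCTGTGA"

print(translate(cds_start))  # MKDDDVLPTTIASA
print(translate(cds_end))    # TGSKL
```

Both fragments agree with the start (MKDDDVLPTTIASA) and end (TGSKL) of the listed protein sequence; passing the full CDS, with line breaks removed, to the same function would allow a complete comparison.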