; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018903 (gene) of Snake gourd v1 genome

Gene IDTan0018903
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF1195)
Genome locationLG02:88087511..88089046
RNA-Seq ExpressionTan0018903
SyntenyTan0018903
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR010608 - Protein of unknown function DUF1195


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022962155.1 uncharacterized protein LOC111462692 isoform X2 [Cucurbita moschata]1.9e-7590.45Show/hide
Query:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGG
        MK +LSPTMAALKK+TPSETGVSFFLSR+ARYKFW LAAILLLAFWSMFTGSVSLKWSA TFARFYDGPRKPIFDDLDILEVEERE+ VRHMWNLY+HGG
Subjt:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGG

Query:  GGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSKIGSKE
        GGRLPRFW EAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSK+  ++
Subjt:  GGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSKIGSKE

XP_022962157.1 uncharacterized protein LOC111462692 isoform X4 [Cucurbita moschata]9.5e-7593.42Show/hide
Query:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGG
        MK +LSPTMAALKK+TPSETGVSFFLSR+ARYKFW LAAILLLAFWSMFTGSVSLKWSA TFARFYDGPRKPIFDDLDILEVEERE+ VRHMWNLY+HGG
Subjt:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGG

Query:  GGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSK
        GGRLPRFW EAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSK
Subjt:  GGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSK

XP_022997545.1 uncharacterized protein LOC111492438 isoform X2 [Cucurbita maxima]2.5e-7589.81Show/hide
Query:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGG
        MK +L PTMAALKK++PSETGVSFFLSR+ARYKFW LAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEERE+ VRHMWNLY+HGG
Subjt:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGG

Query:  GGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSKIGSKE
        GGRLPRFW EAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSK+  ++
Subjt:  GGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSKIGSKE

XP_023546987.1 uncharacterized protein LOC111805922 isoform X3 [Cucurbita pepo subsp. pepo]1.9e-7589.81Show/hide
Query:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGG
        MK +LSPTMA LKK++PSETGVSFFLSR+ARYKFW LAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEERE+ VRHMWNLY+HGG
Subjt:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGG

Query:  GGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSKIGSKE
        GGRLPRFW EAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSK+  ++
Subjt:  GGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSKIGSKE

XP_023546989.1 uncharacterized protein LOC111805922 isoform X5 [Cucurbita pepo subsp. pepo]9.5e-7592.76Show/hide
Query:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGG
        MK +LSPTMA LKK++PSETGVSFFLSR+ARYKFW LAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEERE+ VRHMWNLY+HGG
Subjt:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGG

Query:  GGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSK
        GGRLPRFW EAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSK
Subjt:  GGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSK

TrEMBL top hitse value%identityAlignment
A0A6J1HBY9 uncharacterized protein LOC111462692 isoform X44.6e-7593.42Show/hide
Query:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGG
        MK +LSPTMAALKK+TPSETGVSFFLSR+ARYKFW LAAILLLAFWSMFTGSVSLKWSA TFARFYDGPRKPIFDDLDILEVEERE+ VRHMWNLY+HGG
Subjt:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGG

Query:  GGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSK
        GGRLPRFW EAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSK
Subjt:  GGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSK

A0A6J1HEA2 uncharacterized protein LOC111462692 isoform X29.3e-7690.45Show/hide
Query:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGG
        MK +LSPTMAALKK+TPSETGVSFFLSR+ARYKFW LAAILLLAFWSMFTGSVSLKWSA TFARFYDGPRKPIFDDLDILEVEERE+ VRHMWNLY+HGG
Subjt:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGG

Query:  GGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSKIGSKE
        GGRLPRFW EAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSK+  ++
Subjt:  GGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSKIGSKE

A0A6J1HG60 uncharacterized protein LOC111462692 isoform X11.5e-7386.06Show/hide
Query:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDIL--------EVEEREKAVRHM
        MK +LSPTMAALKK+TPSETGVSFFLSR+ARYKFW LAAILLLAFWSMFTGSVSLKWSA TFARFYDGPRKPIFDDLDIL        EVEERE+ VRHM
Subjt:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDIL--------EVEEREKAVRHM

Query:  WNLYTHGGGGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSKIGSKE
        WNLY+HGGGGRLPRFW EAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSK+  ++
Subjt:  WNLYTHGGGGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSKIGSKE

A0A6J1K9Z2 uncharacterized protein LOC111492438 isoform X21.2e-7589.81Show/hide
Query:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGG
        MK +L PTMAALKK++PSETGVSFFLSR+ARYKFW LAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEERE+ VRHMWNLY+HGG
Subjt:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGG

Query:  GGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSKIGSKE
        GGRLPRFW EAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSK+  ++
Subjt:  GGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSKIGSKE

A0A6J1KBQ5 uncharacterized protein LOC111492438 isoform X11.9e-7385.45Show/hide
Query:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDIL--------EVEEREKAVRHM
        MK +L PTMAALKK++PSETGVSFFLSR+ARYKFW LAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDIL        EVEERE+ VRHM
Subjt:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDIL--------EVEEREKAVRHM

Query:  WNLYTHGGGGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSKIGSKE
        WNLY+HGGGGRLPRFW EAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSK+  ++
Subjt:  WNLYTHGGGGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSKIGSKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G19380.1 Protein of unknown function (DUF1195)1.6e-2751.43Show/hide
Query:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIF-DDLDILEVEEREKAVRHMWNLYTHG
        MK + S T+  ++K             + A YK W L A+LLLAF SM TGSVSLK   G F    DG     F DDLD+LE+EEREK VR MW++Y   
Subjt:  MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIF-DDLDILEVEEREKAVRHMWNLYTHG

Query:  GGGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSL
        GG ++PRFW EAFEAAYE LI D  AVR+AA+ +IA++SL
Subjt:  GGGRLPRFWSEAFEAAYEDLIGDVPAVRDAALLEIARMSL

AT4G36660.1 Protein of unknown function (DUF1195)7.6e-3852.94Show/hide
Query:  MAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGGGGRLPRFW
        MA  KKE+         L  R RYKFWA AAILLLAFWSMFTG+V+L+ S G   R  +    P +D+LD+LE+EEREK V+HMW++YT+    +LPRFW
Subjt:  MAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGGGGRLPRFW

Query:  SEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSKIGSKELKTN
         EAF AAYE+L  DVP VR+AA+ EIA+MS +S+ +DP P +S + +++L  N
Subjt:  SEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSKIGSKELKTN

AT5G65650.1 Protein of unknown function (DUF1195)5.1e-4259.03Show/hide
Query:  TMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGGGGRLPRF
        ++AA       ETG S   S + RYKFWALAAILLLAFWSM TG+V+L+WSAG    F D    PI +DLD+LE+EEREK V+HMW++Y +G   RLPRF
Subjt:  TMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGGGGRLPRF

Query:  WSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKS
        W EAFEAAYE+L  DVP V +AA+ EIARMS++S+ +DP P+ S
Subjt:  WSEAFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATCCGAGCTTTCACCGACTATGGCGGCTCTGAAGAAGGAGACCCCATCAGAAACTGGGGTCTCCTTTTTCCTCTCTAGAAGGGCTCGTTACAAGTTCTGGGCCTT
GGCCGCCATTCTCCTCCTCGCCTTCTGGTCCATGTTCACCGGCTCTGTCTCCCTCAAATGGTCCGCCGGAACCTTCGCCAGATTCTACGACGGTCCCCGCAAGCCGATCT
TCGACGATCTTGACATTCTGGAAGTTGAAGAGCGGGAGAAGGCTGTTCGACACATGTGGAACCTGTACACTCACGGCGGCGGCGGGCGGTTGCCGAGATTTTGGTCGGAG
GCTTTTGAAGCGGCGTACGAGGACTTGATCGGCGATGTTCCCGCCGTTCGAGATGCTGCCCTTTTGGAGATCGCAAGAATGTCTCTGCAATCTGTTCACGTCGATCCAAT
TCCGATCAAGTCCAAGATTGGGAGCAAAGAGCTCAAAACAAACGCAATTGGCTGA
mRNA sequenceShow/hide mRNA sequence
TAGAAGTTGAAATTTCAAGTTGTTGAGAGAGATGAAATAATCTTCCACCTTACATTTTGTTACTAAAAAAAATGGAAAGGCTAATATTTGAATCAACGGCGCAGATTTGG
ATTCAACAAAACCACCACAGTTCACAGCAATCCCCAACCCATTGGGTCCCTTCTGCGTCGGTGAATTGCAGACGAAGGTCTGTCCATGGCGTTGGCCCCATTCACACACA
TTCACTCTTCATAAATTCCCAATAAGAACTCTTCCATTTCATCTTCTAGTTCTCGTTTGCATTACCATAAATTAATTTCTCTTCATTCTCTTCTCTCTCTCACTCCACTG
TTAAACCCCAACACAACCCCATCTCTCTCTGTAACCCCAAAATGAAATCCGAGCTTTCACCGACTATGGCGGCTCTGAAGAAGGAGACCCCATCAGAAACTGGGGTCTCC
TTTTTCCTCTCTAGAAGGGCTCGTTACAAGTTCTGGGCCTTGGCCGCCATTCTCCTCCTCGCCTTCTGGTCCATGTTCACCGGCTCTGTCTCCCTCAAATGGTCCGCCGG
AACCTTCGCCAGATTCTACGACGGTCCCCGCAAGCCGATCTTCGACGATCTTGACATTCTGGAAGTTGAAGAGCGGGAGAAGGCTGTTCGACACATGTGGAACCTGTACA
CTCACGGCGGCGGCGGGCGGTTGCCGAGATTTTGGTCGGAGGCTTTTGAAGCGGCGTACGAGGACTTGATCGGCGATGTTCCCGCCGTTCGAGATGCTGCCCTTTTGGAG
ATCGCAAGAATGTCTCTGCAATCTGTTCACGTCGATCCAATTCCGATCAAGTCCAAGATTGGGAGCAAAGAGCTCAAAACAAACGCAATTGGCTGAGTAGCAGCAGGGCT
CTGCTAAGTTGGCTAGCTGGTTGGGTTGTGTTTATTTGGGAGAATCTTTGAATTTATTTGCTTAAAGTTGTGTACTATATTAAAAAAAACAAAACAATGGAATGTACAAA
ATTTACCATACCAATCCAATATCAATGGTAAAAGCGAAATTACTCCCTC
Protein sequenceShow/hide protein sequence
MKSELSPTMAALKKETPSETGVSFFLSRRARYKFWALAAILLLAFWSMFTGSVSLKWSAGTFARFYDGPRKPIFDDLDILEVEEREKAVRHMWNLYTHGGGGRLPRFWSE
AFEAAYEDLIGDVPAVRDAALLEIARMSLQSVHVDPIPIKSKIGSKELKTNAIG