; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014108 (gene) of Snake gourd v1 genome

Gene IDTan0014108
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF1068)
Genome locationLG08:2767365..2771976
RNA-Seq ExpressionTan0014108
SyntenyTan0014108
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR010471 - Protein of unknown function DUF1068


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139277.1 uncharacterized protein LOC101212944 [Cucumis sativus]1.9e-8694.19Show/hide
Query:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSG+CLRCCLV FAVVSALAVCGPALYWRFKKA  LGDS+ SC PCICDCPPPLSLLKI+PGLANLSVTDCG NDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERK+TSLWERRARQMGWEGE
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

XP_008457359.1 PREDICTED: uncharacterized protein LOC103497063 [Cucumis melo]8.7e-8794.77Show/hide
Query:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSG+CLRCCLV FAVVSALAVCGPALYWRFKKA  LGDS+ SC PCICDCPPPLSLLKI+PGLANLSVTDCG NDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

XP_022143173.1 uncharacterized protein LOC111013108 [Momordica charantia]1.3e-8795.93Show/hide
Query:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSG+CLRCCLVIFAVVSALAVCGPALYWRFKKAF LGDS+ SC PCICDCPPPLSLLKIAPGLANLSVTDCG NDPDLK EMEKQFVDLLTEELKLQ
Subjt:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

XP_022938183.1 uncharacterized protein LOC111444344 [Cucurbita moschata]2.5e-8693.57Show/hide
Query:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSG+CLRCCLV FAVVSALAVCGPAL+WRFKKA HLGDS+TSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLK+EMEKQFVDLLTEELKLQ
Subjt:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEG
        EAV+GEHTRHMNIT+FEAKRAASQYQREAEKCI+ATETCEEARERAEAL IKERKLTSLWERRARQMGW+G
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEG

XP_038890751.1 uncharacterized protein LOC120080238 [Benincasa hispida]3.9e-8794.77Show/hide
Query:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSG+CLRCCLVIFAVVSALAVCGPALYWRFKKA  LGDS+ SC PCICDCPPPLSLLKI+PGLANLS+TDCG NDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

TrEMBL top hitse value%identityAlignment
A0A0A0LG38 Uncharacterized protein9.4e-8794.19Show/hide
Query:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSG+CLRCCLV FAVVSALAVCGPALYWRFKKA  LGDS+ SC PCICDCPPPLSLLKI+PGLANLSVTDCG NDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERK+TSLWERRARQMGWEGE
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

A0A1S3C6L6 uncharacterized protein LOC1034970634.2e-8794.77Show/hide
Query:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSG+CLRCCLV FAVVSALAVCGPALYWRFKKA  LGDS+ SC PCICDCPPPLSLLKI+PGLANLSVTDCG NDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

A0A5A7VC79 DUF1068 domain-containing protein4.2e-8794.77Show/hide
Query:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSG+CLRCCLV FAVVSALAVCGPALYWRFKKA  LGDS+ SC PCICDCPPPLSLLKI+PGLANLSVTDCG NDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

A0A6J1CQ21 uncharacterized protein LOC1110131086.5e-8895.93Show/hide
Query:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSG+CLRCCLVIFAVVSALAVCGPALYWRFKKAF LGDS+ SC PCICDCPPPLSLLKIAPGLANLSVTDCG NDPDLK EMEKQFVDLLTEELKLQ
Subjt:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

A0A6J1FI57 uncharacterized protein LOC1114443441.2e-8693.57Show/hide
Query:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSG+CLRCCLV FAVVSALAVCGPAL+WRFKKA HLGDS+TSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLK+EMEKQFVDLLTEELKLQ
Subjt:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEG
        EAV+GEHTRHMNIT+FEAKRAASQYQREAEKCI+ATETCEEARERAEAL IKERKLTSLWERRARQMGW+G
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05070.1 Protein of unknown function (DUF1068)3.2e-3143.37Show/hide
Query:  RSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQEAV
        R  + L+  L +  +  A  + GP LYW   +A     S +SC  C C+C    S + I   L+N S  DC  +DP++ ++ EK + +LLTEELKL+EA 
Subjt:  RSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQEAV

Query:  SGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGW
        S E  +  ++ L EAK+  S YQ+EA+KC +  ETCEEARE+AE  + +++KLTS WE RARQ GW
Subjt:  SGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGW

AT2G24290.1 Protein of unknown function (DUF1068)2.3e-6973.71Show/hide
Query:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTS---CAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEEL
        M+RRSG+C+R CLVIF+VVSAL VCGPALYW+  K F +G +R++   C PC+CD PPPLSLL+IAPGLANLS+T CG +DP+LK+EMEK FVDLLTEEL
Subjt:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTS---CAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEEL

Query:  KLQEAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        KLQEAV+ EH+RHMN+TL EAKR ASQYQ+EAEKC AATE CE ARERA+AL++KERK+T LWERRARQ+GWEGE
Subjt:  KLQEAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

AT2G32580.1 Protein of unknown function (DUF1068)1.4e-3446.34Show/hide
Query:  SCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQEAVSGE
        + L+  L + A+     + GP LYW   +A  L  S TSC+ C+CDC   L LL I  GL+N S TDC   DP++ ++ EK + +LLTEELK +EA S E
Subjt:  SCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQEAVSGE

Query:  HTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWE
          + ++  L EAK+  S YQ+EA+KC +  ETCEEARE+AE  +++++KLTS+WE+RARQ G++
Subjt:  HTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWE

AT4G04360.1 Protein of unknown function (DUF1068)8.8e-2941.52Show/hide
Query:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDS-RTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKL
        M+RR     +   V+  +     + GP+LYW   +   + DS  +SC PC+CDC     LL I  GL+N S  DC  ++ +  +E E  F +++ EELKL
Subjt:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDS-RTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKL

Query:  QEAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWE
        +EA + E     +  L +AK+AASQYQ+EA+KC    ETCE ARE+AEA + ++R+L+ +WE RARQ GW+
Subjt:  QEAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWE

AT4G30996.1 Protein of unknown function (DUF1068)6.9e-7478.74Show/hide
Query:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTS--CAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELK
        M RRSG C+R CLVIFAVVSAL VCGPALYW+F K F +G +R +  C PC+CDCPPPLSLL+IAPGLANLS+TDCG +DP+LKQEMEKQFVDLLTEELK
Subjt:  MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTS--CAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELK

Query:  LQEAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        LQEAV+ EH+RHMN+TL EAKR ASQYQ+EAEKC AATE CE ARERAEAL+IKERK+TSLWE+RARQ GWEGE
Subjt:  LQEAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACGCCGATCTGGGTCTTGCCTGCGGTGTTGTCTTGTGATTTTTGCTGTAGTTTCTGCTTTGGCTGTTTGTGGACCGGCTTTGTATTGGAGATTCAAGAAGGCTTT
CCATTTGGGTGATTCCAGAACCTCCTGTGCTCCTTGCATCTGCGATTGTCCACCCCCATTATCCCTTTTGAAGATTGCCCCTGGTCTGGCCAATCTCTCCGTAACAGATT
GTGGAGGTAATGACCCAGATCTCAAGCAGGAGATGGAAAAACAATTTGTAGACCTCTTGACAGAGGAACTGAAGCTTCAAGAAGCAGTTTCTGGCGAACATACTCGCCAT
ATGAACATCACTTTATTCGAGGCAAAAAGGGCAGCTTCTCAGTATCAGAGGGAGGCTGAAAAGTGCATTGCTGCAACAGAAACTTGTGAAGAGGCTCGAGAACGCGCCGA
GGCATTGATGATCAAGGAGAGAAAGCTTACATCATTGTGGGAGCGACGAGCCCGCCAAATGGGTTGGGAAGGGGAATAA
mRNA sequenceShow/hide mRNA sequence
TAAATGGTGGGGTTGGCTTTATTTAATTATTGGCTCCTTCTCAAAGCTATTGTAGAATGAATTCAACTGAATCGAGAGGAAAAAATGGAAACTTTTTTTAATTTTTATGT
TCAATTCACCGACTTCCCCCAAAGAAATTCTAGCTTAGCCTATAGCTAAAGTTTTAATTTTCCTTTTCCCCTCTTTTTCCATTCATTGGTTTCAAACTTTCACTGCTGCA
ATTGGATCTCACTTCCTCACTTTCTCAGTAACGCGGAAAGAGAGACGAAAAATACGAAAAAAAAAAACAAGAATCAAAGGGGAAAAAAGGGAGATTGGAAGATCAGGGTG
AGCAATGTCACGCCGATCTGGGTCTTGCCTGCGGTGTTGTCTTGTGATTTTTGCTGTAGTTTCTGCTTTGGCTGTTTGTGGACCGGCTTTGTATTGGAGATTCAAGAAGG
CTTTCCATTTGGGTGATTCCAGAACCTCCTGTGCTCCTTGCATCTGCGATTGTCCACCCCCATTATCCCTTTTGAAGATTGCCCCTGGTCTGGCCAATCTCTCCGTAACA
GATTGTGGAGGTAATGACCCAGATCTCAAGCAGGAGATGGAAAAACAATTTGTAGACCTCTTGACAGAGGAACTGAAGCTTCAAGAAGCAGTTTCTGGCGAACATACTCG
CCATATGAACATCACTTTATTCGAGGCAAAAAGGGCAGCTTCTCAGTATCAGAGGGAGGCTGAAAAGTGCATTGCTGCAACAGAAACTTGTGAAGAGGCTCGAGAACGCG
CCGAGGCATTGATGATCAAGGAGAGAAAGCTTACATCATTGTGGGAGCGACGAGCCCGCCAAATGGGTTGGGAAGGGGAATAAATCTATGATTTCAAAGGTCAATCTCTT
AGTTCAACTAGTCGTCGAGTCCTTCAACCCACGAGATCAATAGAAGCTGGCAAGCGAAGTGCTTGGTTCATCAATTTCTCAAGGCATAACGTTCTCTCTTTCCTCCATTA
ATTGTGTTGCACTGTTCCTGAAAACATCTCAATTCTACATCATAATTCATACATGACAATGTTATAGATCCTACTCTTCTGCATTTGCTATATATATACATATATATATA
TGATTATCTCTTCAACCGTGA
Protein sequenceShow/hide protein sequence
MSRRSGSCLRCCLVIFAVVSALAVCGPALYWRFKKAFHLGDSRTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGGNDPDLKQEMEKQFVDLLTEELKLQEAVSGEHTRH
MNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE