; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014624 (gene) of Snake gourd v1 genome

Gene IDTan0014624
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationLG05:83627098..83628264
RNA-Seq ExpressionTan0014624
SyntenyTan0014624
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573220.1 hypothetical protein SDJN03_27107, partial [Cucurbita argyrosperma subsp. sororia]6.1e-8388.83Show/hide
Query:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF
        MARNYGFLVCILVMVMDAVAGLLAIRAEKAQN+V LQS S+WVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCI IRN+QHF++SNANRRLGLLF
Subjt:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF

Query:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDPQHV
        M+LSWITLAIGFSML+AGTVDNSK KNSCEISSHGLFL+GGI+CFIHGL T+AYYVSATAAYREE+RK K  PS PQHV
Subjt:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDPQHV

XP_022954638.1 uncharacterized protein LOC111456841 [Cucurbita moschata]1.5e-8188.64Show/hide
Query:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF
        MARNYGFLVCILVMVMDAVAGLLAIRAEKAQN+V LQS S+WVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCI IRN+QHF++SNANRRLGLLF
Subjt:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF

Query:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDP
        M+LSWITLAIGFSML+AGTVDNSK KNSC+ISSHGLFL+GGI+CFIHGLCT+AYYVSATAAYREE+RK K  PS P
Subjt:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDP

XP_022994470.1 uncharacterized protein LOC111490180 [Cucurbita maxima]8.0e-8388.27Show/hide
Query:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF
        MARNYGFLVCILVMVMDAVAGLLAIRAEKAQN+V LQS S+W YECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCI IRN+QHF++SNANRRLGLLF
Subjt:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF

Query:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDPQHV
        M+LSWITLAIGFSML+AGTVDNSK KNSC+ISSHGLFL+GGI+CFIHGLCT+AYYVSATAAYREE+RK K  PS PQHV
Subjt:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDPQHV

XP_023542772.1 uncharacterized protein LOC111802580 [Cucurbita pepo subsp. pepo]2.1e-8388.83Show/hide
Query:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF
        MA+NYGFLVCILVMVMDAVAGLLAIRAEKAQN+V LQS S+WVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCI IRN+QHF++SNANRRLGLLF
Subjt:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF

Query:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDPQHV
        M+LSWITLAIGFSML+AGTVDNSK KNSCEISSHGLFL+GGI+CFIHGLCT+AYYVSATAAYREE+RK K  PS PQHV
Subjt:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDPQHV

XP_038894860.1 uncharacterized protein LOC120083260 [Benincasa hispida]8.0e-8386.59Show/hide
Query:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF
        MARNYGFLVCILVMV+DAVAG+L I+AEKAQNRV+L+SVSIWV  CSRKPRDDAFSQGLAATILLG+AH IAKVLGGCICIRN QHF+ES AN+RLGLLF
Subjt:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF

Query:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDPQHV
        M+LSWITLAIG S+L+AGTVDNSKWKNSCEISSHGLFL GGI+CFIHGLCT+AYYVSATAAYREEQRK KA PS+PQHV
Subjt:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDPQHV

TrEMBL top hitse value%identityAlignment
A0A0A0LRQ0 Uncharacterized protein6.4e-7077.9Show/hide
Query:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGG--CICIRNIQHFKESNANRRLGL
        M RNYGFLVCILV+V+DAVAGLL I AEKAQNRV+L+S+SI + ECSRKPRDDAFS+GLAA+ILLGLAH IAKVLGG  CICIRN Q+ +E +AN+ LG 
Subjt:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGG--CICIRNIQHFKESNANRRLGL

Query:  LFMMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDPQHV
        LFM+LSWITLAIGFS+LMA T+DNSKWKNSCEISSHGLFL GGI+CF HGLCT+AYYVSATAAYREEQR  K  P +PQ V
Subjt:  LFMMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDPQHV

A0A1S3B4I4 uncharacterized protein LOC1034857081.3e-6775.69Show/hide
Query:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCIC--IRNIQHFKESNANRRLGL
        M RNYGFLVCILVMV+D VAGLL I AEKAQNRV+L+S+SI V ECSRKPRDDAFS+GLAA ILLGLAH IA VLGGC C  I N Q+ ++ +AN+ LGL
Subjt:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCIC--IRNIQHFKESNANRRLGL

Query:  LFMMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDPQHV
         FM+LSWITL IGFS+LMA T+DNSKWKNSCEISSHGLFL GGI+CF+HGLCT+AYYVSATAAYREEQR  K +P +PQ V
Subjt:  LFMMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDPQHV

A0A6J1CGZ6 uncharacterized protein LOC1110107663.4e-7985.23Show/hide
Query:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF
        MA+NYGFLVCILVMVMDAVAG+L IRAEKAQNRV+LQSVS+WVYECSRKPRDDAFSQGLA TILLGLAHAIAKVLG CICIR+ QHF+ES+AN+RLGL F
Subjt:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF

Query:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDP
        M+LSWITLAIGFSMLMAGTVDNS WKNSCEISS GLFL GGI+CF HGLCT+AYYVSATAA REEQRK   N S P
Subjt:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDP

A0A6J1GRM8 uncharacterized protein LOC1114568417.3e-8288.64Show/hide
Query:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF
        MARNYGFLVCILVMVMDAVAGLLAIRAEKAQN+V LQS S+WVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCI IRN+QHF++SNANRRLGLLF
Subjt:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF

Query:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDP
        M+LSWITLAIGFSML+AGTVDNSK KNSC+ISSHGLFL+GGI+CFIHGLCT+AYYVSATAAYREE+RK K  PS P
Subjt:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDP

A0A6J1K198 uncharacterized protein LOC1114901803.9e-8388.27Show/hide
Query:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF
        MARNYGFLVCILVMVMDAVAGLLAIRAEKAQN+V LQS S+W YECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCI IRN+QHF++SNANRRLGLLF
Subjt:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF

Query:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDPQHV
        M+LSWITLAIGFSML+AGTVDNSK KNSC+ISSHGLFL+GGI+CFIHGLCT+AYYVSATAAYREE+RK K  PS PQHV
Subjt:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDPQHV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05291.1 Protein of unknown function (DUF1218)3.9e-1126.86Show/hide
Query:  LVCILVMV-MDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCI--CIRNIQHFKESNANRRLGLLFMMLS
        +VCI++ V +D VAG + ++A+ AQ     Q V     EC + P   AF  G+ A   L  AH  A V+G  I    + +    ++       +  + L 
Subjt:  LVCILVMV-MDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCI--CIRNIQHFKESNANRRLGLLFMMLS

Query:  WITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDPQHV
        W+    G  +L  G   N++ +  C  +++ +F +GG +CF+H + +  YY+S+  A      + K N + P  +
Subjt:  WITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDPQHV

AT1G11500.1 Protein of unknown function (DUF1218)7.5e-3141.21Show/hide
Query:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRV----ILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRL
        M    GFLV ++++  D  A +L I AE AQ++       Q        C R P D AF++G+AA +LL + H +A VLGGC  IR+ Q FK + AN+ L
Subjt:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRV----ILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRL

Query:  GLLFMMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREE----QRKHKANPS
         + F++LSWI   + +S LM GT+ NS+    C +     FL+GGI C  HG+ T AYYVSA AA +E+    Q+++ AN S
Subjt:  GLLFMMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREE----QRKHKANPS

AT2G32280.1 Protein of unknown function (DUF1218)1.7e-3542.07Show/hide
Query:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF
        M +  G LVC++++ +D  A +L I+AE AQN+V  + + +W++EC R+P  DAF  GL A  +L +AH +  ++GGC+CI +   F+ S++ R++ +  
Subjt:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF

Query:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYRE
        ++L+WI  A+GF  ++ GT+ NSK ++SC  + H    +GGILCF+H L  +AYYVSATAA  E
Subjt:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYRE

AT4G21310.1 Protein of unknown function (DUF1218)2.9e-4350.91Show/hide
Query:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF
        MARN GF +CIL++ MD  AG+L I AE AQN+V  + + +W++EC R P   AF  GLAA ILL LAH  A  LGGC+C+ + Q  ++S+AN++L +  
Subjt:  MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLF

Query:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREE
        ++ +WI LAI FSML+ GT+ NS+ + +C IS H +  +GGILCF+HGL  +AYY+SATA+ RE+
Subjt:  MMLSWITLAIGFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREE

AT5G17210.1 Protein of unknown function (DUF1218)1.1e-0826.88Show/hide
Query:  VCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLFMMLSWITL
        V  L+ ++ AV   +A  A + +   +  +VS  + +C+  PR  AF+ G  + + L +A  I  V  GC C R  +    S +N  + L+  ++SW T 
Subjt:  VCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLFMMLSWITL

Query:  AIGFSMLMAGTVDNSKWKNS--------CEISSHGLFLVGGILCFIHGLCTIAYYVSATA
         I F +L++G   N +            C I   G+F  G +L  +     I YY+  T+
Subjt:  AIGFSMLMAGTVDNSKWKNS--------CEISSHGLFLVGGILCFIHGLCTIAYYVSATA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCGAAACTATGGCTTTCTGGTGTGCATTTTGGTCATGGTAATGGACGCTGTTGCCGGACTACTTGCCATTCGAGCTGAAAAGGCTCAGAATCGGGTGATACTACA
ATCGGTGAGCATATGGGTATATGAATGCAGCAGAAAGCCGAGAGATGATGCTTTTAGTCAGGGGCTGGCTGCGACTATTCTGCTTGGCCTTGCTCATGCCATTGCTAAAG
TACTTGGTGGGTGTATTTGTATTAGGAATATACAGCATTTCAAAGAATCAAACGCTAACAGGCGATTGGGATTGCTATTCATGATGCTCTCATGGATTACTTTGGCTATT
GGGTTCTCTATGTTGATGGCTGGGACGGTGGACAATTCCAAGTGGAAAAACTCTTGTGAGATATCAAGTCATGGGCTGTTTTTGGTAGGTGGGATTCTGTGTTTCATTCA
TGGGCTCTGTACTATCGCTTATTATGTTTCTGCAACAGCAGCTTATAGAGAAGAACAGAGGAAACACAAAGCAAATCCTTCTGATCCTCAACATGTTTAA
mRNA sequenceShow/hide mRNA sequence
TTGTGCAATCATGGGTTTCCTTTATTCCAACTTCCATTCCTCCAATTCACAGCTACTAAACTTTTATTCCCGATTGCAAATCTTGAGAAGAGCTCTGGATTTGCAGCAGA
AATGGCGCGAAACTATGGCTTTCTGGTGTGCATTTTGGTCATGGTAATGGACGCTGTTGCCGGACTACTTGCCATTCGAGCTGAAAAGGCTCAGAATCGGGTGATACTAC
AATCGGTGAGCATATGGGTATATGAATGCAGCAGAAAGCCGAGAGATGATGCTTTTAGTCAGGGGCTGGCTGCGACTATTCTGCTTGGCCTTGCTCATGCCATTGCTAAA
GTACTTGGTGGGTGTATTTGTATTAGGAATATACAGCATTTCAAAGAATCAAACGCTAACAGGCGATTGGGATTGCTATTCATGATGCTCTCATGGATTACTTTGGCTAT
TGGGTTCTCTATGTTGATGGCTGGGACGGTGGACAATTCCAAGTGGAAAAACTCTTGTGAGATATCAAGTCATGGGCTGTTTTTGGTAGGTGGGATTCTGTGTTTCATTC
ATGGGCTCTGTACTATCGCTTATTATGTTTCTGCAACAGCAGCTTATAGAGAAGAACAGAGGAAACACAAAGCAAATCCTTCTGATCCTCAACATGTTTAACCCAGTTGA
CAGCCATTGCCTGTAATGACAATATAACTTCTAATTTATTAGTTCCTTAAGAAATTTTGAGTCCACGACTTAGTTCACATTGGAAACATCTATTGTTTTCCTGTTTCAGC
AATTGCCTGCCAAATCTAAGCAACTTTAAGAAACTGCTTCACATGGGAAACATCCATTTTTCTTCTTCTTTTGAGACTTCTGTTTGGTAAAGATTCCGTTGAAAATGCAT
GACTAAA
Protein sequenceShow/hide protein sequence
MARNYGFLVCILVMVMDAVAGLLAIRAEKAQNRVILQSVSIWVYECSRKPRDDAFSQGLAATILLGLAHAIAKVLGGCICIRNIQHFKESNANRRLGLLFMMLSWITLAI
GFSMLMAGTVDNSKWKNSCEISSHGLFLVGGILCFIHGLCTIAYYVSATAAYREEQRKHKANPSDPQHV