CuGenDBv2

Tan0014640 (gene) of Snake gourd v1 genome

Gene ID: Tan0014640
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Trinucleotide repeat-containing gene 18 protein-like protein
Genome location: LG11:882913..884934
RNA-Seq Expression: Tan0014640
Synteny: Tan0014640
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6602714.1 hypothetical protein SDJN03_07947, partial [Cucurbita argyrosperma subsp. sororia] (e-value 1.0e-61, identity 87.94%)
Query:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAA-AAEQEPSAELWSGGSNGKAVTRSLSD
        +LPGIPKKQSPARLR DSASPLTSLLPLPP STT PSSKRFGFHDWRKSNRQN QRDPFFDAFVECSKEP+AA AAEQ+P+AELW+ GSNGKAV+RSLSD
Subjt:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAA-AAEQEPSAELWSGGSNGKAVTRSLSD

Query:  RFGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG
        RFGFLN YSSCKRTCGVSESIVY PR  RSSFDLLNHR GG
Subjt:  RFGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG

KAG7033401.1 hypothetical protein SDJN02_07457, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value 1.0e-61, identity 87.94%)
Query:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAA-AAEQEPSAELWSGGSNGKAVTRSLSD
        +LPGIPKKQSPARLR DSASPLTSLLPLPP STT PSSKRFGFHDWRKSNRQN QRDPFFDAFVECSKEP+AA AAEQ+P+AELW+ GSNGKAV+RSLSD
Subjt:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAA-AAEQEPSAELWSGGSNGKAVTRSLSD

Query:  RFGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG
        RFGFLN YSSCKRTCGVSESIVY PR  RSSFDLLNHR GG
Subjt:  RFGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG

XP_011649907.1 uncharacterized protein LOC105434677 [Cucumis sativus] (e-value 3.6e-62, identity 88.57%)
Query:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAAAAEQEPSAELWSGGSNGKAVTRSLSDR
        HLPGIPKKQSPARLRR SASPL+S LPLPPNSTTP SSKRFGF DWRKSNRQN QRDPFFDAF+ECSKEPT AAA     AELWSGGSNGKA+TRSLSDR
Subjt:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAAAAEQEPSAELWSGGSNGKAVTRSLSDR

Query:  FGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG
        FGFLNLYSSCKRTCGVSESIVYLPRT RSSFDLLN RTGG
Subjt:  FGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG

XP_023528120.1 uncharacterized protein LOC111791129 [Cucurbita pepo subsp. pepo] (e-value 3.9e-61, identity 87.23%)
Query:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAA-AAEQEPSAELWSGGSNGKAVTRSLSD
        +LPGIPKKQSPARLR DSASPLTSLLPLPP STT PSSKRFGFHDWRKSNRQN QRDPFFDAFVECSKEP+AA AAEQ+P+AELW+ GSNGKAV+RSLSD
Subjt:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAA-AAEQEPSAELWSGGSNGKAVTRSLSD

Query:  RFGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG
        R GFLN YSSCKRTCGVSESIVY PR  RSSFDLLNHR GG
Subjt:  RFGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG

XP_038885392.1 uncharacterized protein LOC120075791 [Benincasa hispida] (e-value 1.8e-61, identity 87.32%)
Query:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAAAAEQEPSAELWS--GGSNGKAVTRSLS
        HLPGIPKKQSPARLRR SASPL+SLLPLPPNS TPPSSKRFGF DWRKSNRQN QRDPFFDAF+ECSKEP+ AAA     AELWS  GGSNGKA+TRSLS
Subjt:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAAAAEQEPSAELWS--GGSNGKAVTRSLS

Query:  DRFGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG
        DRFGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLN R+GG
Subjt:  DRFGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LQP3 Uncharacterized protein (e-value 1.7e-62, identity 88.57%)
Query:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAAAAEQEPSAELWSGGSNGKAVTRSLSDR
        HLPGIPKKQSPARLRR SASPL+S LPLPPNSTTP SSKRFGF DWRKSNRQN QRDPFFDAF+ECSKEPT AAA     AELWSGGSNGKA+TRSLSDR
Subjt:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAAAAEQEPSAELWSGGSNGKAVTRSLSDR

Query:  FGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG
        FGFLNLYSSCKRTCGVSESIVYLPRT RSSFDLLN RTGG
Subjt:  FGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG

A0A1S3B2H5 uncharacterized protein LOC103485423 (e-value 1.6e-60, identity 86.43%)
Query:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAAAAEQEPSAELWSGGSNGKAVTRSLSDR
        HLPGIPKKQSPARLRR SASPL+SLLPLPPNSTT  SSKRFGF DWRKSNRQN QRDPFFDAF+ECSKEPT AA +    AELWSG SNGKA+TRSLSDR
Subjt:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAAAAEQEPSAELWSGGSNGKAVTRSLSDR

Query:  FGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG
        FGFLNLYSSCKRTCGVSESIVYLPRTP SSFDLLN R+GG
Subjt:  FGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG

A0A5D3C8I6 Trinucleotide repeat-containing gene 18 protein-like protein (e-value 1.6e-60, identity 86.43%)
Query:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAAAAEQEPSAELWSGGSNGKAVTRSLSDR
        HLPGIPKKQSPARLRR SASPL+SLLPLPPNSTT  SSKRFGF DWRKSNRQN QRDPFFDAF+ECSKEPT AA +    AELWSG SNGKA+TRSLSDR
Subjt:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAAAAEQEPSAELWSGGSNGKAVTRSLSDR

Query:  FGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG
        FGFLNLYSSCKRTCGVSESIVYLPRTP SSFDLLN R+GG
Subjt:  FGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG

A0A6J1GNZ5 uncharacterized protein LOC111456137 (e-value 4.2e-61, identity 87.23%)
Query:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAA-AAEQEPSAELWSGGSNGKAVTRSLSD
        +LPGIPKKQSPARLR DSASPLTSLLPLPP STT PSSKRFGFHDWRKSNRQN QRDPFFDAFVECSKEP+AA AAEQ+P+AELW+ GSNGKAV+RSLSD
Subjt:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAA-AAEQEPSAELWSGGSNGKAVTRSLSD

Query:  RFGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG
        RFGFLN  SSCKRTCGVSESIVY PR  RSSFDLLNHR GG
Subjt:  RFGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG

A0A6J1JTM6 uncharacterized protein LOC111487739 (e-value 2.7e-60, identity 85.82%)
Query:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAA-AAEQEPSAELWSGGSNGKAVTRSLSD
        +LPGIPKKQSPARLR DSASPLT LLPLPP ST  PSSKRFGFHDWRKSNRQN QRDPFFDAFVECSKEP+AA AAEQ+P+AELW+ GSNGKAV+RSLSD
Subjt:  HLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAA-AAEQEPSAELWSGGSNGKAVTRSLSD

Query:  RFGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG
        RFGFLN YSSCKRTC VSESIVY PR  RSSFDLLNHR+GG
Subjt:  RFGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G22680.1 unknown protein (e-value 5.0e-14, identity 40.29%)
Query:  LPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAAAAEQEPSAELWSGGSNGKAVTRSLSDRF
        LPG PK+ S  R        L++LLPLPP+ +    + R      +K+      RDPF  A VECSK  T             +GG + K + +S S   
Subjt:  LPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECSKEPTAAAAEQEPSAELWSGGSNGKAVTRSLSDRF

Query:  GFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG
        G LNLYSSC+R C VSESIVYLP++  +S+D L+  T G
Subjt:  GFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG

AT1G71970.1 unknown protein (e-value 8.6e-14, identity 37.67%)
Query:  LPGIPKKQSPARLRRDSASPLTSLLPLPP--NSTTPPSSKRFGFHDWRKSNRQNLQ-RDPFFDAFVECSKEPTAAAAEQEPSAELW---------SGGSN
        LPG PK  S     R +      LLPLPP  N + P + K+   ++  K N   +  +DPF  A +ECSK+    +   +    +          SGGS+
Subjt:  LPGIPKKQSPARLRRDSASPLTSLLPLPP--NSTTPPSSKRFGFHDWRKSNRQNLQ-RDPFFDAFVECSKEPTAAAAEQEPSAELW---------SGGSN

Query:  GKAVTRSLSDRFGFLNLYSSCKRTCGVSESIVYLPRTPR-SSFDLL
              S+ DRFG +NLY SC+RTC V+ESIVYLPR  + +S+D L
Subjt:  GKAVTRSLSDRFGFLNLYSSCKRTCGVSESIVYLPRTPR-SSFDLL


Sequences
CDS sequence
ATGGCTTCCGACCGGCTACCGGCTGAGCCGAGCCTCGAGAGAAGTGCACTGTTACAGGAAGCTAGGGGGAGAAAACCATGGCTTAACCACATGATCGATACGCGATCTGC
TCTCGTTACAGGCCAGACGCATTTGCCCGGAATTCCCAAGAAACAGTCTCCGGCGAGGCTCCGGCGAGATTCCGCTTCTCCTCTGACCTCTCTCCTCCCTCTGCCTCCCA
ATTCCACCACTCCACCGTCCTCCAAGCGCTTCGGATTTCACGATTGGAGGAAATCAAACCGCCAAAACTTGCAGCGAGATCCTTTCTTCGACGCCTTCGTCGAGTGCTCT
AAAGAACCTACCGCCGCCGCCGCCGAACAAGAGCCGTCCGCCGAGCTCTGGAGCGGCGGCAGCAATGGCAAAGCAGTTACGAGAAGCCTGAGCGACCGATTCGGATTCTT
GAATCTGTACTCTTCTTGCAAACGAACGTGCGGCGTTTCGGAATCCATCGTGTATCTTCCGAGAACGCCGAGGAGTTCGTTCGATCTGCTAAACCACCGCACCGGCGGGT
GA
mRNA sequence
ATGGCTTCCGACCGGCTACCGGCTGAGCCGAGCCTCGAGAGAAGTGCACTGTTACAGGAAGCTAGGGGGAGAAAACCATGGCTTAACCACATGATCGATACGCGATCTGC
TCTCGTTACAGGCCAGACGCATTTGCCCGGAATTCCCAAGAAACAGTCTCCGGCGAGGCTCCGGCGAGATTCCGCTTCTCCTCTGACCTCTCTCCTCCCTCTGCCTCCCA
ATTCCACCACTCCACCGTCCTCCAAGCGCTTCGGATTTCACGATTGGAGGAAATCAAACCGCCAAAACTTGCAGCGAGATCCTTTCTTCGACGCCTTCGTCGAGTGCTCT
AAAGAACCTACCGCCGCCGCCGCCGAACAAGAGCCGTCCGCCGAGCTCTGGAGCGGCGGCAGCAATGGCAAAGCAGTTACGAGAAGCCTGAGCGACCGATTCGGATTCTT
GAATCTGTACTCTTCTTGCAAACGAACGTGCGGCGTTTCGGAATCCATCGTGTATCTTCCGAGAACGCCGAGGAGTTCGTTCGATCTGCTAAACCACCGCACCGGCGGGT
GA
Protein sequence
MASDRLPAEPSLERSALLQEARGRKPWLNHMIDTRSALVTGQTHLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECS
KEPTAAAAEQEPSAELWSGGSNGKAVTRSLSDRFGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG
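
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal Python sketch of such a check, not part of the database record. The names `translate`, `CDS` and `PROTEIN` are illustrative, and the sketch assumes the standard nuclear genetic code (NCBI translation table 1) applies to this gene.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# with bases T, C, A, G; the third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Unknown codons (for example those containing N) become 'X'.
    Trailing bases that do not form a full codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# CDS and protein sequences copied from the record for Tan0014640.
CDS = (
    "ATGGCTTCCGACCGGCTACCGGCTGAGCCGAGCCTCGAGAGAAGTGCACTGTTACAGGAAGCTAGGGGGAGAAAACCATGGCTTAACCACATGATCGATACGCGATCTGC"
    "TCTCGTTACAGGCCAGACGCATTTGCCCGGAATTCCCAAGAAACAGTCTCCGGCGAGGCTCCGGCGAGATTCCGCTTCTCCTCTGACCTCTCTCCTCCCTCTGCCTCCCA"
    "ATTCCACCACTCCACCGTCCTCCAAGCGCTTCGGATTTCACGATTGGAGGAAATCAAACCGCCAAAACTTGCAGCGAGATCCTTTCTTCGACGCCTTCGTCGAGTGCTCT"
    "AAAGAACCTACCGCCGCCGCCGCCGAACAAGAGCCGTCCGCCGAGCTCTGGAGCGGCGGCAGCAATGGCAAAGCAGTTACGAGAAGCCTGAGCGACCGATTCGGATTCTT"
    "GAATCTGTACTCTTCTTGCAAACGAACGTGCGGCGTTTCGGAATCCATCGTGTATCTTCCGAGAACGCCGAGGAGTTCGTTCGATCTGCTAAACCACCGCACCGGCGGGT"
    "GA"
)
PROTEIN = (
    "MASDRLPAEPSLERSALLQEARGRKPWLNHMIDTRSALVTGQTHLPGIPKKQSPARLRRDSASPLTSLLPLPPNSTTPPSSKRFGFHDWRKSNRQNLQRDPFFDAFVECS"
    "KEPTAAAAEQEPSAELWSGGSNGKAVTRSLSDRFGFLNLYSSCKRTCGVSESIVYLPRTPRSSFDLLNHRTGG"
)

if __name__ == "__main__":
    translated = translate(CDS)
    print("CDS length (nt):", len(CDS))
    print("Translated length (aa):", len(translated))
    print("Matches listed protein:", translated == PROTEIN)
```

Running the script reports the CDS length, the length of the translated product, and whether the translation agrees with the listed protein sequence.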