; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017920 (gene) of Snake gourd v1 genome

Gene IDTan0017920
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationLG11:135499..135963
RNA-Seq ExpressionTan0017920
SyntenyTan0017920
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035739.1 hypothetical protein E6C27_scaffold403G00100 [Cucumis melo var. makuwa]2.6e-2141.13Show/hide
Query:  DILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPPLSLRASILWGNAVRALLSDIWFERNQRVFH
        +ILQ++ S + + PS+CPLC + ++   H+F+ C  +   W ++   F L W F  +L  SV QLL+G  L     I+W    +ALL +IW ERNQR+FH
Subjt:  DILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPPLSLRASILWGNAVRALLSDIWFERNQRVFH

Query:  DLRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNA
        D  +       AA L A+AW SL   F  +SI DI  NWNA
Subjt:  DLRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNA

KAA0062564.1 GPI-anchor transamidase isoform X1 [Cucumis melo var. makuwa]8.5e-2541.38Show/hide
Query:  ILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPPLSLRASILWGNAVRALLSDIWFERNQRVFHD
        ++QR L + CL PS C LC EE E    LF  C ++  CW  LL  FG+ W F G+   ++ Q+L G  L     ++WGN  +ALLSDIWFE NQR+F  
Subjt:  ILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPPLSLRASILWGNAVRALLSDIWFERNQRVFHD

Query:  LRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNAFIFPA
            W    + A   A+ W  L+  F  +SI D+  NW AFI  A
Subjt:  LRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNAFIFPA

TYK21876.1 hypothetical protein E5676_scaffold494G00090 [Cucumis melo var. makuwa]1.1e-2140.56Show/hide
Query:  DILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPPLSLRASILWGNAVRALLSDIWFERNQRVFH
        +ILQ++ S + + PS+CPLC + ++   H+F+ C  +   W ++   F L W F  +L  SV QLL+G  L     I+W    +ALL +IW ERNQR+FH
Subjt:  DILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPPLSLRASILWGNAVRALLSDIWFERNQRVFH

Query:  DLRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNAFI
        D  +       AA L A+AW SL   F  +SI DI  NWN F+
Subjt:  DLRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNAFI

XP_022153214.1 uncharacterized protein LOC111020765 [Momordica charantia]2.6e-2644.37Show/hide
Query:  QSTFYKVDILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPP-LSLRASILWGNAVRALLSDIWF
        Q      DI+Q++     L PS C LC +  E   HLF  C FA  CW+ L  +F + W F     ++VYQLL GPP LS     LW N V+ALLS++WF
Subjt:  QSTFYKVDILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPP-LSLRASILWGNAVRALLSDIWF

Query:  ERNQRVFHDLRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNAFI
        ERN R+F + R+ ++ SF +A  KAS W SL  +F   S S I  NW AFI
Subjt:  ERNQRVFHDLRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNAFI

XP_038903695.1 uncharacterized protein LOC120090219 [Benincasa hispida]4.1e-2742.66Show/hide
Query:  DILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPPLSLRASILWGNAVRALLSDIWFERNQRVFH
        ++LQ++     L P+VCP C   +E   HLF  C ++  CW+KLL  F L      + K +V+QLLA P       +LW NAV+ALL+D+WFERNQR+F+
Subjt:  DILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPPLSLRASILWGNAVRALLSDIWFERNQRVFH

Query:  DLRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNAFI
        +     +   EAA  +AS+W  LS  F  +S+SD   NW AFI
Subjt:  DLRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNAFI

TrEMBL top hitse value%identityAlignment
A0A438CSQ5 Transposon TX1 uncharacterized 149 kDa protein4.1e-1734.04Show/hide
Query:  DILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPPLSLRASILWGNAVRALLSDIWFERNQRVFH
        D+LQ R  H  L P +C LC E+ E+  HLF+ CS     W +L +   + WV   ++ + ++    G   S R  +LW NA  AL+  +W ERN R+F 
Subjt:  DILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPPLSLRASILWGNAVRALLSDIWFERNQRVFH

Query:  DLRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNA
        D  +  E  +++ H  AS W+  S  F G  ++ +Q +W A
Subjt:  DLRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNA

A0A5A7T2Y0 zf-RVT domain-containing protein1.2e-2141.13Show/hide
Query:  DILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPPLSLRASILWGNAVRALLSDIWFERNQRVFH
        +ILQ++ S + + PS+CPLC + ++   H+F+ C  +   W ++   F L W F  +L  SV QLL+G  L     I+W    +ALL +IW ERNQR+FH
Subjt:  DILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPPLSLRASILWGNAVRALLSDIWFERNQRVFH

Query:  DLRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNA
        D  +       AA L A+AW SL   F  +SI DI  NWNA
Subjt:  DLRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNA

A0A5A7V5N8 GPI-anchor transamidase isoform X14.1e-2541.38Show/hide
Query:  ILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPPLSLRASILWGNAVRALLSDIWFERNQRVFHD
        ++QR L + CL PS C LC EE E    LF  C ++  CW  LL  FG+ W F G+   ++ Q+L G  L     ++WGN  +ALLSDIWFE NQR+F  
Subjt:  ILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPPLSLRASILWGNAVRALLSDIWFERNQRVFHD

Query:  LRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNAFIFPA
            W    + A   A+ W  L+  F  +SI D+  NW AFI  A
Subjt:  LRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNAFIFPA

A0A5D3DE60 zf-RVT domain-containing protein5.6e-2240.56Show/hide
Query:  DILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPPLSLRASILWGNAVRALLSDIWFERNQRVFH
        +ILQ++ S + + PS+CPLC + ++   H+F+ C  +   W ++   F L W F  +L  SV QLL+G  L     I+W    +ALL +IW ERNQR+FH
Subjt:  DILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPPLSLRASILWGNAVRALLSDIWFERNQRVFH

Query:  DLRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNAFI
        D  +       AA L A+AW SL   F  +SI DI  NWN F+
Subjt:  DLRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNAFI

A0A6J1DIE2 uncharacterized protein LOC1110207651.3e-2644.37Show/hide
Query:  QSTFYKVDILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPP-LSLRASILWGNAVRALLSDIWF
        Q      DI+Q++     L PS C LC +  E   HLF  C FA  CW+ L  +F + W F     ++VYQLL GPP LS     LW N V+ALLS++WF
Subjt:  QSTFYKVDILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPP-LSLRASILWGNAVRALLSDIWF

Query:  ERNQRVFHDLRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNAFI
        ERN R+F + R+ ++ SF +A  KAS W SL  +F   S S I  NW AFI
Subjt:  ERNQRVFHDLRKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNAFI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGTCGACATTCTACAAAGTCGACATTCTACAAAGAAGGCTGTCGCACATGTGTCTCCAACCTTCGGTCTGCCCCCTATGTTTCGAAGAGGCAGAAACAGGCTTCCA
CTTGTTTGTGGGCTGCTCCTTCGCTGGGAATTGTTGGTCGAAGCTTCTCCGGGAGTTTGGCCTAGGGTGGGTCTTTGCAGGGAACCTCAAGGAGAGTGTGTATCAGTTGT
TGGCAGGCCCTCCGCTATCGCTAAGGGCCTCAATTTTGTGGGGAAACGCAGTAAGGGCGCTGTTGTCAGATATCTGGTTTGAGCGAAATCAGAGGGTCTTTCATGATCTT
AGGAAGCCTTGGGAAATCAGTTTCGAAGCTGCTCATCTCAAGGCGTCGGCTTGGAGTTCTCTTTCAGGGGCGTTTGCAGGATTCTCAATTTCTGACATCCAGACCAATTG
GAATGCTTTTATTTTTCCTGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAGTCGACATTCTACAAAGTCGACATTCTACAAAGAAGGCTGTCGCACATGTGTCTCCAACCTTCGGTCTGCCCCCTATGTTTCGAAGAGGCAGAAACAGGCTTCCA
CTTGTTTGTGGGCTGCTCCTTCGCTGGGAATTGTTGGTCGAAGCTTCTCCGGGAGTTTGGCCTAGGGTGGGTCTTTGCAGGGAACCTCAAGGAGAGTGTGTATCAGTTGT
TGGCAGGCCCTCCGCTATCGCTAAGGGCCTCAATTTTGTGGGGAAACGCAGTAAGGGCGCTGTTGTCAGATATCTGGTTTGAGCGAAATCAGAGGGTCTTTCATGATCTT
AGGAAGCCTTGGGAAATCAGTTTCGAAGCTGCTCATCTCAAGGCGTCGGCTTGGAGTTCTCTTTCAGGGGCGTTTGCAGGATTCTCAATTTCTGACATCCAGACCAATTG
GAATGCTTTTATTTTTCCTGCTTAG
Protein sequenceShow/hide protein sequence
MQSTFYKVDILQRRLSHMCLQPSVCPLCFEEAETGFHLFVGCSFAGNCWSKLLREFGLGWVFAGNLKESVYQLLAGPPLSLRASILWGNAVRALLSDIWFERNQRVFHDL
RKPWEISFEAAHLKASAWSSLSGAFAGFSISDIQTNWNAFIFPA