CuGenDBv2

Tan0019211 (gene) of Snake gourd v1 genome

Gene ID: Tan0019211
Organism: Trichosanthes anguina (Snake gourd v1)
Description: RNA-directed RNA polymerase
Genome location: LG02:96924433..96936831
RNA-Seq Expression: Tan0019211
Synteny: Tan0019211
Gene Ontology terms: GO:0006508 - proteolysis (biological process)
                     GO:0004252 - serine-type endopeptidase activity (molecular function)
InterPro domains: IPR036249 - Thioredoxin-like superfamily


Homology

GenBank top hits
KAG6598213.1 Type 2 DNA topoisomerase 6 subunit B-like protein, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.7e-55; % identity: 75.33)
Query:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR
        MGDPT LDRML HL             LGPSRVIHFTSEREFVQLLHEGYPVVVAFT+RGNYTKHLDK LEEAAVEFYP VKFMRVECPKYPGFCISRQR
Subjt:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR

Query:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF
        KEYPFIEMFHSPQQ            AS+QGKI+DP++TKYSVKVLP ++
Subjt:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF

XP_022132238.1 uncharacterized protein LOC111005143 [Momordica charantia] (e value: 1.1e-56; % identity: 76.67)
Query:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR
        MG+PT  DRM+SHL             LGPSRVIHFTSEREFVQLLHEGYPVVVAFT+RGNYTKHLDK LEEAAVEFYPNVKFMRVECPKYPGFCISRQR
Subjt:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR

Query:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF
        KEYPFIEMFHSPQQ            ASNQGKIADPS+TKYSVKVLP ++
Subjt:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF

XP_022962537.1 uncharacterized protein LOC111462937 [Cucurbita moschata] (e value: 7.0e-54; % identity: 74)
Query:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR
        MGDP  LDRML HL             LGPSRVIHFTSEREFVQLLHEGYPVVVAFT+RGNYTKHLDK LEEAAVEFYP VKFMRVECPKYPGFCISRQR
Subjt:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR

Query:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF
        KEYPFIEMFHSP+Q            AS+QGKI+DP++TKYSVKVLP ++
Subjt:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF

XP_022996547.1 uncharacterized protein LOC111491762 [Cucurbita maxima] (e value: 3.7e-55; % identity: 75.33)
Query:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR
        MGDPT LDRML HL             LGPSRVIHFTSEREFVQLLHEGYPVVVAFT+RGNYTKHLDK LEEAAVEFYP VKFMRVECPKYPGFCISRQR
Subjt:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR

Query:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF
        KEYPFIEMFHSPQQ            AS+QGKI+DP++TKYSVKVLP ++
Subjt:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF

XP_038885436.1 uncharacterized protein LOC120075825 isoform X1 [Benincasa hispida] (e value: 1.7e-55; % identity: 75.33)
Query:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR
        MGDPT LDR+LSHL             LGPSRVIHFTSEREFVQLLHEGYPVVVAFT+RGNYTKHLDK LEEAAVEFYPN+KFMRVECPKYPGFCISRQR
Subjt:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR

Query:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF
        KEYPFIEMFHSPQQ            ASNQGK++D +VTKYSVKVLP ++
Subjt:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF

TrEMBL top hits
A0A1S3BIH6 uncharacterized protein LOC103489940 (e value: 4.9e-53; % identity: 73.33)
Query:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR
        MGDPT LD +L HL             LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIR NY+KHLDK LEEAAVEFYPNVKFMRVECPKYPGFCISRQR
Subjt:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR

Query:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF
        KEYPFIEMFHSP+Q            AS+QGKIAD +VTKYSVKV+P ++
Subjt:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF

A0A5A7U4H2 Uncharacterized protein (e value: 6.4e-53; % identity: 72.55)
Query:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR
        MGDPT LD +L HL             LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIR NY+KHLDK LEEAAVEFYPNVKFMRVECPKYPGFCISRQR
Subjt:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR

Query:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSFQAL
        KEYPFIEMFHSP+Q            AS+QGKIAD +VTKYSVKV+P+   +L
Subjt:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSFQAL

A0A6J1BSI0 uncharacterized protein LOC111005143 (e value: 5.6e-57; % identity: 76.67)
Query:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR
        MG+PT  DRM+SHL             LGPSRVIHFTSEREFVQLLHEGYPVVVAFT+RGNYTKHLDK LEEAAVEFYPNVKFMRVECPKYPGFCISRQR
Subjt:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR

Query:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF
        KEYPFIEMFHSPQQ            ASNQGKIADPS+TKYSVKVLP ++
Subjt:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF

A0A6J1HCY2 uncharacterized protein LOC111462937 (e value: 3.4e-54; % identity: 74)
Query:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR
        MGDP  LDRML HL             LGPSRVIHFTSEREFVQLLHEGYPVVVAFT+RGNYTKHLDK LEEAAVEFYP VKFMRVECPKYPGFCISRQR
Subjt:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR

Query:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF
        KEYPFIEMFHSP+Q            AS+QGKI+DP++TKYSVKVLP ++
Subjt:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF

A0A6J1K286 uncharacterized protein LOC111491762 (e value: 1.8e-55; % identity: 75.33)
Query:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR
        MGDPT LDRML HL             LGPSRVIHFTSEREFVQLLHEGYPVVVAFT+RGNYTKHLDK LEEAAVEFYP VKFMRVECPKYPGFCISRQR
Subjt:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR

Query:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF
        KEYPFIEMFHSPQQ            AS+QGKI+DP++TKYSVKVLP ++
Subjt:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF

SwissProt top hits
No hits found

Arabidopsis top hits
AT5G57230.1 Thioredoxin superfamily protein (e value: 5.3e-52; % identity: 64.67)
Query:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR
        MGD T LDRML  L             LGPSRV+HFTSEREFVQLLH+GYPVVVAFTIR NYT+HLD+ LEEAA EFYPN+KFMRVECPKYPGFCI+RQ+
Subjt:  MGDPTTLDRMLSHLH------------LGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQR

Query:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF
         EYPFIE+FHSPQ  G            N+GK+ DP++T+YSVKV+P ++
Subjt:  KEYPFIEMFHSPQQVGIAHLILCLVLASNQGKIADPSVTKYSVKVLPVSF


Sequences

CDS sequence
ATGGGAGACCCAACAACTTTGGATAGAATGCTCAGTCATCTTCATCTTGGGCCTTCTAGGGTCATACATTTTACCTCTGAACGTGAGTTTGTTCAGCTCCTTCACGAAGG
TTATCCAGTCGTCGTTGCTTTTACAATCAGAGGTAACTACACAAAGCATCTTGACAAAGAATTGGAGGAAGCTGCGGTTGAGTTTTATCCAAATGTGAAATTTATGCGTG
TTGAGTGCCCAAAGTATCCTGGTTTCTGCATTTCACGGCAGAGAAAGGAATATCCATTTATTGAGATGTTTCATAGTCCACAACAAGTAGGAATTGCCCATCTGATTTTG
TGTTTGGTCTTGGCATCTAACCAGGGAAAGATTGCTGATCCTAGCGTTACGAAGTACTCGGTGAAGGTTCTACCTGTAAGTTTTCAAGCTTTGTCAACCTCAATTCTACA
CAATATCTCCTCTTCCATCACATATTCAATTATGACCCCAGTGCCTACGGATTCAGAGAATTTTTCAAGCGTCATGGGATATATGGTCGTTGAATCAAAAGTAATAACAT
TTTATTATCTGCTATGTTTTTATAGTATTATGCAATCCGAAATCCTGAACCAAACATCAGAGACTGCAGTTGGGTCCTTGTAG

mRNA sequence
ATGGGAGACCCAACAACTTTGGATAGAATGCTCAGTCATCTTCATCTTGGGCCTTCTAGGGTCATACATTTTACCTCTGAACGTGAGTTTGTTCAGCTCCTTCACGAAGG
TTATCCAGTCGTCGTTGCTTTTACAATCAGAGGTAACTACACAAAGCATCTTGACAAAGAATTGGAGGAAGCTGCGGTTGAGTTTTATCCAAATGTGAAATTTATGCGTG
TTGAGTGCCCAAAGTATCCTGGTTTCTGCATTTCACGGCAGAGAAAGGAATATCCATTTATTGAGATGTTTCATAGTCCACAACAAGTAGGAATTGCCCATCTGATTTTG
TGTTTGGTCTTGGCATCTAACCAGGGAAAGATTGCTGATCCTAGCGTTACGAAGTACTCGGTGAAGGTTCTACCTGTAAGTTTTCAAGCTTTGTCAACCTCAATTCTACA
CAATATCTCCTCTTCCATCACATATTCAATTATGACCCCAGTGCCTACGGATTCAGAGAATTTTTCAAGCGTCATGGGATATATGGTCGTTGAATCAAAAGTAATAACAT
TTTATTATCTGCTATGTTTTTATAGTATTATGCAATCCGAAATCCTGAACCAAACATCAGAGACTGCAGTTGGGTCCTTGTAG

Protein sequence
MGDPTTLDRMLSHLHLGPSRVIHFTSEREFVQLLHEGYPVVVAFTIRGNYTKHLDKELEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQVGIAHLIL
CLVLASNQGKIADPSVTKYSVKVLPVSFQALSTSILHNISSSITYSIMTPVPTDSENFSSVMGYMVVESKVITFYYLLCFYSIMQSEILNQTSETAVGSL
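
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate` and the choice of the standard code (NCBI translation table 1) are assumptions made for illustration, and only the first 30 nucleotides of the CDS are used in the demonstration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (hypothetical helper), stopping at the first stop codon."""
    # Remove line breaks and spaces so a multi-line sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed in this record.
print(translate("ATGGGAGACCCAACAACTTTGGATAGAATG"))  # prints MGDPTTLDRM
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence above would confirm whether the two entries are consistent.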