CuGenDBv2

Tan0022740 (gene) of Snake gourd v1 genome

Gene ID: Tan0022740
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Transposon Ty3-I Gag-Pol polyprotein
Genome location: LG11:37545324..37550328
RNA-Seq Expression: Tan0022740
Synteny: Tan0022740
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
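The genome location listed above, `LG11:37545324..37550328`, can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG11:37545324..37550328")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 5005 bp on LG11.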


Homology
GenBank top hits (e value, % identity, alignment)
KAA0051909.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa] (e value 1.7e-20, 66.67% identity)
Query:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKM
        DDV+   + LHVPEG IT+ KAKKIQEAFTLH+QKL NAQ ETK FE + LYNVS+ SQ+E+GVKMAREKL   +DGT ++KSVQ +
Subjt:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKM

KAA0065364.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa] (e value 1.0e-20, 63.54% identity)
Query:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKMFDVLIKKMS
        DDV+   + LHVPEG+IT+ KAKKIQEAFTLH+QKL NAQ ETK FE + LYNVS+ SQ+E+ VKMAREKL  L+DGT ++KSVQ   +V I+  S
Subjt:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKMFDVLIKKMS

KAA0067206.1 hypothetical protein E6C27_scaffold418G00080 [Cucumis melo var. makuwa] (e value 5.0e-20, 65.52% identity)
Query:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKM
        DDV+   + LHVPEG IT+ KAKKIQEAFTLH+QKL NAQ E K FEP+ LYNVS+ SQ+E+ +KMA EKL  L+DGT D+KSVQ +
Subjt:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKM

TYK02449.1 F15O4.13 [Cucumis melo var. makuwa] (e value 3.5e-21, 67.82% identity)
Query:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKM
        DDV+   + LHVPEG IT+ KAKKIQEAFTLH+QKL NAQ ETK FE + LYNVS+ SQ+E+GVKMAREKL  L+DGT ++KSVQ +
Subjt:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKM

TYK20861.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa] (e value 1.0e-20, 63.54% identity)
Query:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKMFDVLIKKMS
        DDV+   + LHVPEG+IT+ KAKKIQEAFTLH+QKL NAQ ETK FE + LYNVS+ SQ+E+ VKMAREKL  L+DGT ++KSVQ   +V I+  S
Subjt:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKMFDVLIKKMS

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UEI2 Transposon Ty3-I Gag-Pol polyprotein (e value 8.3e-21, 66.67% identity)
Query:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKM
        DDV+   + LHVPEG IT+ KAKKIQEAFTLH+QKL NAQ ETK FE + LYNVS+ SQ+E+GVKMAREKL   +DGT ++KSVQ +
Subjt:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKM

A0A5A7VDK8 Transposon Ty3-I Gag-Pol polyprotein (e value 4.9e-21, 63.54% identity)
Query:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKMFDVLIKKMS
        DDV+   + LHVPEG+IT+ KAKKIQEAFTLH+QKL NAQ ETK FE + LYNVS+ SQ+E+ VKMAREKL  L+DGT ++KSVQ   +V I+  S
Subjt:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKMFDVLIKKMS

A0A5A7VL78 RT_RNaseH_2 domain-containing protein (e value 2.4e-20, 65.52% identity)
Query:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKM
        DDV+   + LHVPEG IT+ KAKKIQEAFTLH+QKL NAQ E K FEP+ LYNVS+ SQ+E+ +KMA EKL  L+DGT D+KSVQ +
Subjt:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKM

A0A5D3BWE8 F15O4.13 (e value 1.7e-21, 67.82% identity)
Query:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKM
        DDV+   + LHVPEG IT+ KAKKIQEAFTLH+QKL NAQ ETK FE + LYNVS+ SQ+E+GVKMAREKL  L+DGT ++KSVQ +
Subjt:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKM

A0A5D3DB73 Transposon Ty3-I Gag-Pol polyprotein (e value 4.9e-21, 63.54% identity)
Query:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKMFDVLIKKMS
        DDV+   + LHVPEG+IT+ KAKKIQEAFTLH+QKL NAQ ETK FE + LYNVS+ SQ+E+ VKMAREKL  L+DGT ++KSVQ   +V I+  S
Subjt:  DDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKMFDVLIKKMS

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value 1.7e-07, 40.74% identity)
Query:  NGVQVDEEKVKAIREWPTPTNANEVRSFHGLASFYRRFIKNFSSIASPLNELVK
        +G++ + EK++AI+++P PT   E+++F GL  +YR+FI NF+ IA P+ + +K
Subjt:  NGVQVDEEKVKAIREWPTPTNANEVRSFHGLASFYRRFIKNFSSIASPLNELVK

P10401 Retrovirus-related Pol polyprotein from transposon gypsy (e value 7.0e-09, 53.7% identity)
Query:  NGVQVDEEKVKAIREWPTPTNANEVRSFHGLASFYRRFIKNFSSIASPLNELVK
        +G + D EKVKAI+E+P P    +VRSF GLAS+YR FIK+F++IA P+ +++K
Subjt:  NGVQVDEEKVKAIREWPTPTNANEVRSFHGLASFYRRFIKNFSSIASPLNELVK

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value 2.5e-06, 42.59% identity)
Query:  NGVQVDEEKVKAIREWPTPTNANEVRSFHGLASFYRRFIKNFSSIASPLNELVK
        +G++ +  KVKAI  +P PT   E+R+F GL  +YR+FI N++ IA P+   +K
Subjt:  NGVQVDEEKVKAIREWPTPTNANEVRSFHGLASFYRRFIKNFSSIASPLNELVK

P92523 Uncharacterized mitochondrial protein AtMg00860 (e value 2.0e-08, 49.06% identity)
Query:  GVQVDEEKVKAIREWPTPTNANEVRSFHGLASFYRRFIKNFSSIASPLNELVK
        GV  D  K++A+  WP P N  E+R F GL  +YRRF+KN+  I  PL EL+K
Subjt:  GVQVDEEKVKAIREWPTPTNANEVRSFHGLASFYRRFIKNFSSIASPLNELVK

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus (e value 1.3e-07, 39.29% identity)
Query:  NGVQVDEEKVKAIREWPTPTNANEVRSFHGLASFYRRFIKNFSSIASPLNELVKSM
        +G++ D +KV+AI E P PT+  E++ F G+ S+YR+FI++++ +A PL  L + +
Subjt:  NGVQVDEEKVKAIREWPTPTNANEVRSFHGLASFYRRFIKNFSSIASPLNELVKSM

Arabidopsis top hits (e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein (e value 1.5e-09, 49.06% identity)
Query:  GVQVDEEKVKAIREWPTPTNANEVRSFHGLASFYRRFIKNFSSIASPLNELVK
        GV  D  K++A+  WP P N  E+R F GL  +YRRF+KN+  I  PL EL+K
Subjt:  GVQVDEEKVKAIREWPTPTNANEVRSFHGLASFYRRFIKNFSSIASPLNELVK


Sequences
CDS sequence
ATGAAATCAAAGGATGATGTGAATCAAACCATTGATCCATTGCATGTACCTGAAGGAGCAATAACAAGGAGCAAGGCCAAGAAGATTCAAGAGGCTTTCACATTGCATCT
CCAAAAGCTCGTGAATGCACAAGGAGAGACCAAGATTTTTGAGCCCCAAATTCTTTATAATGTTAGTACAACAAGTCAAGATGAGAATGGAGTGAAGATGGCACGAGAAA
AGTTGTCTTGTTTGAAAGATGGCACGGAGGACAAAAAAAGTGTGCAGAAAATGTTTGATGTCCTTATAAAAAAAATGTCACAAGATAATAAAGAAAAAGTTCTGCAAGTG
GTAGATCCAAATATGGCTATTCTTCAAGCAATTCAAGGTATGATGGAGATGATGAGAGAAGAAAGGGACGAAAGGAGAGCACAACAACAAAGAGAAGAACGGATCTTGCA
AGAAGATGAAGGCATGTTTGATTTACAGGTACAAGAAAGAAACTTAGGAGGAAGAGGAAATAATGTGAGTTTGGTTATAAGAAAGGGGGAAATGAGGAATGTGTTGTTGA
CACAACAACCAAAATTTGTATTTATGTGCAAAGGGATGTGTTTAACAACTGACCCAAAATCTAATGATGTTTTGCCAAGTTCTTCTAAGTCTCTTTTGAATGGTGTACAA
GTTGATGAAGAAAAGGTAAAGGCTATCCGAGAGTGGCCAACACCTACCAATGCAAATGAGGTGAGATCTTTTCATGGTTTGGCTAGTTTTTATAGGAGGTTTATTAAGAA
CTTCAGTAGTATAGCCTCACCATTAAATGAACTTGTCAAAAGCATGTGA
mRNA sequence
ATGAAATCAAAGGATGATGTGAATCAAACCATTGATCCATTGCATGTACCTGAAGGAGCAATAACAAGGAGCAAGGCCAAGAAGATTCAAGAGGCTTTCACATTGCATCT
CCAAAAGCTCGTGAATGCACAAGGAGAGACCAAGATTTTTGAGCCCCAAATTCTTTATAATGTTAGTACAACAAGTCAAGATGAGAATGGAGTGAAGATGGCACGAGAAA
AGTTGTCTTGTTTGAAAGATGGCACGGAGGACAAAAAAAGTGTGCAGAAAATGTTTGATGTCCTTATAAAAAAAATGTCACAAGATAATAAAGAAAAAGTTCTGCAAGTG
GTAGATCCAAATATGGCTATTCTTCAAGCAATTCAAGGTATGATGGAGATGATGAGAGAAGAAAGGGACGAAAGGAGAGCACAACAACAAAGAGAAGAACGGATCTTGCA
AGAAGATGAAGGCATGTTTGATTTACAGGTACAAGAAAGAAACTTAGGAGGAAGAGGAAATAATGTGAGTTTGGTTATAAGAAAGGGGGAAATGAGGAATGTGTTGTTGA
CACAACAACCAAAATTTGTATTTATGTGCAAAGGGATGTGTTTAACAACTGACCCAAAATCTAATGATGTTTTGCCAAGTTCTTCTAAGTCTCTTTTGAATGGTGTACAA
GTTGATGAAGAAAAGGTAAAGGCTATCCGAGAGTGGCCAACACCTACCAATGCAAATGAGGTGAGATCTTTTCATGGTTTGGCTAGTTTTTATAGGAGGTTTATTAAGAA
CTTCAGTAGTATAGCCTCACCATTAAATGAACTTGTCAAAAGCATGTGA
Protein sequence
MKSKDDVNQTIDPLHVPEGAITRSKAKKIQEAFTLHLQKLVNAQGETKIFEPQILYNVSTTSQDENGVKMAREKLSCLKDGTEDKKSVQKMFDVLIKKMSQDNKEKVLQV
VDPNMAILQAIQGMMEMMREERDERRAQQQREERILQEDEGMFDLQVQERNLGGRGNNVSLVIRKGEMRNVLLTQQPKFVFMCKGMCLTTDPKSNDVLPSSSKSLLNGVQ
VDEEKVKAIREWPTPTNANEVRSFHGLASFYRRFIKNFSSIASPLNELVKSM
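The CDS and protein sequences above can be cross-checked by translating the CDS. This is a minimal sketch, assuming the standard genetic code applies to this nuclear gene (the record does not say which table was used); `translate` is a hypothetical helper, and only the first 30 nucleotides and the last 12 nucleotides of the CDS are used here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS; the protein sequence begins MKSKDDVNQT.
print(translate("ATGAAATCAAAGGATGATGTGAATCAAACC"))
# Last 12 nt of the CDS, ending in the TGA stop codon.
print(translate("AAAAGCATGTGA"))
```

Running `translate` on the full CDS, with the line breaks removed, should give the full protein sequence if the standard-code assumption holds.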