CuGenDBv2

Tan0004594 (gene) of Snake gourd v1 genome

Gene ID: Tan0004594
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase domain-containing protein
Genome location: LG01:90445854..90446468
RNA-Seq Expression: Tan0004594
Synteny: Tan0004594
Gene Ontology terms: NA
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain


Homology
GenBank top hits
KAG2725009.1 hypothetical protein I3760_01G047300 [Carya illinoinensis] (e-value: 5.7e-20; identity: 31.94%)
Query:  LKDDESWNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINHETNPSCSEVATGDHPRLWNCLWKANLLLRAKICVWRVIKN
        ++++ +WNE +I+E F   + E IL+ P+   G   +++WG   KG+FS+RSAY++ +    N   S     +    W+ +WK N+  +AK+ +W+   N
Subjt:  LKDDESWNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINHETNPSCSEVATGDHPRLWNCLWKANLLLRAKICVWRVIKN

Query:  VIPTKGNLIKKGIDTNPVCLLCRNKRENTIHVIWSC---RDDWL
         I TK NL+ K +  N +C +C+   E+ +H++W C    D W+
Subjt:  VIPTKGNLIKKGIDTNPVCLLCRNKRENTIHVIWSC---RDDWL

PNX96793.1 ribonuclease H, partial [Trifolium pratense] (e-value: 2.2e-19; identity: 34.18%)
Query:  WNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINHETNPSCSEVATGDHP-RLWNCLWKANLLLRAKICVWRVIKNVIPTK
        WNE +I + F + D + IL  PI  K     + W +   G++S++S YH  I+ ++N + +  A+ + P  +W  LWK  +  +    +WR++ N IP K
Subjt:  WNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINHETNPSCSEVATGDHP-RLWNCLWKANLLLRAKICVWRVIKNVIPTK

Query:  GNLIKKGIDTNPVCLLCRNKRENTIHVIWSCRDDWLPFAFWDWLAKNLSEEELNLAKL
        GNL KKG+  +P+C  C N  E+  HV   C  +W   A   W A  L+   LNL +L
Subjt:  GNLIKKGIDTNPVCLLCRNKRENTIHVIWSCRDDWLPFAFWDWLAKNLSEEELNLAKL

XP_042939515.1 uncharacterized protein LOC122274552 [Carya illinoinensis] (e-value: 8.3e-19; identity: 28.87%)
Query:  VSVFLKDD-ESWNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINHETNPSCSEVATGDHPR-LWNCLWKANLLLRAKICV
        VS  ++D+ ++W +E+++  F   + + I + PI  K    ++IWG   KG F+++SAYH  +  + N +  E ++GD  + LW  LW+     + KI +
Subjt:  VSVFLKDD-ESWNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINHETNPSCSEVATGDHPR-LWNCLWKANLLLRAKICV

Query:  WRVIKNVIPTKGNLIKKGIDTNPVCLLCRNKRENTIHVIWSCR-------DDWLPF-----------AFWDWLAKNLSEEELNLAKLLFGRYGM
        W+ +  ++PT+  L K+ +  N  C +C  + E  IHV+W CR       +D  P            + W  L   L E EL++   +F  YG+
Subjt:  WRVIKNVIPTKGNLIKKGIDTNPVCLLCRNKRENTIHVIWSCR-------DDWLPF-----------AFWDWLAKNLSEEELNLAKLLFGRYGM

XP_042958109.1 uncharacterized protein LOC122293655 [Carya illinoinensis] (e-value: 5.7e-20; identity: 28.87%)
Query:  VSVFLKDD-ESWNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINHETNPSCSEVATGDHPR-LWNCLWKANLLLRAKICV
        VS  ++D+ ++W +E+++  F   + + I + PI  KG   ++IWG   KG F+++SAYH  +  + N +  E ++GD  + LW  LW+     + KI +
Subjt:  VSVFLKDD-ESWNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINHETNPSCSEVATGDHPR-LWNCLWKANLLLRAKICV

Query:  WRVIKNVIPTKGNLIKKGIDTNPVCLLCRNKRENTIHVIWSC-------RDDWLPF-----------AFWDWLAKNLSEEELNLAKLLFGRYGM
        W+ +  ++PT+  L K+ +  N  C +C  + E  IHV+W C        +D  P            + W  L   L E EL++   +F  YG+
Subjt:  WRVIKNVIPTKGNLIKKGIDTNPVCLLCRNKRENTIHVIWSC-------RDDWLPF-----------AFWDWLAKNLSEEELNLAKLLFGRYGM

XP_042958152.1 uncharacterized protein LOC122293725 [Carya illinoinensis] (e-value: 4.9e-19; identity: 32.86%)
Query:  VSVFLKDDESWNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINHETNPSCSEVATGDHPRLWNCLWKANLLLRAKICVWR
        VS  +    SWNE +IK  F   + E I + P+   G +  +IWG   KG+FS+RSAYH+    +           +  R W+ +WK N+  +  I  W+
Subjt:  VSVFLKDDESWNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINHETNPSCSEVATGDHPRLWNCLWKANLLLRAKICVWR

Query:  VIKNVIPTKGNLIKKGIDTNPVCLLCRNKRENTIHVIWSC
             + T+ NL  + I  NP+C +C  + E  +HVIW C
Subjt:  VIKNVIPTKGNLIKKGIDTNPVCLLCRNKRENTIHVIWSC

TrEMBL top hits
A0A2K3N168 Ribonuclease H (Fragment) (e-value: 1.1e-19; identity: 34.18%)
Query:  WNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINHETNPSCSEVATGDHP-RLWNCLWKANLLLRAKICVWRVIKNVIPTK
        WNE +I + F + D + IL  PI  K     + W +   G++S++S YH  I+ ++N + +  A+ + P  +W  LWK  +  +    +WR++ N IP K
Subjt:  WNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINHETNPSCSEVATGDHP-RLWNCLWKANLLLRAKICVWRVIKNVIPTK

Query:  GNLIKKGIDTNPVCLLCRNKRENTIHVIWSCRDDWLPFAFWDWLAKNLSEEELNLAKL
        GNL KKG+  +P+C  C N  E+  HV   C  +W   A   W A  L+   LNL +L
Subjt:  GNLIKKGIDTNPVCLLCRNKRENTIHVIWSCRDDWLPFAFWDWLAKNLSEEELNLAKL

A0A2N9GTH0 Uncharacterized protein (e-value: 2.4e-19; identity: 31.61%)
Query:  WNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINH-ETNPSCSEVATGDHPR-LWNCLWKANLLLRAKICVWRVIKNVIPT
        W  EV+ + F     E IL  P+  +    ++IW     G++ ++SAY++  +H E   +C E + G+  + LW+ +W  ++  + K  VWR   N++PT
Subjt:  WNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINH-ETNPSCSEVATGDHPR-LWNCLWKANLLLRAKICVWRVIKNVIPT

Query:  KGNLIKKGIDTNPVCLLCRNKRENTIHVIWSCRDDWLPFAFWDWLAKNLSEEELN
        K NL KK I ++  C +C +  E+TIH +W C     P A   WL+ NL   +++
Subjt:  KGNLIKKGIDTNPVCLLCRNKRENTIHVIWSCRDDWLPFAFWDWLAKNLSEEELN

A0A2N9IWN7 Uncharacterized protein (e-value: 2.4e-19; identity: 36.36%)
Query:  WNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYH--MAINHETNPSCSEVATGDHPRLWNCLWKANLLLRAKICVWRVIKNVIPT
        W E VI   F  L+   IL  P+  +  + +IIW   P G+FSIRSAYH  + ++    PS S  +TG  P LWN +W   +  + +  +WR  ++ +PT
Subjt:  WNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYH--MAINHETNPSCSEVATGDHPRLWNCLWKANLLLRAKICVWRVIKNVIPT

Query:  KGNLIKKGIDTNPVCLLCRNKRENTIHVIWSC
        K NL ++ +  +P+C  C +  E+  HVIW+C
Subjt:  KGNLIKKGIDTNPVCLLCRNKRENTIHVIWSC

A0A2Z6NX89 zf-RVT domain-containing protein (e-value: 8.9e-19; identity: 31.03%)
Query:  SWNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINHETNPSCSEVATGDHPRLWNCLWKANLLLRAKICVWRVIKNVIPTK
        SWN  +I + F   + + ILN P+  +    ++IW  +  G FS+RSA+HM I    N +  E ++ ++  +W  +WK       K  +WR+ ++++PT+
Subjt:  SWNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINHETNPSCSEVATGDHPRLWNCLWKANLLLRAKICVWRVIKNVIPTK

Query:  GNLIKKGIDTNPVCLLCRNKRENTIHVIWSCR---DDWL----------PFAFWDWLAKNLSEEELNLAKLLFG
        G L +KG+  + +CLLC    EN  H+   CR     W             +  DWL + LS + + LA  LFG
Subjt:  GNLIKKGIDTNPVCLLCRNKRENTIHVIWSCR---DDWL----------PFAFWDWLAKNLSEEELNLAKLLFG

A0A7N2LKM6 zf-RVT domain-containing protein (e-value: 2.1e-20; identity: 31.82%)
Query:  ESWNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINHETNPSCSEVATGDHPRLWNCLWKANLLLRAKICVWRVIKNVIPT
        + W+  V+   F     EDI    +     K E++W E+ KG FS+++AY +AI  +        +T D  R+WNCLW+ ++  + +  VWR   +++PT
Subjt:  ESWNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINHETNPSCSEVATGDHPRLWNCLWKANLLLRAKICVWRVIKNVIPT

Query:  KGNLIKKGIDTNPVCLLCRNKRENTIHVIWSC
        + NL ++ +  +PVC +C+ + E   H +WSC
Subjt:  KGNLIKKGIDTNPVCLLCRNKRENTIHVIWSC

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGTTGGAAAGAGAGTCTCTGTCTTCCTTAAAGATGATGAAAGTTGGAATGAGGAAGTGATTAAAGAGGGTTTCTGCAACCTTGATTGTGAGGACATTTTAAATACCCC
TATAGGCCCCAAAGGGTTTAAAGGCGAAATTATTTGGGGGGAGGATCCTAAGGGGGTGTTTTCAATCAGAAGCGCTTATCATATGGCTATTAATCACGAGACCAACCCGT
CGTGCTCTGAGGTGGCTACGGGGGATCACCCTAGACTTTGGAACTGCTTATGGAAAGCCAACTTACTTCTAAGAGCCAAGATTTGTGTTTGGAGAGTGATAAAAAATGTC
ATTCCCACTAAAGGCAATCTTATTAAAAAAGGAATTGATACTAACCCTGTTTGTTTATTATGCAGGAACAAGCGCGAGAACACCATTCATGTCATTTGGAGTTGTAGGGA
CGATTGGTTGCCTTTTGCCTTTTGGGATTGGTTGGCCAAGAACCTATCTGAGGAGGAGCTGAATTTGGCCAAATTATTGTTTGGAAGATATGGAATGCTAGAAATCTCAT
AA
mRNA sequence
ATGGTTGGAAAGAGAGTCTCTGTCTTCCTTAAAGATGATGAAAGTTGGAATGAGGAAGTGATTAAAGAGGGTTTCTGCAACCTTGATTGTGAGGACATTTTAAATACCCC
TATAGGCCCCAAAGGGTTTAAAGGCGAAATTATTTGGGGGGAGGATCCTAAGGGGGTGTTTTCAATCAGAAGCGCTTATCATATGGCTATTAATCACGAGACCAACCCGT
CGTGCTCTGAGGTGGCTACGGGGGATCACCCTAGACTTTGGAACTGCTTATGGAAAGCCAACTTACTTCTAAGAGCCAAGATTTGTGTTTGGAGAGTGATAAAAAATGTC
ATTCCCACTAAAGGCAATCTTATTAAAAAAGGAATTGATACTAACCCTGTTTGTTTATTATGCAGGAACAAGCGCGAGAACACCATTCATGTCATTTGGAGTTGTAGGGA
CGATTGGTTGCCTTTTGCCTTTTGGGATTGGTTGGCCAAGAACCTATCTGAGGAGGAGCTGAATTTGGCCAAATTATTGTTTGGAAGATATGGAATGCTAGAAATCTCAT
AA
Protein sequence
MVGKRVSVFLKDDESWNEEVIKEGFCNLDCEDILNTPIGPKGFKGEIIWGEDPKGVFSIRSAYHMAINHETNPSCSEVATGDHPRLWNCLWKANLLLRAKICVWRVIKNV
IPTKGNLIKKGIDTNPVCLLCRNKRENTIHVIWSCRDDWLPFAFWDWLAKNLSEEELNLAKLLFGRYGMLEIS