; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004745 (gene) of Snake gourd v1 genome

Gene IDTan0004745
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionzf-RVT domain-containing protein
Genome locationLG06:21666075..21666551
RNA-Seq ExpressionTan0004745
SyntenyTan0004745
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW26228.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]4.3e-2437.25Show/hide
Query:  MWILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLA
        +W++ H  +NT D LQ + PH +L P +C LC+++GES  HLF+ CS T   W +LF    ++WV   ++ + +F    G   S RG +LW NA   L+ 
Subjt:  MWILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLA

Query:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKA
         +W ERN RIF D   + E  ++++H  AS W+  S  F G  ++ +Q +W A
Subjt:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKA

RVW39517.1 putative ribonuclease H protein [Vitis vinifera]5.7e-2437.25Show/hide
Query:  MWILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLA
        +W++ H  +NT D LQ + PH +L P +C LC+ +GES  HLF+ CS T   W +LF    ++W+   N+ + +F    G   S RG +LW NA   L+ 
Subjt:  MWILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLA

Query:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKA
         +W ERN RIF D   + E  ++++H  AS W+  S  F G  ++ +Q +W A
Subjt:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKA

TYK21876.1 hypothetical protein E5676_scaffold494G00090 [Cucumis melo var. makuwa]1.5e-2438.71Show/hide
Query:  MWILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLA
        +WI++   +N+++ LQ+K P +++ PS+CPLCL+  ++  H+F+ C  +   W ++F  FNL W F  +L  SV QLL+G  L     I+W    + LL 
Subjt:  MWILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLA

Query:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKAFI
        +IW ERNQRIF+D          A  L A+AW SL + F  +S+ DI  NW  F+
Subjt:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKAFI

XP_022153214.1 uncharacterized protein LOC111020765 [Momordica charantia]5.6e-3247.74Show/hide
Query:  WILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAG-PLLSSRGSILWGNAVRGLLA
        WIL  G LNTAD +Q+K P  +LLPS C LC + GE   HLF  C F   CW  LF +FN++W F     ++V+QLL G P LSS    LW N V+ LL+
Subjt:  WILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAG-PLLSSRGSILWGNAVRGLLA

Query:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKAFI
        ++WFERN R+F + R  ++  F +   KAS W SL  +F   S S I ANW AFI
Subjt:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKAFI

XP_038903695.1 uncharacterized protein LOC120090219 [Benincasa hispida]5.1e-3347.74Show/hide
Query:  MWILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLA
        +WI+L G LN A+ LQ+K P  SL P+VCP CL   E   HLF  C ++  CW KL   FNL      + K +VFQLLA P       +LW NAV+ LLA
Subjt:  MWILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLA

Query:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKAFI
        D+WFERNQRIF +   S +   EA   +AS+W  LS  F  +S+SD   NW+AFI
Subjt:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKAFI

TrEMBL top hitse value%identityAlignment
A0A438CSQ5 Transposon TX1 uncharacterized 149 kDa protein2.1e-2437.25Show/hide
Query:  MWILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLA
        +W++ H  +NT D LQ + PH +L P +C LC+++GES  HLF+ CS T   W +LF    ++WV   ++ + +F    G   S RG +LW NA   L+ 
Subjt:  MWILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLA

Query:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKA
         +W ERN RIF D   + E  ++++H  AS W+  S  F G  ++ +Q +W A
Subjt:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKA

A0A438DVL1 Putative ribonuclease H protein2.7e-2437.25Show/hide
Query:  MWILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLA
        +W++ H  +NT D LQ + PH +L P +C LC+ +GES  HLF+ CS T   W +LF    ++W+   N+ + +F    G   S RG +LW NA   L+ 
Subjt:  MWILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLA

Query:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKA
         +W ERN RIF D   + E  ++++H  AS W+  S  F G  ++ +Q +W A
Subjt:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKA

A0A5A7T2Y0 zf-RVT domain-containing protein1.8e-2338.56Show/hide
Query:  MWILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLA
        +WI++   + +++ LQ+K P +++ PS+CPLCL+  ++  H+F+ C  +   W ++F  FNL W F  +L  SV QLL+G  L     I+W    + LL 
Subjt:  MWILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLA

Query:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKA
        +IW ERNQRIF+D          A  L A+AW SL + F  +S+ DI  NW A
Subjt:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKA

A0A5D3DE60 zf-RVT domain-containing protein7.2e-2538.71Show/hide
Query:  MWILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLA
        +WI++   +N+++ LQ+K P +++ PS+CPLCL+  ++  H+F+ C  +   W ++F  FNL W F  +L  SV QLL+G  L     I+W    + LL 
Subjt:  MWILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLA

Query:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKAFI
        +IW ERNQRIF+D          A  L A+AW SL + F  +S+ DI  NW  F+
Subjt:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKAFI

A0A6J1DIE2 uncharacterized protein LOC1110207652.7e-3247.74Show/hide
Query:  WILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAG-PLLSSRGSILWGNAVRGLLA
        WIL  G LNTAD +Q+K P  +LLPS C LC + GE   HLF  C F   CW  LF +FN++W F     ++V+QLL G P LSS    LW N V+ LL+
Subjt:  WILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAG-PLLSSRGSILWGNAVRGLLA

Query:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKAFI
        ++WFERN R+F + R  ++  F +   KAS W SL  +F   S S I ANW AFI
Subjt:  DIWFERNQRIFNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKAFI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein8.2e-0527.78Show/hide
Query:  WILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLAD
        W++    L+T D LQ        +P+ C LC    +S  HLF EC F+   W       NL       L + +  LL+ P       ++   A    +  
Subjt:  WILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLAD

Query:  IWFERNQRIFNDLRNSWEVCFEAMHL
        IW ERNQR+ + +  S E   + + L
Subjt:  IWFERNQRIFNDLRNSWEVCFEAMHL

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein9.0e-0432.26Show/hide
Query:  LPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLADIWFERNQRIFNDLRNS
        +PS   LC    E+  HLF ECSF+   W     +F     F   L  +   +L  PL S   +IL    ++  +  +W ERN RIF  + +S
Subjt:  LPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLADIWFERNQRIFNDLRNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGATTCTGTTGCACGGGAGTCTAAACACGGCTGACACGCTGCAAAGAAAATTGCCGCACATGAGTCTTCTTCCTTCAGTCTGTCCCCTATGCTTGCAGGAAGGAGA
ATCAGAGTTTCATTTGTTTATAGAATGTTCATTCACAAGGAACTGTTGGTTAAAGCTATTTTGGGAATTCAATCTAGAGTGGGTTTTTGCAGGAAACCTCAAGGAAAGCG
TGTTCCAGTTGCTAGCAGGTCCTTTGCTTTCGTCGAGAGGTTCGATTCTGTGGGGAAATGCAGTAAGGGGGTTGTTGGCAGACATATGGTTCGAACGAAATCAGAGGATT
TTCAACGATCTTAGAAATTCTTGGGAAGTTTGCTTCGAAGCCATGCACCTCAAGGCGTCGGCATGGAGTTCTCTATCAAGGGCATTCACTGGATTCTCAGTTTCTGATAT
TCAAGCTAATTGGAAGGCTTTTATTTTTCCTGTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGGATTCTGTTGCACGGGAGTCTAAACACGGCTGACACGCTGCAAAGAAAATTGCCGCACATGAGTCTTCTTCCTTCAGTCTGTCCCCTATGCTTGCAGGAAGGAGA
ATCAGAGTTTCATTTGTTTATAGAATGTTCATTCACAAGGAACTGTTGGTTAAAGCTATTTTGGGAATTCAATCTAGAGTGGGTTTTTGCAGGAAACCTCAAGGAAAGCG
TGTTCCAGTTGCTAGCAGGTCCTTTGCTTTCGTCGAGAGGTTCGATTCTGTGGGGAAATGCAGTAAGGGGGTTGTTGGCAGACATATGGTTCGAACGAAATCAGAGGATT
TTCAACGATCTTAGAAATTCTTGGGAAGTTTGCTTCGAAGCCATGCACCTCAAGGCGTCGGCATGGAGTTCTCTATCAAGGGCATTCACTGGATTCTCAGTTTCTGATAT
TCAAGCTAATTGGAAGGCTTTTATTTTTCCTGTCTAA
Protein sequenceShow/hide protein sequence
MWILLHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESEFHLFIECSFTRNCWLKLFWEFNLEWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGLLADIWFERNQRI
FNDLRNSWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQANWKAFIFPV