CuGenDBv2

Tan0003028 (gene) of Snake gourd v1 genome

Gene ID: Tan0003028
Organism: Trichosanthes anguina (Snake gourd v1)
Description: zf-RVT domain-containing protein
Genome location: LG10:18500448..18500924
RNA-Seq Expression: Tan0003028
Synteny: Tan0003028
Gene Ontology terms: GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0016787 - hydrolase activity (molecular function)
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035739.1 hypothetical protein E6C27_scaffold403G00100 [Cucumis melo var. makuwa] (e value: 4.8e-23; % identity: 36.6)
Query:  MWILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSLS
        +WI++   + +++ LQ+K P +++ PS+CPLCL+  ++  H+F+ C  +   W ++F  FNL W F  +L  SV QLL+G  L     I+W    +  L 
Subjt:  MWILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSLS

Query:  DMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEA
        ++W ERNQR+F+D  +       A  L A+AW SL + F  +S+ DI +NW A
Subjt:  DMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEA

KAA0062564.1 GPI-anchor transamidase isoform X1 [Cucumis melo var. makuwa] (e value: 4.8e-23; % identity: 35.9)
Query:  WILMHGSLNTADTL--QRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSL
        W L H +     +L  QR+L +  L+PS C LCL+EGE    LF  C ++  CW  L   F + W F G+   ++ Q+L G  L     ++WGN  +  L
Subjt:  WILMHGSLNTADTL--QRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSL

Query:  SDMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEAFV
        SD+WFE NQR+F      W    +     A+ W  L++ F  +S+ D+ VNW AF+
Subjt:  SDMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEAFV

TYK21876.1 hypothetical protein E5676_scaffold494G00090 [Cucumis melo var. makuwa] (e value: 2.5e-24; % identity: 36.77)
Query:  MWILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSLS
        +WI++   +N+++ LQ+K P +++ PS+CPLCL+  ++  H+F+ C  +   W ++F  FNL W F  +L  SV QLL+G  L     I+W    +  L 
Subjt:  MWILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSLS

Query:  DMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEAFV
        ++W ERNQR+F+D  +       A  L A+AW SL + F  +S+ DI +NW  F+
Subjt:  DMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEAFV

XP_022153214.1 uncharacterized protein LOC111020765 [Momordica charantia] (e value: 9.6e-32; % identity: 47.1)
Query:  WILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAG-PLLSSRGSILWGNAVRGSLS
        WIL  G LNTAD +Q+K P  +LLPS C LC + GE   HLF  C FA  CW+ LF +FN+ W F     ++V+QLL G P LSS    LW N V+  LS
Subjt:  WILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAG-PLLSSRGSILWGNAVRGSLS

Query:  DMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEAFV
        ++WFERN R+F + R+ ++  F +   KAS W SL  +F   S S I  NW AF+
Subjt:  DMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEAFV

XP_038903695.1 uncharacterized protein LOC120090219 [Benincasa hispida] (e value: 1.5e-32; % identity: 45.16)
Query:  MWILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSLS
        +WI++ G LN A+ LQ+K P  SL P+VCP CL   E  LHLF  C ++  CW+KL   FNL      + K +VFQLLA P       +LW NAV+  L+
Subjt:  MWILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSLS

Query:  DMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEAFV
        D+WFERNQR+F +     +   EA   +AS+W  LS  F  +S+SD  +NWEAF+
Subjt:  DMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEAFV

TrEMBL top hits (e value, % identity, alignment)
A0A438CSQ5 Transposon TX1 uncharacterized 149 kDa protein (e value: 3.0e-23; % identity: 35.29)
Query:  MWILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSLS
        +W++ H  +NT D LQ + PH +L P +C LC+++GES  HLF+ CS     W +LF+   + WV   ++ + +F    G   S RG +LW NA    + 
Subjt:  MWILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSLS

Query:  DMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEA
         +W ERN R+F D  +  E  ++++H  AS W+  S  F G  ++ +Q++W A
Subjt:  DMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEA

A0A5A7T2Y0 zf-RVT domain-containing protein (e value: 2.3e-23; % identity: 36.6)
Query:  MWILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSLS
        +WI++   + +++ LQ+K P +++ PS+CPLCL+  ++  H+F+ C  +   W ++F  FNL W F  +L  SV QLL+G  L     I+W    +  L 
Subjt:  MWILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSLS

Query:  DMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEA
        ++W ERNQR+F+D  +       A  L A+AW SL + F  +S+ DI +NW A
Subjt:  DMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEA

A0A5A7V5N8 GPI-anchor transamidase isoform X1 (e value: 2.3e-23; % identity: 35.9)
Query:  WILMHGSLNTADTL--QRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSL
        W L H +     +L  QR+L +  L+PS C LCL+EGE    LF  C ++  CW  L   F + W F G+   ++ Q+L G  L     ++WGN  +  L
Subjt:  WILMHGSLNTADTL--QRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSL

Query:  SDMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEAFV
        SD+WFE NQR+F      W    +     A+ W  L++ F  +S+ D+ VNW AF+
Subjt:  SDMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEAFV

A0A5D3DE60 zf-RVT domain-containing protein (e value: 1.2e-24; % identity: 36.77)
Query:  MWILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSLS
        +WI++   +N+++ LQ+K P +++ PS+CPLCL+  ++  H+F+ C  +   W ++F  FNL W F  +L  SV QLL+G  L     I+W    +  L 
Subjt:  MWILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSLS

Query:  DMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEAFV
        ++W ERNQR+F+D  +       A  L A+AW SL + F  +S+ DI +NW  F+
Subjt:  DMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEAFV

A0A6J1DIE2 uncharacterized protein LOC111020765 (e value: 4.7e-32; % identity: 47.1)
Query:  WILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAG-PLLSSRGSILWGNAVRGSLS
        WIL  G LNTAD +Q+K P  +LLPS C LC + GE   HLF  C FA  CW+ LF +FN+ W F     ++V+QLL G P LSS    LW N V+  LS
Subjt:  WILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAG-PLLSSRGSILWGNAVRGSLS

Query:  DMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEAFV
        ++WFERN R+F + R+ ++  F +   KAS W SL  +F   S S I  NW AF+
Subjt:  DMWFERNQRVFNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEAFV

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 4.3e-06; % identity: 26.72)
Query:  WILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSLSD
        W+ M   L+T D   R +    + P +C  C    E+  HLF +C FAR  W       +   VF   L E   + L  P      + +   +   S+  
Subjt:  WILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSLSD

Query:  MWFERNQRVFNDIRKP
        +W ERN R+ +   +P
Subjt:  MWFERNQRVFNDIRKP

AT3G25270.1 Ribonuclease H-like superfamily protein (e value: 7.9e-08; % identity: 28.89)
Query:  MWILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKL---FREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRG
        +W L+ G+L T D L+R+  H+   P  C  C QE E+  HLF +C +A+  W       +E        G   E+  +LL    L++R   L+  A+  
Subjt:  MWILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKL---FREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRG

Query:  SLSDMWFERNQRVFNDIRKPWEVCFEAMHLKASAW
         L  +W  RNQ VF      W+   +        W
Subjt:  SLSDMWFERNQRVFNDIRKPWEVCFEAMHLKASAW

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 2.4e-04; % identity: 32.22)
Query:  LPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSLSDMWFERNQRVFNDI
        +PS   LC    E+  HLF ECSF+ + W     +F     F   L  +   +L  PL S   +IL    ++ ++  +W ERN R+F  I
Subjt:  LPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSLSDMWFERNQRVFNDI


Sequences
CDS sequence
ATGTGGATTCTGATGCACGGGAGTCTAAACACGGCGGACACGTTGCAGAGGAAGTTGCCTCACATGAGTCTTCTTCCTTCAGTTTGCCCCCTATGCCTACAGGAAGGAGA
ATCTGGGCTTCATTTGTTTGTAGAATGTTCATTCGCGAGGAGTTGTTGGTCAAAACTATTTCGAGAATTCAACCTAGGATGGGTTTTTGCAGGAAATCTAAAGGAAAGTG
TGTTCCAACTGCTGGCAGGCCCATTGCTTTCATCAAGAGGCTCGATTCTTTGGGGCAATGCAGTAAGGGGTTCTTTGTCGGACATGTGGTTCGAGCGGAATCAGAGGGTC
TTCAATGATATTAGGAAGCCTTGGGAAGTTTGTTTTGAAGCCATGCACCTCAAGGCATCGGCTTGGAGTTCTTTATCAAGGGCATTTACTGGGTTCTCAGTTTCTGATAT
TCAAGTTAATTGGGAGGCTTTTGTTTTTCCTATTTAA
mRNA sequence
ATGTGGATTCTGATGCACGGGAGTCTAAACACGGCGGACACGTTGCAGAGGAAGTTGCCTCACATGAGTCTTCTTCCTTCAGTTTGCCCCCTATGCCTACAGGAAGGAGA
ATCTGGGCTTCATTTGTTTGTAGAATGTTCATTCGCGAGGAGTTGTTGGTCAAAACTATTTCGAGAATTCAACCTAGGATGGGTTTTTGCAGGAAATCTAAAGGAAAGTG
TGTTCCAACTGCTGGCAGGCCCATTGCTTTCATCAAGAGGCTCGATTCTTTGGGGCAATGCAGTAAGGGGTTCTTTGTCGGACATGTGGTTCGAGCGGAATCAGAGGGTC
TTCAATGATATTAGGAAGCCTTGGGAAGTTTGTTTTGAAGCCATGCACCTCAAGGCATCGGCTTGGAGTTCTTTATCAAGGGCATTTACTGGGTTCTCAGTTTCTGATAT
TCAAGTTAATTGGGAGGCTTTTGTTTTTCCTATTTAA
Protein sequence
MWILMHGSLNTADTLQRKLPHMSLLPSVCPLCLQEGESGLHLFVECSFARSCWSKLFREFNLGWVFAGNLKESVFQLLAGPLLSSRGSILWGNAVRGSLSDMWFERNQRV
FNDIRKPWEVCFEAMHLKASAWSSLSRAFTGFSVSDIQVNWEAFVFPI
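
Consistency check (illustrative sketch, not part of the CuGenDBv2 record): the genome location, CDS and protein sequence listed above can be cross-checked with simple arithmetic. The Python below assumes the location string uses 1-based, inclusive coordinates, which is a common convention but is not stated on this page. The function names `parse_location` and `span_length` are hypothetical helpers introduced only for this example. The protein length of 158 residues is counted from the protein sequence above.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a string such as 'LG10:18500448..18500924' into (name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    name, start, end = match.groups()
    return name, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


name, start, end = parse_location("LG10:18500448..18500924")
gene_length = span_length(start, end)

# 158 residues in the protein sequence, plus one stop codon, three bases each.
protein_length = 158
expected_cds_length = 3 * (protein_length + 1)

print(name, gene_length, expected_cds_length)
```

Under that assumption the genomic span and the expected CDS length are both 477 bases, which is consistent with a single-exon gene whose mRNA and CDS sequences are identical, as listed above.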