; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020915 (gene) of Snake gourd v1 genome

Gene IDTan0020915
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionzf-RVT domain-containing protein
Genome locationLG01:42979914..42980390
RNA-Seq ExpressionTan0020915
SyntenyTan0020915
Gene Ontology termsGO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035739.1 hypothetical protein E6C27_scaffold403G00100 [Cucumis melo var. makuwa]2.4e-2237.25Show/hide
Query:  MWILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILV
        +WI++F  +  +E++Q+K P + + PS+C LC K +++  H+F+ C  S   W ++F +F + W F  +L   V+ LL G  L K   ++W    + +L+
Subjt:  MWILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILV

Query:  DIWFERNQRIFQGVAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHA
        +IW ERNQRIF   A        A  L A++WC+L K F  +SI DI LNW+A
Subjt:  DIWFERNQRIFQGVAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHA

KAA0062564.1 GPI-anchor transamidase isoform X1 [Cucumis melo var. makuwa]1.1e-2440.85Show/hide
Query:  VIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILVDIWFERNQRIFQG
        V+QR+L + CL PS C LC +  E    LF  C +S KCW  +  +F V W F G+    +  +L+G  L +   ++WGN  + +L DIWFE NQRIF+G
Subjt:  VIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILVDIWFERNQRIFQG

Query:  VAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHAFI
            W  R +  +  A++WC L K F  +SI D+ +NW AFI
Subjt:  VAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHAFI

TYK21876.1 hypothetical protein E5676_scaffold494G00090 [Cucumis melo var. makuwa]9.6e-2437.42Show/hide
Query:  MWILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILV
        +WI++F  +N +E++Q+K P + + PS+C LC K +++  H+F+ C  S   W ++F +F + W F  +L   V+ LL G  L K   ++W    + +L+
Subjt:  MWILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILV

Query:  DIWFERNQRIFQGVAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHAFI
        +IW ERNQRIF   A        A  L A++WC+L K F  +SI DI LNW+ F+
Subjt:  DIWFERNQRIFQGVAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHAFI

XP_022153214.1 uncharacterized protein LOC111020765 [Momordica charantia]8.4e-2844.59Show/hide
Query:  WILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMG-PCLGKRASVLWGNAVRVILV
        WIL  GKLN A++IQ+K PS  L PS C LC K  E   HLF  C F+ KCW  +F  F V W F     + V  LL G P L      LW N V+ +L 
Subjt:  WILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMG-PCLGKRASVLWGNAVRVILV

Query:  DIWFERNQRIFQGVAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHAFIRS
        ++WFERN R+F+    L++  F + + KAS WC+L  +F   S S I  NW AFI S
Subjt:  DIWFERNQRIFQGVAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHAFIRS

XP_038903695.1 uncharacterized protein LOC120090219 [Benincasa hispida]2.8e-3143.87Show/hide
Query:  MWILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILV
        +WI++FG+LN+AEV+Q+K P+  L P+VC  C   +E   HLF  C +S  CW K+   F +      + +  V  LL  P   K   +LW NAV+ +L 
Subjt:  MWILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILV

Query:  DIWFERNQRIFQGVAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHAFI
        D+WFERNQRIF   A   + R EA + +ASSWC L   F  +S+SD  LNW AFI
Subjt:  DIWFERNQRIFQGVAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHAFI

TrEMBL top hitse value%identityAlignment
A0A5A7T2Y0 zf-RVT domain-containing protein1.1e-2237.25Show/hide
Query:  MWILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILV
        +WI++F  +  +E++Q+K P + + PS+C LC K +++  H+F+ C  S   W ++F +F + W F  +L   V+ LL G  L K   ++W    + +L+
Subjt:  MWILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILV

Query:  DIWFERNQRIFQGVAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHA
        +IW ERNQRIF   A        A  L A++WC+L K F  +SI DI LNW+A
Subjt:  DIWFERNQRIFQGVAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHA

A0A5A7V5N8 GPI-anchor transamidase isoform X15.5e-2540.85Show/hide
Query:  VIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILVDIWFERNQRIFQG
        V+QR+L + CL PS C LC +  E    LF  C +S KCW  +  +F V W F G+    +  +L+G  L +   ++WGN  + +L DIWFE NQRIF+G
Subjt:  VIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILVDIWFERNQRIFQG

Query:  VAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHAFI
            W  R +  +  A++WC L K F  +SI D+ +NW AFI
Subjt:  VAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHAFI

A0A5D3DE60 zf-RVT domain-containing protein4.7e-2437.42Show/hide
Query:  MWILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILV
        +WI++F  +N +E++Q+K P + + PS+C LC K +++  H+F+ C  S   W ++F +F + W F  +L   V+ LL G  L K   ++W    + +L+
Subjt:  MWILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILV

Query:  DIWFERNQRIFQGVAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHAFI
        +IW ERNQRIF   A        A  L A++WC+L K F  +SI DI LNW+ F+
Subjt:  DIWFERNQRIFQGVAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHAFI

A0A6J1DIE2 uncharacterized protein LOC1110207654.1e-2844.59Show/hide
Query:  WILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMG-PCLGKRASVLWGNAVRVILV
        WIL  GKLN A++IQ+K PS  L PS C LC K  E   HLF  C F+ KCW  +F  F V W F     + V  LL G P L      LW N V+ +L 
Subjt:  WILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMG-PCLGKRASVLWGNAVRVILV

Query:  DIWFERNQRIFQGVAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHAFIRS
        ++WFERN R+F+    L++  F + + KAS WC+L  +F   S S I  NW AFI S
Subjt:  DIWFERNQRIFQGVAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHAFIRS

M5XV38 zf-RVT domain-containing protein5.9e-1936.94Show/hide
Query:  MWILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILV
        +W+   G++N  + IQR+ P MCL PS C LC + AE+  HLFI C +S + W+KM     V WV      E +   L     GKRA +L    V  I  
Subjt:  MWILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILV

Query:  DIWFERNQRIFQG-VAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHAFIR
        +IW ERNQRIFQG +    E  ++ ++  AS W ++   F  +  S I  +  A +R
Subjt:  DIWFERNQRIFQG-VAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHAFIR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G45063.1 copper ion binding;electron carriers5.3e-0429.41Show/hide
Query:  PSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILVDIWFERNQRI
        PS+C LC    E+  H+F DC FS + W        V+      + +     L  PC  K+ + +   A +  +  IW ERN R+
Subjt:  PSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILVDIWFERNQRI

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.1e-0728.57Show/hide
Query:  WILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILVD
        W++ + +L+  + +Q    S+   P+ C LC    +S  HLF +C+FS   W   F     +      L +C L+ L+ P   K   ++   A    +  
Subjt:  WILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILVD

Query:  IWFERNQRIFQGVAWLWEARFEAMQL
        IW ERNQR+  GV+   E+  + +QL
Subjt:  IWFERNQRIFQGVAWLWEARFEAMQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGATTCTTATTTTTGGAAAGTTGAATATTGCAGAAGTGATTCAAAGGAAGCTTCCTTCTATGTGCTTACAACCCTCGGTGTGTTTTTTGTGTTACAAGGGAGCTGA
ATCGGGTTTTCATCTCTTCATAGATTGTGAATTTTCTAAGAAATGTTGGTATAAGATGTTTCAAGTTTTTGAGGTTAGTTGGGTATTTGCTGGAAATTTGCAAGAGTGTG
TTCTACATCTGTTGATGGGGCCTTGTCTGGGCAAGAGGGCTTCAGTTTTGTGGGGGAATGCAGTTAGAGTCATCTTGGTGGATATTTGGTTCGAGAGGAATCAACGCATA
TTTCAAGGGGTTGCTTGGTTGTGGGAGGCTCGTTTTGAGGCTATGCAGCTTAAGGCCTCGTCCTGGTGTGCTCTATTCAAAGCGTTCGATGGTTTTTCCATATCAGATAT
TAGATTAAATTGGCATGCTTTTATTCGCTCTCATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGGATTCTTATTTTTGGAAAGTTGAATATTGCAGAAGTGATTCAAAGGAAGCTTCCTTCTATGTGCTTACAACCCTCGGTGTGTTTTTTGTGTTACAAGGGAGCTGA
ATCGGGTTTTCATCTCTTCATAGATTGTGAATTTTCTAAGAAATGTTGGTATAAGATGTTTCAAGTTTTTGAGGTTAGTTGGGTATTTGCTGGAAATTTGCAAGAGTGTG
TTCTACATCTGTTGATGGGGCCTTGTCTGGGCAAGAGGGCTTCAGTTTTGTGGGGGAATGCAGTTAGAGTCATCTTGGTGGATATTTGGTTCGAGAGGAATCAACGCATA
TTTCAAGGGGTTGCTTGGTTGTGGGAGGCTCGTTTTGAGGCTATGCAGCTTAAGGCCTCGTCCTGGTGTGCTCTATTCAAAGCGTTCGATGGTTTTTCCATATCAGATAT
TAGATTAAATTGGCATGCTTTTATTCGCTCTCATTAG
Protein sequenceShow/hide protein sequence
MWILIFGKLNIAEVIQRKLPSMCLQPSVCFLCYKGAESGFHLFIDCEFSKKCWYKMFQVFEVSWVFAGNLQECVLHLLMGPCLGKRASVLWGNAVRVILVDIWFERNQRI
FQGVAWLWEARFEAMQLKASSWCALFKAFDGFSISDIRLNWHAFIRSH